Managing large hierarchical datasets with PyTables

Walter C. Koerner Library, 1958 Main Mall, Vancouver, British Columbia

Abstract: PyTables is a free and open-source Python library for managing large hierarchical datasets. It is built on top of NumPy and the HDF5 scientific dataset library, and it focuses on both performance and interactive analysis of very large datasets. For large data streams (such as multi-dimensional arrays or billions of records), it outperforms databases in […]