Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6743809
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 26, 20262026-05-26T12:00:25+00:00 2026-05-26T12:00:25+00:00

There seems to be many choices for Python to interface with SQLite (sqlite3, atpy)

  • 0

There seems to be many choices for Python to interface with SQLite (sqlite3, atpy) and HDF5 (h5py, pyTables) — I wonder if anyone has experience using these together with numpy arrays or data tables (structured/record arrays), and which of these most seamlessly integrate with “scientific” modules (numpy, scipy) for each data format (SQLite and HDF5).

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-26T12:00:25+00:00Added an answer on May 26, 2026 at 12:00 pm

    Most of it depends on your use case.

    I have a lot more experience dealing with the various HDF5-based methods than traditional relational databases, so I can’t comment too much on SQLite libraries for python…

    At least as far as h5py vs pyTables, they both offer very seamless access via numpy arrays, but they’re oriented towards very different use cases.

    If you have n-dimensional data that you want to quickly access an arbitrary index-based slice of, then it’s much more simple to use h5py. If you have data that’s more table-like, and you want to query it, then pyTables is a much better option.

    h5py is a relatively “vanilla” wrapper around the HDF5 libraries compared to pyTables. This is a very good thing if you’re going to be regularly accessing your HDF file from another language (pyTables adds some extra metadata). h5py can do a lot, but for some use cases (e.g. what pyTables does) you’re going to need to spend more time tweaking things.

    pyTables has some really nice features. However, if your data doesn’t look much like a table, then it’s probably not the best option.

    To give a more concrete example, I work a lot with fairly large (tens of GB) 3 and 4 dimensional arrays of data. They’re homogenous arrays of floats, ints, uint8s, etc. I usually want to access a small subset of the entire dataset. h5py makes this very simple, and does a fairly good job of auto-guessing a reasonable chunk size. Grabbing an arbitrary chunk or slice from disk is much, much faster than for a simple memmapped file. (Emphasis on arbitrary… Obviously, if you want to grab an entire “X” slice, then a C-ordered memmapped array is impossible to beat, as all the data in an “X” slice are adjacent on disk.)

    As a counter example, my wife collects data from a wide array of sensors that sample at minute to second intervals over several years. She needs to store and run arbitrary querys (and relatively simple calculations) on her data. pyTables makes this use case very easy and fast, and still has some advantages over traditional relational databases. (Particularly in terms of disk usage and speed at which a large (index-based) chunk of data can be read into memory)

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

There seem to be many excellent web frameworks for Python. Has anyone used any
There seems to be too many attributes/parameters in CSS... I want to know all
There seems to be two major conventions for organizing project files and then many
Sorry if this question seems trivial to many here. In a C++ code there
There seem to be many ways to define singletons in Python. Is there a
There are many open sourced parser implementations available to us in Haskell. Parsec seems
There doesn't seem to be many choices out there for running functional tests on
There seem to many ways to skin this particular cat - but which is
There seem to be many options to create an XML document in .NET. What's
There seem to be many questions asked about this subject here on stackoverflow, but

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.