Feature: Implement multithreaded reads in hdf5 by OmkarAgnihotri · Pull Request #6167 · HDFGroup/hdf5

OmkarAgnihotri · 2026-01-22T14:08:41Z

This Pull Request implements a Proof-Of-Concept for multithreaded chunk
reads for hyperslab selections over chunked datasets in hdf5.

The high level idea is to bypass the thread unsafe chunk cache (H5D_rdcc_t)
and the page buffer (H5PB_t).

This works in two phases:

Serial phase
- Accumulate chunk addresses/offsets
Parallel phase
- Read raw data for the given chunks
- Apply filter pipelines
- Copy data to appropriate locations in the memory space

Important

Implements multithreaded chunk reads for hyperslab selections in HDF5, bypassing thread-unsafe components and using OpenMP for parallel processing.

Behavior:
- Implements multithreaded chunk reads for hyperslab selections in H5D__chunk_read() in H5Dchunk.c.
- Bypasses thread-unsafe H5D_rdcc_t and H5PB_t.
- Uses OpenMP for parallel processing.
Structures:
- Adds H5D_chunk_info_light_t in H5Dpkg.h for parallel I/O operations.
Parallel Processing:
- Serial phase: Accumulates chunk addresses/offsets.
- Parallel phase: Reads raw data, applies filters, and copies data to memory.

^{This description was created by}^{for 8cc195a. You can customize this summary. It will automatically update as commits are pushed.}

This Pull Request implements a Proof-Of-Concept for multithreaded chunk reads for hyperslab selections over chunked datasets in hdf5. The high level idea is to bypass the thread unsafe chunk cache (`H5D_rdcc_t`) and the page buffer (`H5PB_t`). This works in two phases: 1. Serial phase - Accumulate chunk addresses/offsets 2. Parallel phase - Read raw data for the given chunks - Apply filter pipelines - Copy data to appropriate locations in the memory space

gheber · 2026-01-22T14:20:06Z

Great work @OmkarAgnihotri .

Just to clarify the use case/assumptions:

You are not looking for multiple application threads calling the public API (H5Dread), i.e., your proposal is something that would be supported in non-thread-safe library builds.
Fixed‑size datatype, no conversion/xform.
Simple hyperslab selections (no point selections).
Read‑only.
No filters (or only filters that are known re‑entrant and don’t touch global state).
Bypass the chunk cache and metadata cache where possible.

Is that a fair description?

OmkarAgnihotri · 2026-01-22T14:43:27Z

Just to clarify the use case/assumptions:

You are not looking for multiple application threads calling the public API (H5Dread), i.e., your proposal is something that would be supported in non-thread-safe library builds.

Fixed‑size datatype, no conversion/xform.

Simple hyperslab selections (no point selections).

Read‑only.

No filters (or only filters that are known re‑entrant and don’t touch global state).

Bypass the chunk cache and metadata cache where possible.

Is that a fair description?

Yes, all the 6 points exactly describe the use case. Thanks @gheber for the clear summary.

gheber · 2026-01-22T14:52:50Z

I'd add:

The VFD used is known to be re‑entrant for concurrent reads (which sec2 is).
The OMP parallel region does only read + memcpy on disjoint regions.

OmkarAgnihotri requested review from bmribler, brtnfld, byrnHDF, derobins, fortnern, glennsong09, jhendersonHDF, lrknox, mattjala, qkoziol and vchoi-hdfgroup as code owners January 22, 2026 14:08

github-project-automation bot added this to HDF5 - TRIAGE & TRACK Jan 22, 2026

github-project-automation bot moved this to To be triaged in HDF5 - TRIAGE & TRACK Jan 22, 2026

nbagha1 marked this pull request as draft January 22, 2026 19:20

brtnfld assigned fortnern Jan 22, 2026

brtnfld added the Component - C Library Core C library issues (usually in the src directory) label Jan 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

Feature: Implement multithreaded reads in hdf5#6167

Feature: Implement multithreaded reads in hdf5#6167
OmkarAgnihotri wants to merge 1 commit intoHDFGroup:developfrom
OmkarAgnihotri:parallel_reads

OmkarAgnihotri commented Jan 22, 2026 •

edited by ellipsis-dev bot

Loading

Uh oh!

gheber commented Jan 22, 2026 •

edited

Loading

Uh oh!

OmkarAgnihotri commented Jan 22, 2026

Uh oh!

gheber commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Comments

Conversation

OmkarAgnihotri commented Jan 22, 2026 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gheber commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

OmkarAgnihotri commented Jan 22, 2026

Uh oh!

gheber commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

OmkarAgnihotri commented Jan 22, 2026 •

edited by ellipsis-dev bot

Loading

gheber commented Jan 22, 2026 •

edited

Loading