Example to demonstrate inter-parquet-file pipelining using hybrid scan APIs #20722

mhaseeb123 · 2025-11-25T01:50:34Z

Description

This PR adds a new example to demonstrate pipelining when reading parquet sources with the new hybrid scan reader in multithreaded environment.

Checklist

copy-pr-bot · 2025-11-25T01:50:38Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

…aseeb123/cudf into fea/hybrid-scan-pipeline

copy-pr-bot · 2025-12-02T21:02:11Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

…123/cudf into fea/hybrid-scan-pipeline

…d wrappers (#20861) Contributes to #20722 and #20879 This PR replaces the use of `thrust::copy_if` and `thrust::count_if` in Parquet and Hybrid scan readers with custom `CUB` based implementations using pinned memory to copy the result from device. Note: I will create one last PR after this one replacing `thrust` utils with their (CUB based) cudf counterparts in `cudf/detail/utilities/algorithm.cuh` across libcudf. Authors: - Muhammad Haseeb (https://github.com/mhaseeb123) - https://github.com/apps/pre-commit-ci - Vukasin Milovanovic (https://github.com/vuule) Approvers: - Bradley Dice (https://github.com/bdice) - Yunsong Wang (https://github.com/PointKernel) - David Wendt (https://github.com/davidwendt) - Paul Mattione (https://github.com/pmattione-nvidia) - MithunR (https://github.com/mythrocks) URL: #20861

Contributes to #20722 This PR replaces the use of small host vectors with pinned vectors to avoid pageable copies and improve pipeline performance when reading parquet files using multiple threads (each using a separate non-blocking stream) Authors: - Muhammad Haseeb (https://github.com/mhaseeb123) Approvers: - Bradley Dice (https://github.com/bdice) - Nghia Truong (https://github.com/ttnghia) - Vukasin Milovanovic (https://github.com/vuule) URL: #20820

mhaseeb123 · 2026-01-10T00:12:27Z

cpp/src/io/parquet/reader_impl.cpp

  cudf::detail::cuda_memcpy_async(
-    cudf::host_span<size_t>(h_initial_str_offsets.data(), initial_str_offsets.size()),
-    cudf::device_span<size_t const>(initial_str_offsets.data(), initial_str_offsets.size()),
+    cudf::host_span<size_t>{h_initial_str_offsets.data(), initial_str_offsets.size()},


Simply using {} instead of ()

mhaseeb123 · 2026-01-10T00:15:06Z

cpp/src/io/parquet/reader_impl.cpp

-  auto host_null_masks = std::vector<bitmask_type*>{};
-  auto host_begin_bits = std::vector<cudf::size_type>{};
-  auto host_end_bits   = std::vector<cudf::size_type>{};
+  auto null_masks = std::vector<bitmask_type*>{};


Simply changed host_xx to xx and instead using a pinned_ prefix for the pinned versions below

mhaseeb123 · 2026-01-10T00:15:20Z

cpp/src/io/parquet/predicate_pushdown.cpp

      return bitmask;
    } else {
-      auto bitmask = cudf::detail::make_host_vector<bitmask_type>(num_bitmasks, stream);
+      auto bitmask = cudf::detail::make_pinned_vector_async<bitmask_type>(num_bitmasks, stream);


Added the missed host to pinned conversion.

mhaseeb123 · 2026-01-10T00:15:48Z

cpp/examples/parquet_io/parquet_io_multithreaded.cpp

  auto resource               = create_memory_resource(is_pool_used);
  auto default_stream         = cudf::get_default_stream();
-  auto stream_pool            = rmm::cuda_stream_pool(thread_count);
+  auto stream_pool = rmm::cuda_stream_pool(thread_count, rmm::cuda_stream::flags::non_blocking);


Create non-blocking streams

mhaseeb123 · 2026-01-10T00:16:03Z

cpp/examples/parquet_io/parquet_io_multithreaded.cpp

-               "input source == thread count\n";
-  for (size_t idx = 0; thread_count > static_cast<int>(parquet_files.size()); idx++) {
-    parquet_files.emplace_back(parquet_files[idx % initial_size]);
+  if (parquet_files.size() < thread_count) {


Only print that we are appending the sources if we need to

mhaseeb123 · 2026-01-10T00:50:42Z

pre-commit.ci autofix

mhaseeb123 · 2026-01-13T03:34:37Z

pre-commit.ci autofix

mhaseeb123 added 4 commits November 25, 2025 01:35

Add example code

3a03f3a

Use multithreaded setup_page_index in hybrid scan reader

f7ce471

Merge branch 'main' into fea/multithreaded-setup-pgidx

034983a

style fix

8b85949

github-actions bot assigned mhaseeb123 Nov 25, 2025

github-actions bot added libcudf Affects libcudf (C++/CUDA) code. CMake CMake build issue labels Nov 25, 2025

mhaseeb123 added this to libcudf Nov 25, 2025

mhaseeb123 added feature request New feature or request 2 - In Progress Currently a work in progress non-breaking Non-breaking change cuIO cuIO issue labels Nov 25, 2025

mhaseeb123 and others added 5 commits November 24, 2025 17:51

Merge branch 'main' into fea/hybrid-scan-pipeline

3fde08a

Minor improvements

e9e8976

Style fix

8b2e81a

Merge branch 'main' into fea/hybrid-scan-pipeline

5d36faa

Merge branch 'fea/multithreaded-setup-pgidx' of https://github.com/mh…

c3050f4

…aseeb123/cudf into fea/hybrid-scan-pipeline

mhaseeb123 and others added 11 commits December 2, 2025 21:03

Updates

dabadfa

Don't use a pool

a5d51a9

Remove extra filter exprs

ba47f3f

Remove unneeded stream syncs

5250102

Make pipelining great again

1517ffe

Make pipelining great again

73e537d

Merge branch 'fea/hybrid-scan-pipeline' of https://github.com/mhaseeb…

dc7554e

…123/cudf into fea/hybrid-scan-pipeline

Make another pageable copy a pinned one

995c4f0

More pageable to pinned small copies

fee9e1b

Flush updates

26b4d94

Minor

ab30655

Merge branch 'main' into fea/hybrid-scan-pipeline

4b54094

Merge branch 'main' into fea/hybrid-scan-pipeline

3e00d0a

mhaseeb123 changed the title ~~Example to demonstrate pipelining with the hybrid scan reader~~ Example to demonstrate inter-parquet-file pipelining using hybrid scan APIs Jan 5, 2026

mhaseeb123 added 2 commits January 5, 2026 18:42

Merge branch 'main' into fea/hybrid-scan-pipeline

4d8a299

Merge branch 'main' into fea/hybrid-scan-pipeline

2dcc3ad

mhaseeb123 removed the DO NOT MERGE Hold off on merging; see PR for details label Jan 9, 2026

mhaseeb123 added 3 commits January 9, 2026 15:44

Merge branch 'main' into fea/hybrid-scan-pipeline

e45aba4

Merge

8939c3c

Merge

7348692

mhaseeb123 commented Jan 10, 2026

View reviewed changes

Minor improvements

50d14b6

pre-commit-ci bot and others added 3 commits January 10, 2026 00:51

[pre-commit.ci] auto code formatting

7a7396a

Update copyright years

eec9f6e

Merge branch 'main' into fea/hybrid-scan-pipeline

0aa6af3

mhaseeb123 added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Jan 10, 2026

mhaseeb123 marked this pull request as ready for review January 10, 2026 00:59

mhaseeb123 requested review from a team as code owners January 10, 2026 00:59

mhaseeb123 requested review from ttnghia and vyasr January 10, 2026 00:59

Minor bug fix

534bff8

GregoryKimball moved this to Burndown in libcudf Jan 12, 2026

pre-commit-ci bot and others added 3 commits January 13, 2026 03:35

[pre-commit.ci] auto code formatting

aed6c39

Merge branch 'main' into fea/hybrid-scan-pipeline

5d2423f

Merge branch 'main' into fea/hybrid-scan-pipeline

fe5e3be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Example to demonstrate inter-parquet-file pipelining using hybrid scan APIs #20722

Example to demonstrate inter-parquet-file pipelining using hybrid scan APIs #20722

Uh oh!

mhaseeb123 commented Nov 25, 2025 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Nov 25, 2025

Uh oh!

copy-pr-bot bot commented Dec 2, 2025

Uh oh!

mhaseeb123 Jan 10, 2026

Uh oh!

mhaseeb123 Jan 10, 2026

Uh oh!

mhaseeb123 Jan 10, 2026

Uh oh!

mhaseeb123 Jan 10, 2026

Uh oh!

mhaseeb123 Jan 10, 2026

Uh oh!

mhaseeb123 commented Jan 10, 2026

Uh oh!

mhaseeb123 commented Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Example to demonstrate inter-parquet-file pipelining using hybrid scan APIs #20722

Are you sure you want to change the base?

Example to demonstrate inter-parquet-file pipelining using hybrid scan APIs #20722

Uh oh!

Conversation

mhaseeb123 commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

copy-pr-bot bot commented Nov 25, 2025

Uh oh!

copy-pr-bot bot commented Dec 2, 2025

Uh oh!

mhaseeb123 Jan 10, 2026

Choose a reason for hiding this comment

Uh oh!

mhaseeb123 Jan 10, 2026

Choose a reason for hiding this comment

Uh oh!

mhaseeb123 Jan 10, 2026

Choose a reason for hiding this comment

Uh oh!

mhaseeb123 Jan 10, 2026

Choose a reason for hiding this comment

Uh oh!

mhaseeb123 Jan 10, 2026

Choose a reason for hiding this comment

Uh oh!

mhaseeb123 commented Jan 10, 2026

Uh oh!

mhaseeb123 commented Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mhaseeb123 commented Nov 25, 2025 •

edited

Loading