-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Implement groupby all/any via bool-coercion + min/max #22371
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
galipremsagar
merged 41 commits into
rapidsai:pandas3
from
galipremsagar:groupby_bool_reduce
May 13, 2026
+121
−39
Merged
Changes from all commits
Commits
Show all changes
41 commits
Select commit
Hold shift + click to select a range
b288bbc
Implement groupby ``all``/``any`` via bool-coercion + min/max
galipremsagar 8992d39
Apply suggestions from code review
galipremsagar 8285223
Reject pd.NA string-to-object casts in pandas-compatible mode (#22295)
galipremsagar ca6dddc
Remove legacy Dask-based streaming backends (#22358)
madsbk 8995085
Fix StatsCollector.serialize to use value equality instead of object …
Matt711 5255d51
Pass managed pool MR explicitly in NDSH parquet data generation (#22344)
vuule 9407fd6
Fix compile warnings in libcudf examples (#22335)
davidwendt 0e82b62
Add skip axis to all join benchmarks (#22241)
PointKernel c2f583c
Expose __from_arrow__ on masked extension dtype proxies (#22373)
galipremsagar 5c4c21a
Fix datetime resolution for empty data inputs (#22363)
galipremsagar 31732df
Expose additional proxy attributes for IntervalArray, Styler, and _Me…
galipremsagar 4aa57e5
Multi-rank fixes for cudf-polars streaming (#22361)
madsbk aa23377
Fix reading of large CSV files (>64MB) (#22375)
vuule 4aa13f1
Add decimal128 to groupby_max_cardinality benchmark (#22162)
PointKernel c5cb03b
`StreamingEngine._reset()` (#22364)
madsbk 8bdabe7
Validate PDS-DS Q1 (#22389)
Matt711 e304ffd
Improve hstack lowering (#22353)
rjzamora 9edc7dc
Replace `LD_PRELOAD` hack with compute-sanitizer (#22290)
KyleFromNVIDIA 3700502
Run all nvbench benchmarks with timeout in smoketest (#20538)
bdice f0b2a42
Fix a crash in the ORC reader with malformed stripe footers (#22383)
vuule df9ea24
Rename build/probe to right/left in hash_join and distinct_hash_join …
PointKernel d9195b6
remove pylibcudf calls
galipremsagar c84f036
Merge branch 'pandas3' into groupby_bool_reduce
galipremsagar 7d7bd35
Merge branch 'pandas3' into groupby_bool_reduce
galipremsagar e9dd32b
Update python/cudf/cudf/core/groupby/groupby.py
galipremsagar e03db07
Correctly handle blocks with "block byte size" fields in the Avro rea…
vuule 62c8c5a
Use `token.rapids.nvidia.com` when issuing S3 bucket creds in devcont…
trxcllnt 6ffe708
Use static cudart by default (#22397)
KyleFromNVIDIA 6598b63
Fix `to_array` to return non-corrupted data (#22342)
galipremsagar aa0a707
Use cudaStream_t instead of cuda_stream_view in pylibcudf Cython (#22…
vyasr b45c5aa
Use `language: script` for cudf-polars-ir-signatures pre-commit hook …
vyasr 8a0d5f9
Fix potential errors in Parquet page header decode (#22274)
mhaseeb123 be40780
Make RapidsMPF the default runtime for cudf_polars streaming executor…
mroeschke f49d5e8
Use thread pool to submit hybrid scan host IO tasks (#21992)
mhaseeb123 50cee5b
Python bindings and pytests for `cudf::apply_deletion_mask` (#22145)
mhaseeb123 47b699d
Refactor ``sort_actor`` to prepare for ``OrderScheme`` changes (#22350)
rjzamora 996eb35
Run the cudf-polars test suite against `DaskEngine` and `RayEngine` (…
madsbk 7a120b7
Address reviews
galipremsagar 4dcb025
Merge branch 'pandas3' into groupby_bool_reduce
galipremsagar 48c4ccd
Merge
galipremsagar 3d9f864
Merge
galipremsagar File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just confirming, is
np.dtype(np.bool_)return regardless of the pandas string type?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes: