-
Notifications
You must be signed in to change notification settings - Fork 749
perf: "two-pass" seurat hvg via scanpy.get.aggregate
#4013
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
ilan-gold
wants to merge
85
commits into
main
Choose a base branch
from
ig/two_pass_hvg_v3
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+131
−35
Open
Changes from all commits
Commits
Show all changes
85 commits
Select commit
Hold shift + click to select a range
a625c55
perf: "two-pass" seurat hvg3 via `scanpy.get.aggregate`
ilan-gold d839e98
chore: hvg v3 benchmark
ilan-gold 86db499
fix: use counts
ilan-gold d5a6a78
fix: use a batch key
ilan-gold fdc5653
fix: not again
ilan-gold 8f0e426
fix: `compute` single pass!
ilan-gold 8ad893d
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold 7e0390e
fix: unique
ilan-gold 17be530
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold cc0d67e
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold 96c16e9
chore: add new `dask` benchmark
ilan-gold db4bc2c
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold 478af4a
fix: actually use dask lol
ilan-gold 54db31b
chore: really do dask
ilan-gold 4fe84c5
fix: layers support
ilan-gold 35590a4
fix: no view check needed
ilan-gold db81d6e
fix: no layers eeded
ilan-gold b37444e
fix: reduce number of batches
ilan-gold cf65665
fix: a little bit more
ilan-gold 8f4ef78
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold a7b067d
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold 6f7ad6a
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold e624939
perf: chan's parallel mean-var algorithm for dask
ilan-gold 61332fd
fix: params
ilan-gold 1df5fda
fix: iteration
ilan-gold 9a70581
fix: zarr link
ilan-gold 5313ea2
fix: `median` calculation skipped
ilan-gold e19a7d8
fix: no-batch-key accel
ilan-gold 8482561
fix: don't run all benchmarks with dask
ilan-gold 44606f0
Merge branch 'ig/chan_mean_var_main' into ig/two_pass_hvg_v3
ilan-gold 17f706e
Merge branch 'main' into ig/chan_mean_var_main
ilan-gold a2b390b
chore: relnote
ilan-gold 21f5ddc
perf: welford's algorithm for mean-var
ilan-gold 514bd17
chore: relnote
ilan-gold 48230af
Merge branch 'main' into ig/welford
ilan-gold c2cd368
Merge branch 'main' into ig/chan_mean_var_main
ilan-gold b71eb68
njit support for chan algorithm (#4153)
zboldyga 471b989
Merge branch 'main' into ig/chan_mean_var_main
ilan-gold afc24a1
Merge branch 'main' into ig/welford
ilan-gold 35b0ff6
Merge branch 'main' into ig/welford
ilan-gold f793415
Merge branch 'main' into ig/chan_mean_var_main
ilan-gold 8a57e1c
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold 1f43088
chore: integrate welford's directly into chans
ilan-gold 80a4a94
Apply suggestion from @ilan-gold
ilan-gold 8405ff1
Update benchmarks/benchmarks/preprocessing_counts.py
ilan-gold cd73fe2
fix: correct chan unstable step usable
ilan-gold 05daadb
chore: add cancelling test
ilan-gold 0d25b48
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold e437714
chore: finish sentence
ilan-gold ec72679
chore: relnote
ilan-gold 9dcfcc7
chore: add context
ilan-gold 4f074ed
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold 3fa0884
Merge branch 'main' into ig/welford
ilan-gold cd781f0
chore: bring in dask
ilan-gold 9a6275b
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold bd85e03
perf: less memory touches
ilan-gold 4e0ff1a
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold 497809e
refactor: cleanup
ilan-gold 25e6bfc
chore: csc benchmarks
ilan-gold 3f4b1b4
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold cd9ad03
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1a19312
Merge branch 'main' into ig/welford
ilan-gold c649555
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold 81ae72b
Merge branch 'main' into ig/welford
ilan-gold c7d4166
Merge branch 'ig/welford' into ig/chan_mean_var_main
flying-sheep 9004cc0
fix: tests
ilan-gold cc5ac95
Merge branch 'ig/welford' of github.com:scverse/scanpy into ig/welford
ilan-gold eb03735
Merge branch 'main' into ig/welford
ilan-gold d1ad434
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold ff0ac25
chore: spelling
ilan-gold 6ddd745
Merge branch 'ig/welford' into ig/chan_mean_var_main
ilan-gold 6ebc4b3
Merge branch 'main' into ig/chan_mean_var_main
ilan-gold 3c3c5b0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 65a46ea
chore: clean up `counts` key
ilan-gold 88a0150
Merge branch 'ig/chan_mean_var_main' of github.com:scverse/scanpy int…
ilan-gold 196e443
Merge branch 'main' into ig/chan_mean_var_main
ilan-gold c99d04d
fix: try no dask
ilan-gold 31d42ba
fix: back to dask
ilan-gold 83d8db7
Merge branch 'ig/chan_mean_var_main' into ig/two_pass_hvg_v3
ilan-gold 7bf2db4
fix: no defaults
ilan-gold added47
Merge branch 'ig/chan_mean_var_main' into ig/two_pass_hvg_v3
ilan-gold 06ecaa2
fix: var space
ilan-gold 1302d26
chore: relnote
ilan-gold 761f054
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold 3c87db4
Merge branch 'main' into ig/two_pass_hvg_v3
ilan-gold File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1 @@ | ||
| {func}`scanpy.pp.highly_variable_genes` now does only two passes over the data sequentially for `seurat_v3` flavors, greatly reducing `dask` input usage time {smaller}`I Gold` |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this seems a bit verbose for what it is, don’t we have a helper for that or am I thinking f-a-u?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What aspect of it is verbose?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Creating the intermediates. I think I got confused searching for where they are used after, only to realize they aren’t. But maybe that’s just me.
Would this work or can they be non-ndarrays?