Commit
Add oversampling strategies to interleave datasets (#4831)
* add a new strategy for interleave_datasets (oversampling strat)
* format code according to the library style
* update interleave_datasets description
* Add correct Error type for a non implemented strategy in interleave_datasets
* correcting an example in the comments
* adding comment to the default case of _interleave_map_style_datasets
* correct the case of oversampling strategy with no probabilities of _interleave_map_style_datasets and add comments
* reformat with datasets's style
* add tests for oversampling strategy in interleave_datasets
* mention of the sampling strategy of interleave_datasets in the documentation of process.mdx
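To make the difference between stopping at the shortest dataset and oversampling concrete, here is a minimal pure-Python sketch on plain lists. It is not the library's implementation: the function name `interleave_lists` and the `oversample` flag are hypothetical names chosen for illustration, and it only covers the case without sampling probabilities, assuming a simple round-robin order.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def interleave_lists(sources: Sequence[Sequence[T]], oversample: bool = False) -> List[T]:
    """Round-robin interleave of several non-empty sequences (illustrative only)."""
    if not sources or any(len(s) == 0 for s in sources):
        raise ValueError("expected at least one source, all non-empty")
    lengths = [len(s) for s in sources]
    # Without oversampling, stop once the shortest source is used up;
    # with oversampling, continue until the longest source is used up.
    rounds = max(lengths) if oversample else min(lengths)
    result: List[T] = []
    for i in range(rounds):
        for source in sources:
            # Wrap around so shorter sources restart from their first item.
            result.append(source[i % len(source)])
    return result


d1, d2, d3 = [0, 1, 2], [10, 11, 12, 13], [20, 21, 22, 23, 24]
undersampled = interleave_lists([d1, d2, d3])
oversampled = interleave_lists([d1, d2, d3], oversample=True)
print(undersampled)
print(oversampled)
```

With oversampling, every item of every source appears at least once, at the cost of repeating items from the shorter sources.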
Showing 4 changed files with 161 additions and 14 deletions.
dc5cb17
Benchmarks (PyArrow==6.0.0):
Benchmark: benchmark_array_xd.json
Benchmark: benchmark_getitem_100B.json
Benchmark: benchmark_indices_mapping.json
Benchmark: benchmark_iterating.json
Benchmark: benchmark_map_filter.json