Issues: Lightning-AI/litdata
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Cache directory resolution issues in Google Colab
bug
Something isn't working
help wanted
Extra attention is needed
#126
opened May 8, 2024 by
awaelchli
Optimizing dictionary data structures fails when using a partially initialized function
#120
opened May 6, 2024 by
enrico-stauss
Time per sample grows as processed samples grows
bug
Something isn't working
help wanted
Extra attention is needed
#119
opened May 5, 2024 by
scritter
Slow Dataset Preprocessing due to CPU affinity (?) issues
bug
Something isn't working
help wanted
Extra attention is needed
#118
opened May 2, 2024 by
mgolub2
optimize function on multiple machine writing to local pathes
enhancement
New feature or request
help wanted
Extra attention is needed
#105
opened Apr 22, 2024 by
rakro101
Dataloading is not working when used in litgpt's debug pretraining example
bug
Something isn't working
help wanted
Extra attention is needed
#103
opened Apr 18, 2024 by
iloshchilov
ValueError: buffer size must be a multiple of element size
bug
Something isn't working
help wanted
Extra attention is needed
#102
opened Apr 18, 2024 by
awaelchli
Question: is there a plan to support streaming from GCS?
enhancement
New feature or request
#101
opened Apr 13, 2024 by
dnnspark
Compression using the optimize function from litdata
bug
Something isn't working
help wanted
Extra attention is needed
#97
opened Apr 11, 2024 by
rakro101
GCSFuse mount + Vertex AI custom training jobs support
enhancement
New feature or request
#94
opened Apr 7, 2024 by
miguelalba96
litdata.optimize
accidentally deletes files from the local filesystem
bug
#93
opened Apr 5, 2024 by
hubertsiuzdak
Assert when deserializing Something isn't working
help wanted
Extra attention is needed
no_header_numpy
or no_header_tensor
.
bug
#92
opened Apr 4, 2024 by
ouj
Issue with StreamingDataset when not using all GPUs on host.
bug
Something isn't working
help wanted
Extra attention is needed
#91
opened Apr 3, 2024 by
gkroiz
Prints inside the worker processes mess up the progress bar
bug
Something isn't working
help wanted
Extra attention is needed
#76
opened Mar 24, 2024 by
carmocca
Allow a StreamingDataset to wrap around when running in a CombinedStreamingDataset
enhancement
New feature or request
#74
opened Mar 14, 2024 by
lantiga
litdata with huggingface instead of S3
enhancement
New feature or request
#64
opened Mar 8, 2024 by
ehartford
The tested speed is not as fast as expected.
bug
Something isn't working
help wanted
Extra attention is needed
#60
opened Mar 7, 2024 by
tikboaHIT
Resuming StreamingDataloader with num_workers=0 fails
bug
Something isn't working
#24
opened Feb 26, 2024 by
tchaton
Append data to pre-optimized dataset
enhancement
New feature or request
#23
opened Feb 26, 2024 by
tchaton
Fast random access for New feature or request
StreamingDataset
enhancement
#14
opened Feb 23, 2024 by
ethanwharris
Support New feature or request
StreamingDataLoader
passed to map
enhancement
#13
opened Feb 23, 2024 by
ethanwharris
ProTip!
Mix and match filters to narrow down what you’re looking for.