-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Issue with offline mode and partial dataset cached #7551
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
It seems the problem comes from builder.py / create_config_id() On the first call, when the cache is empty we have
leading to config_id beeing 'default-2935e8cdcc21c613' then, on the second call,
thus explaining why the hash is not the same, despite having the same parameter when calling load_dataset : data_files="fr/fr_part_00038.parquet" |
Same behavior with version 3.5.1 |
Same issue when loading |
I have identified that the issue indeed lies in the A possible solution might be to standardize the formatting of |
Uh oh!
There was an error while loading. Please reload this page.
Describe the bug
Hi,
a issue related to #4760 here when loading a single file from a dataset, unable to access it in offline mode afterwards
Steps to reproduce the bug
Once the file has been cached, I rerun with the HF_HUB_OFFLINE activated an get this error :
Expected behavior
Should be able to access the previously cached files
Environment info
datasets
version: 3.2.0huggingface_hub
version: 0.27.0fsspec
version: 2024.3.1The text was updated successfully, but these errors were encountered: