Manual Configuration
Five datasets showcasing YAML configuration on the Hugging Face Hub. See the docs: https://huggingface.co/docs/hub/datasets-manual-configuration
Note: If you have multiple files and want to define which file goes into which split, you can use YAML at the top of your README.md.
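As a minimal sketch of what such a README.md header might look like, the block below maps one file to each split. The file names `data.csv` and `holdout.csv` are hypothetical placeholders, not names taken from the dataset itself.

```yaml
---
configs:
- config_name: default
  data_files:
  # File names below are illustrative assumptions.
  - split: train
    path: "data.csv"
  - split: test
    path: "holdout.csv"
---
```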
datasets-examples/doc-yaml-2
Note: You can select multiple files per split using a list of paths.
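A sketch of the list form, assuming a hypothetical layout with two training files under `data/` and one test file under `holdout/`:

```yaml
---
configs:
- config_name: default
  data_files:
  - split: train
    # A list of paths assigns several files to one split.
    # These paths are illustrative assumptions.
    path:
    - "data/abc.csv"
    - "data/def.csv"
  - split: test
    path: "holdout/ghi.csv"
---
```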
datasets-examples/doc-yaml-3
Note: Or you can use glob patterns to automatically list all the files you need.
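The same idea with glob patterns might look like the sketch below. The directory names `data/` and `holdout/` are assumptions made for illustration.

```yaml
---
configs:
- config_name: default
  data_files:
  # Each pattern matches every CSV file in its (hypothetical) directory.
  - split: train
    path: "data/*.csv"
  - split: test
    path: "holdout/*.csv"
---
```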
datasets-examples/doc-yaml-4
Note: Your dataset might have several subsets of data that you want to be able to use separately. For example, each configuration has its own dropdown in the Dataset Viewer on the Hugging Face Hub. In that case, you can define a list of configurations inside the configs field in YAML.
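A possible shape for a two-configuration header is sketched below. The configuration names `main_data` and `additional_data` and their file names are hypothetical.

```yaml
---
configs:
# Each entry under configs defines one separately loadable subset.
# Names and files here are illustrative assumptions.
- config_name: main_data
  data_files: "main_data.csv"
- config_name: additional_data
  data_files: "additional_data.csv"
---
```

With a header like this, a subset would typically be selected by passing its configuration name as the second argument to `load_dataset` in the `datasets` library.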
datasets-examples/doc-yaml-5
Note: Besides data_files, other builder-specific parameters can be passed via YAML, which allows more flexibility in how the data is loaded without requiring any custom code. For example, you can define which separator to use in each configuration to load your CSV files.
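As a sketch of the separator case, the header below assumes one tab-separated and one comma-separated file. The configuration and file names are hypothetical; `sep` is the CSV builder parameter being set.

```yaml
---
configs:
# Names and files here are illustrative assumptions.
- config_name: tab
  data_files: "main_data.csv"
  sep: "\t"   # tab-separated values
- config_name: comma
  data_files: "additional_data.csv"
  sep: ","    # comma-separated values
---
```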