Open
Description
Our tutorial on how to create a dataset is a bit misleading.
- In Folder-based builders section it says that we have two folder-based builders as standard builders, but we also have similar builders (that can be created from directory with data of required format) for
csv
,json/jsonl
,parquet
andtxt
files. We have info about these loaders in separate guide for loading but it's worth briefly mentioning them in the beginning tutorial because they are more common and for consistency. Would be helpful to add the link to the full guide. - From local files section lists methods for creating a dataset from in-memory data which are also described in loading guide.
Maybe we should actually rethink and restructure this tutorial somehow.