Skip to content

Improve Create a dataset tutorial  #5805

Open
@polinaeterna

Description

@polinaeterna

Our tutorial on how to create a dataset is a bit misleading.

  1. In Folder-based builders section it says that we have two folder-based builders as standard builders, but we also have similar builders (that can be created from directory with data of required format) for csv, json/jsonl, parquet and txt files. We have info about these loaders in separate guide for loading but it's worth briefly mentioning them in the beginning tutorial because they are more common and for consistency. Would be helpful to add the link to the full guide.
  2. From local files section lists methods for creating a dataset from in-memory data which are also described in loading guide.

Maybe we should actually rethink and restructure this tutorial somehow.

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions