
wenet-e2e/WenetSpeech-Family


WenetSpeech-Family

WenetSpeech is a family of speech datasets. "We" stands for co-creation and win-win, "Net" for the internet, and "Speech" for speech data.

Family Overview

The family currently includes the following datasets, and more will be added in the future.

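| Dataset | Year | Hours | Paper | GitHub | Hugging Face | WeChat article |
| --- | --- | --- | --- | --- | --- | --- |
| WenetSpeech | 2021 | 10,000 | paper | github | data | blog |
| WenetSpeech4TTS | 2024 | 12,800 | paper | / | data | blog |
| WenetSpeech-Yue | 2025 | 21,800 | paper | github | data | blog |
| WenetSpeech-Chuan | 2025 | 10,000 | paper | github | data | blog |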
