[examples] add libritts recipe #56

yinhao0214 · 2025-09-13T08:03:27Z

Add LibriTTS recipe.

robin1001 · 2025-09-13T09:50:02Z

west/models/touch_flow/extractor_touch_flow.py

    def extract(self, item):
        import s3tokenizer
+        waveform, sample_rate = torchaudio.load(item['wav'])
+        item['wav'] = waveform


These two lines are no longer needed; the code below can use waveform and sample_rate directly:

    audio = torchaudio.transforms.Resample(sample_rate, 16000)(waveform)
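
A minimal sketch of this suggestion, assuming the extractor only needs a 16 kHz tensor and does not have to write the waveform back into `item`. The helper name `load_16k` and the `target_sr` parameter are hypothetical and not part of west; only `torchaudio.load` and `torchaudio.transforms.Resample` come from the diff above.

```python
def load_16k(wav_path, target_sr=16000):
    """Load an audio file and resample it to target_sr (hypothetical helper)."""
    import torchaudio  # imported lazily, like s3tokenizer in extract()

    waveform, sample_rate = torchaudio.load(wav_path)
    # Use the local variables directly instead of storing them back in `item`.
    if sample_rate != target_sr:
        resample = torchaudio.transforms.Resample(sample_rate, target_sr)
        waveform = resample(waveform)
    return waveform
```

Inside `extract`, this would be called as `audio = load_16k(item['wav'])`, leaving `item['wav']` as the original path.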

robin1001 · 2025-09-13T09:50:19Z

west/models/touch_tts/extractor_touch_tts.py

    def extract(self, item):
        import s3tokenizer
        IGNORE_TOKEN_ID = LabelSmoother.ignore_index
+        waveform, sample_rate = torchaudio.load(item['wav'])


robin1001 · 2025-09-13T09:51:40Z

tools/whisper_asr.py

+en_tn_model = EnNormalizer(overwrite_cache=False)
+# ASR model
+model = whisper.load_model(
+    "/jfs-hdfs/user/xingchen.song/share/whisper/large-v3-turbo.pt"


Change this to large-v3-turbo so that it is downloaded automatically when users run the script.
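
A sketch of how the hard-coded path could be replaced, assuming the openai-whisper package, whose `whisper.load_model` accepts either an official model name (downloaded on first use) or a path to a local checkpoint file. The `WHISPER_MODEL` environment variable and both function names are hypothetical, added here only so that a local checkpoint can still be used without editing the script.

```python
import os

DEFAULT_WHISPER_MODEL = "large-v3-turbo"


def resolve_whisper_model(env=None):
    """Return the value of WHISPER_MODEL if set (hypothetical), else the default name."""
    env = os.environ if env is None else env
    return env.get("WHISPER_MODEL") or DEFAULT_WHISPER_MODEL


def load_asr_model(env=None):
    import whisper  # openai-whisper, imported lazily

    # A model name triggers an automatic download; a file path is loaded as-is.
    return whisper.load_model(resolve_whisper_model(env))
```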

robin1001 · 2025-09-13T09:54:25Z

tools/compute_similarity.py

+import wespeaker
+
+model = wespeaker.load_model(
+    model_dir="/jfs-hdfs/user/binbin.zhang/models/wespeaker/chinese"


Also change this to a model name that wespeaker can download automatically. CC @cdliang11
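
A sketch of what this could look like, assuming `wespeaker.load_model` accepts a downloadable model name such as `"chinese"` positionally and that the returned object exposes `compute_similarity`, as shown in the WeSpeaker README. Which model name suits LibriTTS is not stated in this thread, so the default below simply mirrors the original path; the wrapper names are hypothetical.

```python
def load_speaker_model(name="chinese"):
    """Load a speaker model by name so it is fetched automatically (hypothetical wrapper)."""
    import wespeaker  # imported lazily

    # Assumption: a model name, not a local model_dir, is passed here.
    return wespeaker.load_model(name)


def score_pair(model, wav_a, wav_b):
    """Return the speaker similarity between two wav files."""
    return model.compute_similarity(wav_a, wav_b)
```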

robin1001 · 2025-09-13T09:56:15Z

examples/libritts/tts/conf/touch_tts_config.json

@@ -0,0 +1,7 @@
+{
+  "llm_model_name_or_path": "/bucket/output/jfs-hdfs/user/binbin.zhang/github/west/examples/aishell/tts/model/Qwen2.5-0.5B-Audio",


This model is derived from Qwen2.5-0.5B by adding 4096 speech tokens. We will add the script that generates it in a follow-up as well.
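
Until that script lands, here is one way such a model might be produced with Hugging Face transformers. This is a sketch under stated assumptions, not the authors' procedure: the token format `<|speech_{i}|>`, the hub id `Qwen/Qwen2.5-0.5B` as the base, the output directory, and both function names are all hypothetical.

```python
def speech_tokens(n=4096, template="<|speech_{}|>"):
    """Build n distinct speech token strings (the token format is an assumption)."""
    return [template.format(i) for i in range(n)]


def extend_with_speech_tokens(base="Qwen/Qwen2.5-0.5B",
                              out_dir="Qwen2.5-0.5B-Audio",
                              n=4096):
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base)
    model = AutoModelForCausalLM.from_pretrained(base)
    # Register the new tokens, then resize the embeddings to the new vocabulary size.
    added = tokenizer.add_tokens(speech_tokens(n))
    model.resize_token_embeddings(len(tokenizer))
    tokenizer.save_pretrained(out_dir)
    model.save_pretrained(out_dir)
    return added
```

The resulting directory could then be referenced from `llm_model_name_or_path` in place of the absolute path above.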

robin1001 · 2025-09-13T09:58:22Z

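Please add a README in a follow-up, mainly with two sections, Tutorial and Results; see aishell for reference. Let's merge this PR first and address the remaining comments in follow-up PRs.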

yinhao0214 added 2 commits September 13, 2025 07:59

[example] add libritts recipe

883befd

[fix] fix conflicts

6b0db1d

yinhao0214 marked this pull request as ready for review September 13, 2025 08:30

yinhao0214 requested a review from robin1001 September 13, 2025 08:30

robin1001 reviewed Sep 13, 2025


robin1001 merged commit 1f3d703 into main Sep 13, 2025

robin1001 deleted the feat-add-libritts-recipe branch September 13, 2025 09:58

cdliang11 mentioned this pull request Sep 15, 2025

[tools] automatically download speaker models #57

Merged


3 participants
