Skip to content

Conversation

@yinhao0214
Copy link
Collaborator

Add Libritts recipe

@yinhao0214 yinhao0214 marked this pull request as ready for review September 13, 2025 08:30
def extract(self, item):
import s3tokenizer
waveform, sample_rate = torchaudio.load(item['wav'])
item['wav'] = waveform
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这两行不需要了,下面的直接用 waveform 和 sample_rate 就可以

audio = torchaudio.transforms.Resample(sample_rate,
16000)(waveform)

def extract(self, item):
import s3tokenizer
IGNORE_TOKEN_ID = LabelSmoother.ignore_index
waveform, sample_rate = torchaudio.load(item['wav'])
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同上

en_tn_model = EnNormalizer(overwrite_cache=False)
# ASR model
model = whisper.load_model(
"/jfs-hdfs/user/xingchen.song/share/whisper/large-v3-turbo.pt"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个改成 large-v3-turbo,用户使用时自己自动下载

import wespeaker

model = wespeaker.load_model(
model_dir="/jfs-hdfs/user/binbin.zhang/models/wespeaker/chinese"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个也改成 wespeaker 可自动下载的模型名,CC @cdliang11

@@ -0,0 +1,7 @@
{
"llm_model_name_or_path": "/bucket/output/jfs-hdfs/user/binbin.zhang/github/west/examples/aishell/tts/model/Qwen2.5-0.5B-Audio",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个是在 Qwen2.5-0.5B 基础上增加 4096 个 speech token 来的,这个模型的生成我们后续也加进来。

@robin1001
Copy link
Collaborator

后续增加一下 README,主要有 Tutorial 和 Results 两部分,参考 aishell。这个 PR 我我们先合并,一些 comment 我们放到后续的 PR 里。

@robin1001 robin1001 merged commit 1f3d703 into main Sep 13, 2025
@robin1001 robin1001 deleted the feat-add-libritts-recipe branch September 13, 2025 09:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants