@@ -45,31 +45,32 @@ vox-box start --huggingface-repo-id Systran/faster-whisper-small --data-dir C:\U
45
45
46
46
## Supported Models
47
47
48
- | Model | Type | Link | Verified Platforms |
49
- | ------------------------------- | -------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------- |
50
- | Faster-whisper-large-v3 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-large-v3 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-large-v3 ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
51
- | Faster-whisper-large-v2 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-large-v2 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-large-v2 ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
52
- | Faster-whisper-large-v1 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-large-v1 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-large-v1 ) | |
53
- | Faster-whisper-medium | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-medium ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-medium ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
54
- | Faster-whisper-medium.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-medium.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-medium.en ) | |
55
- | Faster-whisper-small | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-small ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-small ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
56
- | Faster-whisper-small.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-small.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-small.en ) | |
57
- | Faster-distil-whisper-large-v3 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-distil-whisper-large-v3 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-distil-whisper-large-v3 ) | MacOS ✅ ; |
58
- | Faster-distil-whisper-large-v2 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-distil-whisper-large-v2 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-distil-whisper-large-v2 ) | MacOS ✅ ; |
59
- | Faster-distil-whisper-medium.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-distil-whisper-medium.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-distil-whisper-medium.en ) | |
60
- | Faster-whisper-tiny | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-tiny ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-tiny ) | |
61
- | Faster-whisper-tiny.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-tiny.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-tiny.en ) | |
62
- | Paraformer-zh | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/paraformer-zh ) , [ ModelScope] ( https://www.modelscope.cn/models/iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch ) | |
63
- | Paraformer-zh-streaming | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/paraformer-zh-streaming ) , [ ModelScope] ( https://modelscope.cn/models/iic/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-online ) | Linux ✅ ; , MacOS ✅ ; |
64
- | Paraformer-en | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/paraformer-en ) , [ ModelScope] ( https://www.modelscope.cn/models/iic/speech_paraformer-large-vad-punc_asr_nat-en-16k-common-vocab10020 ) | |
65
- | Conformer-en | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/conformer-en ) , [ Modelscope] ( https://modelscope.cn/models/iic/speech_conformer_asr-en-16k-vocab4199-pytorch ) | |
66
- | SenseVoiceSmall | speech-to-text | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/SenseVoiceSmall ) , [ ModelScope] ( https://www.modelscope.cn/models/iic/SenseVoiceSmall ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
67
- | Bark | text-to-speech | [ Hugging Face] ( https://huggingface.co/suno/bark ) | |
68
- | Bark-small | text-to-speech | [ Hugging Face] ( https://huggingface.co/suno/bark-small ) | |
69
- | CosyVoice-300M-Instruct | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice-300M-Instruct ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M-Instruct ) | Linux(ARM not supported), Windows(Not supported), macOS ✅ ; |
70
- | CosyVoice-300M-SFT | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice-300M-SFT ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M-SFT ) | Linux(ARM not supported), Windows(Not supported), macOS ✅ ; |
71
- | CosyVoice-300M | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice-300M ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M ) | Linux(ARM not supported), Windows(Not supported), macOS ✅ ; |
72
- | CosyVoice-300M-25Hz | text-to-speech | [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M-25Hz ) | Linux(ARM not supported), Windows(Not supported), macOS ✅ ; |
48
+ | Model | Type | Link | Verified Platforms |
49
+ | ------------------------------- | -------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------- |
50
+ | Faster-whisper-large-v3 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-large-v3 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-large-v3 ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
51
+ | Faster-whisper-large-v2 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-large-v2 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-large-v2 ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
52
+ | Faster-whisper-large-v1 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-large-v1 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-large-v1 ) | |
53
+ | Faster-whisper-medium | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-medium ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-medium ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
54
+ | Faster-whisper-medium.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-medium.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-medium.en ) | |
55
+ | Faster-whisper-small | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-small ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-small ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
56
+ | Faster-whisper-small.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-small.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-small.en ) | |
57
+ | Faster-distil-whisper-large-v3 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-distil-whisper-large-v3 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-distil-whisper-large-v3 ) | MacOS ✅ ; |
58
+ | Faster-distil-whisper-large-v2 | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-distil-whisper-large-v2 ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-distil-whisper-large-v2 ) | MacOS ✅ ; |
59
+ | Faster-distil-whisper-medium.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-distil-whisper-medium.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-distil-whisper-medium.en ) | |
60
+ | Faster-whisper-tiny | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-tiny ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-tiny ) | |
61
+ | Faster-whisper-tiny.en | speech-to-text | [ Hugging Face] ( https://huggingface.co/Systran/faster-whisper-tiny.en ) , [ ModelScope] ( https://modelscope.cn/models/gpustack/faster-whisper-tiny.en ) | |
62
+ | Paraformer-zh | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/paraformer-zh ) , [ ModelScope] ( https://www.modelscope.cn/models/iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch ) | |
63
+ | Paraformer-zh-streaming | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/paraformer-zh-streaming ) , [ ModelScope] ( https://modelscope.cn/models/iic/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-online ) | Linux ✅ ; , MacOS ✅ ; |
64
+ | Paraformer-en | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/paraformer-en ) , [ ModelScope] ( https://www.modelscope.cn/models/iic/speech_paraformer-large-vad-punc_asr_nat-en-16k-common-vocab10020 ) | |
65
+ | Conformer-en | speech-to-text | [ Hugging Face] ( https://huggingface.co/funasr/conformer-en ) , [ Modelscope] ( https://modelscope.cn/models/iic/speech_conformer_asr-en-16k-vocab4199-pytorch ) | |
66
+ | SenseVoiceSmall | speech-to-text | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/SenseVoiceSmall ) , [ ModelScope] ( https://www.modelscope.cn/models/iic/SenseVoiceSmall ) | Linux ✅ ; , Windows ✅ ; , MacOS ✅ ; |
67
+ | Bark | text-to-speech | [ Hugging Face] ( https://huggingface.co/suno/bark ) | Linux ✅ ; , Windows, MacOS ✅ ; |
68
+ | Bark-small | text-to-speech | [ Hugging Face] ( https://huggingface.co/suno/bark-small ) | Linux ✅ ; , Windows, MacOS ✅ ; |
69
+ | CosyVoice2-0.5B | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice2-0.5B ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice2-0.5B ) | Linux(ARM not supported) ✅ ; , Windows(Not supported), macOS ✅ ; |
70
+ | CosyVoice-300M-Instruct | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice-300M-Instruct ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M-Instruct ) | Linux(ARM not supported) ✅ ; , Windows(Not supported), macOS ✅ ; |
71
+ | CosyVoice-300M-SFT | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice-300M-SFT ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M-SFT ) | Linux(ARM not supported) ✅ ; , Windows(Not supported), macOS ✅ ; |
72
+ | CosyVoice-300M | text-to-speech | [ Hugging Face] ( https://huggingface.co/FunAudioLLM/CosyVoice-300M ) , [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M ) | Linux(ARM not supported) ✅ ; , Windows(Not supported), macOS ✅ ; |
73
+ | CosyVoice-300M-25Hz | text-to-speech | [ ModelScope] ( https://modelscope.cn/models/iic/CosyVoice-300M-25Hz ) | Linux(ARM not supported) ✅ ; , Windows(Not supported), macOS ✅ ; |
73
74
74
75
## Supported APIs
75
76
0 commit comments