same error like the respository
EAD is now at b1f9d87 Fixes and small update of functionality
remote: Enumerating objects: 13, done.
remote: Counting objects: 100% (11/11), done.
remote: Compressing objects: 100% (4/4), done.
remote: Total 13 (delta 8), reused 7 (delta 7), pack-reused 2 (from 1)
Unpacking objects: 100% (13/13), 4.01 KiB | 158.00 KiB/s, done.
From https://github.com/daswer123/xtts-webui
b1f9d87..5eb0e2f main -> origin/main
Updating b1f9d87..5eb0e2f
Fast-forward
README.md | 10 ++++++
README_pt-BR.md | 8 +++++
README_ru_RU.md | 10 ++++++
modules/text2voice/generation.py | 65 ++++++++++++++++++++++++++++++++-------
requirements.txt | Bin 902 -> 453 bytes
5 files changed, 82 insertions(+), 11 deletions(-)
Das System kann den angegebenen Pfad nicht finden.
Looking in indexes: https://pypi.org/simple, https://pypi.ngc.nvidia.com
Collecting gradio==4.43.0
Downloading gradio-4.43.0-py3-none-any.whl (18.1 MB)
---------------------------------------- 18.1/18.1 MB 6.8 MB/s eta 0:00:00
Requirement already satisfied: torch==2.1.1 in .\venv\lib\site-packages (from -r requirements.txt (line 2)) (2.1.1+cu118)
Requirement already satisfied: torchaudio==2.1.1 in .\venv\lib\site-packages (from -r requirements.txt (line 3)) (2.1.1+cu118)
Requirement already satisfied: faster-whisper==1.0.1 in .\venv\lib\site-packages (from -r requirements.txt (line 4)) (1.0.1)
Collecting coqui-tts[languages]==0.24.2
Downloading coqui_tts-0.24.2-cp310-cp310-win_amd64.whl (1.1 MB)
---------------------------------------- 1.1/1.1 MB 6.9 MB/s eta 0:00:00
Requirement already satisfied: pypinyin in .\venv\lib\site-packages (from -r requirements.txt (line 6)) (0.51.0)
Requirement already satisfied: hangul_romanize in .\venv\lib\site-packages (from -r requirements.txt (line 7)) (0.1.0)
Requirement already satisfied: langid in .\venv\lib\site-packages (from -r requirements.txt (line 8)) (1.1.6)
Requirement already satisfied: noisereduce in .\venv\lib\site-packages (from -r requirements.txt (line 9)) (3.0.2)
Requirement already satisfied: pedalboard in .\venv\lib\site-packages (from -r requirements.txt (line 10)) (0.9.6)
Requirement already satisfied: pydub in .\venv\lib\site-packages (from -r requirements.txt (line 11)) (0.25.1)
Requirement already satisfied: ffmpeg-python in .\venv\lib\site-packages (from -r requirements.txt (line 12)) (0.2.0)
Requirement already satisfied: soundfile in .\venv\lib\site-packages (from -r requirements.txt (line 13)) (0.12.1)
Requirement already satisfied: cutlet in .\venv\lib\site-packages (from -r requirements.txt (line 14)) (0.4.0)
Requirement already satisfied: fugashi[unidic-lite] in .\venv\lib\site-packages (from -r requirements.txt (line 15)) (1.3.2)
Requirement already satisfied: loguru in .\venv\lib\site-packages (from -r requirements.txt (line 16)) (0.7.2)
Requirement already satisfied: omegaconf==2.3.0 in .\venv\lib\site-packages (from -r requirements.txt (line 17)) (2.3.0)
Requirement already satisfied: resampy==0.4.2 in .\venv\lib\site-packages (from -r requirements.txt (line 18)) (0.4.2)
Requirement already satisfied: tabulate==0.8.10 in .\venv\lib\site-packages (from -r requirements.txt (line 19)) (0.8.10)
Requirement already satisfied: requests in .\venv\lib\site-packages (from -r requirements.txt (line 20)) (2.32.3)
Requirement already satisfied: faiss-cpu in .\venv\lib\site-packages (from -r requirements.txt (line 21)) (1.8.0)
Requirement already satisfied: pyworld in .\venv\lib\site-packages (from -r requirements.txt (line 22)) (0.3.4)
Requirement already satisfied: torchcrepe in .\venv\lib\site-packages (from -r requirements.txt (line 23)) (0.0.22)
Requirement already satisfied: praat-parselmouth>=0.4.2 in .\venv\lib\site-packages (from -r requirements.txt (line 24)) (0.4.3)
Requirement already satisfied: translators in .\venv\lib\site-packages (from -r requirements.txt (line 25)) (5.9.2)
Requirement already satisfied: spacy>=3.2.0 in .\venv\lib\site-packages (from -r requirements.txt (line 26)) (3.7.4)
Requirement already satisfied: transformers==4.36.2 in .\venv\lib\site-packages (from -r requirements.txt (line 27)) (4.36.2)
Requirement already satisfied: deepl in .\venv\lib\site-packages (from -r requirements.txt (line 28)) (1.18.0)
Requirement already satisfied: pysubs2 in .\venv\lib\site-packages (from -r requirements.txt (line 30)) (1.7.2)
Requirement already satisfied: whisperx in .\venv\lib\site-packages (from -r requirements.txt (line 31)) (3.1.3)
Requirement already satisfied: silero-tts in .\venv\lib\site-packages (from -r requirements.txt (line 32)) (0.0.4)
Requirement already satisfied: pyyaml<7.0,>=5.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (6.0.1)
Requirement already satisfied: ffmpy in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.3.2)
Requirement already satisfied: ruff>=0.2.2 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.4.7)
Requirement already satisfied: pillow<11.0,>=8.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (10.3.0)
Collecting typer<1.0,>=0.12
Downloading typer-0.15.1-py3-none-any.whl (44 kB)
---------------------------------------- 44.9/44.9 kB ? eta 0:00:00
Requirement already satisfied: orjson~=3.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (3.10.3)
Requirement already satisfied: python-multipart>=0.0.9 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.0.9)
Requirement already satisfied: fastapi<0.113.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.110.3)
Requirement already satisfied: anyio<5.0,>=3.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (4.4.0)
Requirement already satisfied: numpy<3.0,>=1.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (1.26.4)
Requirement already satisfied: urllib3~=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.2.1)
Requirement already satisfied: aiofiles<24.0,>=22.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (23.2.1)
Requirement already satisfied: matplotlib~=3.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (3.9.0)
Requirement already satisfied: packaging in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (24.0)
Requirement already satisfied: pydantic>=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.7.2)
Requirement already satisfied: pandas<3.0,>=1.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (1.5.3)
Requirement already satisfied: uvicorn>=0.14.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.30.0)
Requirement already satisfied: huggingface-hub>=0.19.3 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.23.2)
Requirement already satisfied: markupsafe~=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.1.5)
Collecting gradio-client==1.3.0
Downloading gradio_client-1.3.0-py3-none-any.whl (318 kB)
---------------------------------------- 318.7/318.7 kB 6.6 MB/s eta 0:00:00
Requirement already satisfied: tomlkit==0.12.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.12.0)
Requirement already satisfied: importlib-resources<7.0,>=1.3 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (6.4.0)
Requirement already satisfied: httpx>=0.24.1 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.27.0)
Requirement already satisfied: semantic-version~=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.10.0)
Requirement already satisfied: typing-extensions~=4.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (4.12.1)
Requirement already satisfied: jinja2<4.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (3.1.4)
Requirement already satisfied: networkx in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (2.8.8)
Requirement already satisfied: sympy in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (1.12.1)
Requirement already satisfied: filelock in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (3.14.0)
Requirement already satisfied: fsspec in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (2024.5.0)
Requirement already satisfied: onnxruntime<2,>=1.14 in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (1.18.0)
Requirement already satisfied: ctranslate2<5,>=4.0 in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (4.2.1)
Requirement already satisfied: tokenizers<0.16,>=0.13 in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (0.15.2)
Requirement already satisfied: av==11.* in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (11.0.0)
Requirement already satisfied: librosa>=0.10.1 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (0.10.2.post1)
Requirement already satisfied: encodec>=0.1.1 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (0.1.1)
Collecting coqui-tts-trainer==0.1.4
Downloading coqui_tts_trainer-0.1.4-py3-none-any.whl (56 kB)
---------------------------------------- 56.4/56.4 kB 2.9 MB/s eta 0:00:00
Requirement already satisfied: scipy>=1.11.2 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (1.13.1)
Requirement already satisfied: inflect>=5.6.0 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (7.2.1)
Requirement already satisfied: cython>=3.0.0 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (3.0.10)
Requirement already satisfied: num2words>=0.5.11 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (0.5.13)
INFO: pip is looking at multiple versions of faster-whisper to determine which version is compatible with other requirements. This could take a while.
Collecting faster-whisper==1.0.1
Downloading faster_whisper-1.0.1-py3-none-any.whl (1.5 MB)
---------------------------------------- 1.5/1.5 MB 7.0 MB/s eta 0:00:00
INFO: pip is looking at multiple versions of torchaudio to determine which version is compatible with other requirements. This could take a while.
Collecting torchaudio==2.1.1
Downloading torchaudio-2.1.1-cp310-cp310-win_amd64.whl (2.3 MB)
---------------------------------------- 2.3/2.3 MB 7.4 MB/s eta 0:00:00
INFO: pip is looking at multiple versions of torch to determine which version is compatible with other requirements. This could take a while.
Collecting torch==2.1.1
Downloading torch-2.1.1-cp310-cp310-win_amd64.whl (192.3 MB)
---------------------------------------- 192.3/192.3 MB 7.0 MB/s eta 0:00:00
INFO: pip is looking at multiple versions of to determine which version is compatible with other requirements. This could take a while.
INFO: pip is looking at multiple versions of gradio to determine which version is compatible with other requirements. This could take a while.
ERROR: Cannot install coqui-tts[languages]==0.24.2 and transformers==4.36.2 because these package versions have conflicting dependencies.
The conflict is caused by:
The user requested transformers==4.36.2
coqui-tts[languages] 0.24.2 depends on transformers<4.43.0 and >=4.42.0
To fix this you could try to:
- loosen the range of package versions you've specified
- remove package versions to allow pip attempt to solve the dependency conflict
ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts
[notice] A new release of pip available: 22.3.1 -> 24.3.1
[notice] To update, run: d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\Scripts\python.exe -m pip install --upgrade pip
Update complete
Drücken Sie eine beliebige Taste . . .
Xtts starts but without TTS
Start running xtts-webui, this may take a while....
d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\lib\site-packages\transformers\utils\hub.py:123: FutureWarning: Using TRANSFORMERS_CACHE is deprecated and will be removed in v5 of Transformers. Use HF_HOME instead.
warnings.warn(
2024-12-16 18:15:39.354 | SUCCESS | silero_tts.silero_tts:load_models_config:52 - Models config loaded from: d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\lib\site-packages\silero_tts\latest_silero_models.yml
2024-12-16 18:15:39.354 | INFO | silero_tts.silero_tts:init_model:152 - Initializing model
Using cache found in C:\Users\kallemst/.cache\torch\hub\snakers4_silero-models_master
2024-12-16 18:15:42.124 | INFO | silero_tts.silero_tts:init_model:171 - Setup takes 2.76 seconds
2024-12-16 18:15:42.124 | INFO | silero_tts.silero_tts:init_model:173 - Loading model
2024-12-16 18:15:42.124 | INFO | silero_tts.silero_tts:init_model:176 - Model to device takes 0.00 seconds
2024-12-16 18:15:42.124 | SUCCESS | silero_tts.silero_tts:init_model:183 - Model is loaded
TTS is not installed.
2024-12-16 18:15:42.124 | INFO | xtts_webui::67 - Start loading model v2.0.2
2024-12-16 18:15:42.124 | INFO | xtts_webui::70 - this dir: D:\xtts2-ui-boltzmann\xtts2-ui_portable\webui
[2024-12-16 18:15:53,104] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-12-16 18:15:53,264] torch.distributed.elastic.multiprocessing.redirects: [WARNING] NOTE: Redirects are currently not supported in Windows or MacOs.
[2024-12-16 18:15:53,404] [INFO] [logging.py:96:log_dist] [Rank -1] DeepSpeed info: version=0.11.2+unknown, git-hash=unknown, git-branch=unknown
[2024-12-16 18:15:53,404] [WARNING] [config_utils.py:69:_process_deprecated_field] Config parameter replace_method is deprecated. This parameter is no longer needed, please remove from your call to DeepSpeed-inference
[2024-12-16 18:15:53,404] [WARNING] [config_utils.py:69:_process_deprecated_field] Config parameter mp_size is deprecated use tensor_parallel.tp_size instead
[2024-12-16 18:15:53,404] [INFO] [logging.py:96:log_dist] [Rank -1] quantize_bits = 8 mlp_extra_grouping = False, quantize_groups = 1
[2024-12-16 18:15:53,564] [INFO] [logging.py:96:log_dist] [Rank -1] DeepSpeed-Inference config: {'layer_id': 0, 'hidden_size': 1024, 'intermediate_size': 4096, 'heads': 16, 'num_hidden_layers': -1, 'dtype': torch.float32, 'pre_layer_norm': True, 'norm_type': <NormType.LayerNorm: 1>, 'local_rank': -1, 'stochastic_mode': False, 'epsilon': 1e-05, 'mp_size': 1, 'scale_attention': True, 'triangular_masking': True, 'local_attention': False, 'window_size': 1, 'rotary_dim': -1, 'rotate_half': False, 'rotate_every_two': True, 'return_tuple': True, 'mlp_after_attn': True, 'mlp_act_func_type': <ActivationFuncType.GELU: 1>, 'specialized_mode': False, 'training_mp_size': 1, 'bigscience_bloom': False, 'max_out_tokens': 1024, 'min_out_tokens': 1, 'scale_attn_by_inverse_layer_idx': False, 'enable_qkv_quantization': False, 'use_mup': False, 'return_single_tuple': False, 'set_empty_params': False, 'transposed_mode': False, 'use_triton': False, 'triton_autotune': False, 'num_kv': -1, 'rope_theta': 10000}
2024-12-16 18:15:54.044 | INFO | scripts.tts_funcs:load_model:102 - Pre-create latents for all current speakers
2024-12-16 18:15:54.054 | INFO | scripts.tts_funcs:get_or_create_latents:185 - creating latents for calm_female: speakers/calm_female.wav
2024-12-16 18:15:54.520 | INFO | scripts.tts_funcs:get_or_create_latents:185 - creating latents for female: speakers/female.wav
2024-12-16 18:15:54.584 | INFO | scripts.tts_funcs:get_or_create_latents:185 - creating latents for male: speakers/male.wav
2024-12-16 18:15:54.644 | INFO | scripts.tts_funcs:create_latents_for_all:199 - Latents created for all 3 speakers.
2024-12-16 18:15:54.644 | INFO | scripts.tts_funcs:load_model:106 - Model successfully loaded
['aidar', 'baya', 'kseniya', 'xenia', 'eugene', 'random']
IMPORTANT: You are using gradio version 4.13.0, however version 4.44.1 is available, please upgrade.
d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
torchaudio.set_audio_backend("soundfile")
torchvision is not available - cannot save figures
Running on local URL: http://127.0.0.1:8010
To create a public link, set share=True in launch().
same error like the respository
EAD is now at b1f9d87 Fixes and small update of functionality
remote: Enumerating objects: 13, done.
remote: Counting objects: 100% (11/11), done.
remote: Compressing objects: 100% (4/4), done.
remote: Total 13 (delta 8), reused 7 (delta 7), pack-reused 2 (from 1)
Unpacking objects: 100% (13/13), 4.01 KiB | 158.00 KiB/s, done.
From https://github.com/daswer123/xtts-webui
b1f9d87..5eb0e2f main -> origin/main
Updating b1f9d87..5eb0e2f
Fast-forward
README.md | 10 ++++++
README_pt-BR.md | 8 +++++
README_ru_RU.md | 10 ++++++
modules/text2voice/generation.py | 65 ++++++++++++++++++++++++++++++++-------
requirements.txt | Bin 902 -> 453 bytes
5 files changed, 82 insertions(+), 11 deletions(-)
Das System kann den angegebenen Pfad nicht finden.
Looking in indexes: https://pypi.org/simple, https://pypi.ngc.nvidia.com
Collecting gradio==4.43.0
Downloading gradio-4.43.0-py3-none-any.whl (18.1 MB)
---------------------------------------- 18.1/18.1 MB 6.8 MB/s eta 0:00:00
Requirement already satisfied: torch==2.1.1 in .\venv\lib\site-packages (from -r requirements.txt (line 2)) (2.1.1+cu118)
Requirement already satisfied: torchaudio==2.1.1 in .\venv\lib\site-packages (from -r requirements.txt (line 3)) (2.1.1+cu118)
Requirement already satisfied: faster-whisper==1.0.1 in .\venv\lib\site-packages (from -r requirements.txt (line 4)) (1.0.1)
Collecting coqui-tts[languages]==0.24.2
Downloading coqui_tts-0.24.2-cp310-cp310-win_amd64.whl (1.1 MB)
---------------------------------------- 1.1/1.1 MB 6.9 MB/s eta 0:00:00
Requirement already satisfied: pypinyin in .\venv\lib\site-packages (from -r requirements.txt (line 6)) (0.51.0)
Requirement already satisfied: hangul_romanize in .\venv\lib\site-packages (from -r requirements.txt (line 7)) (0.1.0)
Requirement already satisfied: langid in .\venv\lib\site-packages (from -r requirements.txt (line 8)) (1.1.6)
Requirement already satisfied: noisereduce in .\venv\lib\site-packages (from -r requirements.txt (line 9)) (3.0.2)
Requirement already satisfied: pedalboard in .\venv\lib\site-packages (from -r requirements.txt (line 10)) (0.9.6)
Requirement already satisfied: pydub in .\venv\lib\site-packages (from -r requirements.txt (line 11)) (0.25.1)
Requirement already satisfied: ffmpeg-python in .\venv\lib\site-packages (from -r requirements.txt (line 12)) (0.2.0)
Requirement already satisfied: soundfile in .\venv\lib\site-packages (from -r requirements.txt (line 13)) (0.12.1)
Requirement already satisfied: cutlet in .\venv\lib\site-packages (from -r requirements.txt (line 14)) (0.4.0)
Requirement already satisfied: fugashi[unidic-lite] in .\venv\lib\site-packages (from -r requirements.txt (line 15)) (1.3.2)
Requirement already satisfied: loguru in .\venv\lib\site-packages (from -r requirements.txt (line 16)) (0.7.2)
Requirement already satisfied: omegaconf==2.3.0 in .\venv\lib\site-packages (from -r requirements.txt (line 17)) (2.3.0)
Requirement already satisfied: resampy==0.4.2 in .\venv\lib\site-packages (from -r requirements.txt (line 18)) (0.4.2)
Requirement already satisfied: tabulate==0.8.10 in .\venv\lib\site-packages (from -r requirements.txt (line 19)) (0.8.10)
Requirement already satisfied: requests in .\venv\lib\site-packages (from -r requirements.txt (line 20)) (2.32.3)
Requirement already satisfied: faiss-cpu in .\venv\lib\site-packages (from -r requirements.txt (line 21)) (1.8.0)
Requirement already satisfied: pyworld in .\venv\lib\site-packages (from -r requirements.txt (line 22)) (0.3.4)
Requirement already satisfied: torchcrepe in .\venv\lib\site-packages (from -r requirements.txt (line 23)) (0.0.22)
Requirement already satisfied: praat-parselmouth>=0.4.2 in .\venv\lib\site-packages (from -r requirements.txt (line 24)) (0.4.3)
Requirement already satisfied: translators in .\venv\lib\site-packages (from -r requirements.txt (line 25)) (5.9.2)
Requirement already satisfied: spacy>=3.2.0 in .\venv\lib\site-packages (from -r requirements.txt (line 26)) (3.7.4)
Requirement already satisfied: transformers==4.36.2 in .\venv\lib\site-packages (from -r requirements.txt (line 27)) (4.36.2)
Requirement already satisfied: deepl in .\venv\lib\site-packages (from -r requirements.txt (line 28)) (1.18.0)
Requirement already satisfied: pysubs2 in .\venv\lib\site-packages (from -r requirements.txt (line 30)) (1.7.2)
Requirement already satisfied: whisperx in .\venv\lib\site-packages (from -r requirements.txt (line 31)) (3.1.3)
Requirement already satisfied: silero-tts in .\venv\lib\site-packages (from -r requirements.txt (line 32)) (0.0.4)
Requirement already satisfied: pyyaml<7.0,>=5.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (6.0.1)
Requirement already satisfied: ffmpy in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.3.2)
Requirement already satisfied: ruff>=0.2.2 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.4.7)
Requirement already satisfied: pillow<11.0,>=8.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (10.3.0)
Collecting typer<1.0,>=0.12
Downloading typer-0.15.1-py3-none-any.whl (44 kB)
---------------------------------------- 44.9/44.9 kB ? eta 0:00:00
Requirement already satisfied: orjson~=3.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (3.10.3)
Requirement already satisfied: python-multipart>=0.0.9 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.0.9)
Requirement already satisfied: fastapi<0.113.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.110.3)
Requirement already satisfied: anyio<5.0,>=3.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (4.4.0)
Requirement already satisfied: numpy<3.0,>=1.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (1.26.4)
Requirement already satisfied: urllib3~=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.2.1)
Requirement already satisfied: aiofiles<24.0,>=22.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (23.2.1)
Requirement already satisfied: matplotlib~=3.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (3.9.0)
Requirement already satisfied: packaging in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (24.0)
Requirement already satisfied: pydantic>=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.7.2)
Requirement already satisfied: pandas<3.0,>=1.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (1.5.3)
Requirement already satisfied: uvicorn>=0.14.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.30.0)
Requirement already satisfied: huggingface-hub>=0.19.3 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.23.2)
Requirement already satisfied: markupsafe~=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.1.5)
Collecting gradio-client==1.3.0
Downloading gradio_client-1.3.0-py3-none-any.whl (318 kB)
---------------------------------------- 318.7/318.7 kB 6.6 MB/s eta 0:00:00
Requirement already satisfied: tomlkit==0.12.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.12.0)
Requirement already satisfied: importlib-resources<7.0,>=1.3 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (6.4.0)
Requirement already satisfied: httpx>=0.24.1 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (0.27.0)
Requirement already satisfied: semantic-version~=2.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (2.10.0)
Requirement already satisfied: typing-extensions~=4.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (4.12.1)
Requirement already satisfied: jinja2<4.0 in .\venv\lib\site-packages (from gradio==4.43.0->-r requirements.txt (line 1)) (3.1.4)
Requirement already satisfied: networkx in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (2.8.8)
Requirement already satisfied: sympy in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (1.12.1)
Requirement already satisfied: filelock in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (3.14.0)
Requirement already satisfied: fsspec in .\venv\lib\site-packages (from torch==2.1.1->-r requirements.txt (line 2)) (2024.5.0)
Requirement already satisfied: onnxruntime<2,>=1.14 in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (1.18.0)
Requirement already satisfied: ctranslate2<5,>=4.0 in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (4.2.1)
Requirement already satisfied: tokenizers<0.16,>=0.13 in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (0.15.2)
Requirement already satisfied: av==11.* in .\venv\lib\site-packages (from faster-whisper==1.0.1->-r requirements.txt (line 4)) (11.0.0)
Requirement already satisfied: librosa>=0.10.1 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (0.10.2.post1)
Requirement already satisfied: encodec>=0.1.1 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (0.1.1)
Collecting coqui-tts-trainer==0.1.4
Downloading coqui_tts_trainer-0.1.4-py3-none-any.whl (56 kB)
---------------------------------------- 56.4/56.4 kB 2.9 MB/s eta 0:00:00
Requirement already satisfied: scipy>=1.11.2 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (1.13.1)
Requirement already satisfied: inflect>=5.6.0 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (7.2.1)
Requirement already satisfied: cython>=3.0.0 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (3.0.10)
Requirement already satisfied: num2words>=0.5.11 in .\venv\lib\site-packages (from coqui-tts[languages]==0.24.2->-r requirements.txt (line 5)) (0.5.13)
INFO: pip is looking at multiple versions of faster-whisper to determine which version is compatible with other requirements. This could take a while.
Collecting faster-whisper==1.0.1
Downloading faster_whisper-1.0.1-py3-none-any.whl (1.5 MB)
---------------------------------------- 1.5/1.5 MB 7.0 MB/s eta 0:00:00
INFO: pip is looking at multiple versions of torchaudio to determine which version is compatible with other requirements. This could take a while.
Collecting torchaudio==2.1.1
Downloading torchaudio-2.1.1-cp310-cp310-win_amd64.whl (2.3 MB)
---------------------------------------- 2.3/2.3 MB 7.4 MB/s eta 0:00:00
INFO: pip is looking at multiple versions of torch to determine which version is compatible with other requirements. This could take a while.
Collecting torch==2.1.1
Downloading torch-2.1.1-cp310-cp310-win_amd64.whl (192.3 MB)
---------------------------------------- 192.3/192.3 MB 7.0 MB/s eta 0:00:00
INFO: pip is looking at multiple versions of to determine which version is compatible with other requirements. This could take a while.
INFO: pip is looking at multiple versions of gradio to determine which version is compatible with other requirements. This could take a while.
ERROR: Cannot install coqui-tts[languages]==0.24.2 and transformers==4.36.2 because these package versions have conflicting dependencies.
The conflict is caused by:
The user requested transformers==4.36.2
coqui-tts[languages] 0.24.2 depends on transformers<4.43.0 and >=4.42.0
To fix this you could try to:
ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts
[notice] A new release of pip available: 22.3.1 -> 24.3.1
[notice] To update, run: d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\Scripts\python.exe -m pip install --upgrade pip
Update complete
Drücken Sie eine beliebige Taste . . .
Xtts starts but without TTS
Start running xtts-webui, this may take a while....
d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\lib\site-packages\transformers\utils\hub.py:123: FutureWarning: Using
TRANSFORMERS_CACHEis deprecated and will be removed in v5 of Transformers. UseHF_HOMEinstead.warnings.warn(
2024-12-16 18:15:39.354 | SUCCESS | silero_tts.silero_tts:load_models_config:52 - Models config loaded from: d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\lib\site-packages\silero_tts\latest_silero_models.yml
2024-12-16 18:15:39.354 | INFO | silero_tts.silero_tts:init_model:152 - Initializing model
Using cache found in C:\Users\kallemst/.cache\torch\hub\snakers4_silero-models_master
2024-12-16 18:15:42.124 | INFO | silero_tts.silero_tts:init_model:171 - Setup takes 2.76 seconds
2024-12-16 18:15:42.124 | INFO | silero_tts.silero_tts:init_model:173 - Loading model
2024-12-16 18:15:42.124 | INFO | silero_tts.silero_tts:init_model:176 - Model to device takes 0.00 seconds
2024-12-16 18:15:42.124 | SUCCESS | silero_tts.silero_tts:init_model:183 - Model is loaded
TTS is not installed.
2024-12-16 18:15:42.124 | INFO | xtts_webui::67 - Start loading model v2.0.2
2024-12-16 18:15:42.124 | INFO | xtts_webui::70 - this dir: D:\xtts2-ui-boltzmann\xtts2-ui_portable\webui
[2024-12-16 18:15:53,104] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2024-12-16 18:15:53,264] torch.distributed.elastic.multiprocessing.redirects: [WARNING] NOTE: Redirects are currently not supported in Windows or MacOs.
[2024-12-16 18:15:53,404] [INFO] [logging.py:96:log_dist] [Rank -1] DeepSpeed info: version=0.11.2+unknown, git-hash=unknown, git-branch=unknown
[2024-12-16 18:15:53,404] [WARNING] [config_utils.py:69:_process_deprecated_field] Config parameter replace_method is deprecated. This parameter is no longer needed, please remove from your call to DeepSpeed-inference
[2024-12-16 18:15:53,404] [WARNING] [config_utils.py:69:_process_deprecated_field] Config parameter mp_size is deprecated use tensor_parallel.tp_size instead
[2024-12-16 18:15:53,404] [INFO] [logging.py:96:log_dist] [Rank -1] quantize_bits = 8 mlp_extra_grouping = False, quantize_groups = 1
[2024-12-16 18:15:53,564] [INFO] [logging.py:96:log_dist] [Rank -1] DeepSpeed-Inference config: {'layer_id': 0, 'hidden_size': 1024, 'intermediate_size': 4096, 'heads': 16, 'num_hidden_layers': -1, 'dtype': torch.float32, 'pre_layer_norm': True, 'norm_type': <NormType.LayerNorm: 1>, 'local_rank': -1, 'stochastic_mode': False, 'epsilon': 1e-05, 'mp_size': 1, 'scale_attention': True, 'triangular_masking': True, 'local_attention': False, 'window_size': 1, 'rotary_dim': -1, 'rotate_half': False, 'rotate_every_two': True, 'return_tuple': True, 'mlp_after_attn': True, 'mlp_act_func_type': <ActivationFuncType.GELU: 1>, 'specialized_mode': False, 'training_mp_size': 1, 'bigscience_bloom': False, 'max_out_tokens': 1024, 'min_out_tokens': 1, 'scale_attn_by_inverse_layer_idx': False, 'enable_qkv_quantization': False, 'use_mup': False, 'return_single_tuple': False, 'set_empty_params': False, 'transposed_mode': False, 'use_triton': False, 'triton_autotune': False, 'num_kv': -1, 'rope_theta': 10000}
2024-12-16 18:15:54.044 | INFO | scripts.tts_funcs:load_model:102 - Pre-create latents for all current speakers
2024-12-16 18:15:54.054 | INFO | scripts.tts_funcs:get_or_create_latents:185 - creating latents for calm_female: speakers/calm_female.wav
2024-12-16 18:15:54.520 | INFO | scripts.tts_funcs:get_or_create_latents:185 - creating latents for female: speakers/female.wav
2024-12-16 18:15:54.584 | INFO | scripts.tts_funcs:get_or_create_latents:185 - creating latents for male: speakers/male.wav
2024-12-16 18:15:54.644 | INFO | scripts.tts_funcs:create_latents_for_all:199 - Latents created for all 3 speakers.
2024-12-16 18:15:54.644 | INFO | scripts.tts_funcs:load_model:106 - Model successfully loaded
['aidar', 'baya', 'kseniya', 'xenia', 'eugene', 'random']
IMPORTANT: You are using gradio version 4.13.0, however version 4.44.1 is available, please upgrade.
d:\xtts2-ui-boltzmann\xtts2-ui_portable\webui\venv\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
torchaudio.set_audio_backend("soundfile")
torchvision is not available - cannot save figures
Running on local URL: http://127.0.0.1:8010
To create a public link, set
share=Trueinlaunch().