speaker-diarization #520
denmrnngp-cloud
started this conversation in
Ideas
Replies: 2 comments 4 replies
-
|
Could you provide more context How many speakers you were testing, reproducible example (audio + code) and version of MLX-Audio you are using? I personally find sortformer to be very accurate and I have tested upto 3h of audio. |
Beta Was this translation helpful? Give feedback.
1 reply
-
|
That model is actually a pyannote model. I can add it to the backlog 👌🏽 |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi! Thank you for the streaming_sortformer 4spk-v2 port.1. He has a problem with fake speakers, I found a way to fix it through a cosine matrix comparison of vectors and similar merge speakers. But I was infuriated by the number of iterations and the cost of resources. I tested this model https://huggingface.co/FluidInference/speaker-diarization-coreml - and after setting up the parameters, I began to get perfect results for speakers (from 2 to 6) - maybe you can port it to mlx? or is it not worth it?
Beta Was this translation helpful? Give feedback.
All reactions