You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[x] allow for specialized relative positional embeddings in fine transformer based on absolute matching positions of quantizers between coarse and fine
343
343
-[x] allow for grouped residual vq in soundstream (use `GroupedResidualVQ` from vector-quantize-pytorch lib), from <ahref="https://arxiv.org/abs/2305.02765">hifi-codec</a>
344
344
-[x] add flash attention with <ahref="https://arxiv.org/abs/2305.19466">NoPE</a>
345
+
-[x] accept prime wave in `AudioLM` as a path to an audio file, and auto resample for semantic vs acoustic
345
346
346
347
-[ ] design a hierarchical coarse and fine transformer
347
348
-[ ] investigate <ahref="https://openreview.net/forum?id=H-VlwsYvVi">spec decoding</a>, first test in x-transformers, then port over if applicable
348
-
-[ ] accept prime wave in `AudioLM` as a path to an audio file, and auto resample for semantic vs acoustic
349
349
350
350
-[ ] redo the positional embeddings in the presence of groups in residual vq
0 commit comments