Optimize realtime silence path and SOLA hot-loop to reduce CPU/GPU overhead by blaisewf · Pull Request #1238 · IAHispano/Applio

blaisewf · 2026-04-13T21:03:43Z

Reduce repeated heavy pipeline runs during silence to lower CPU/GPU usage and reduce wake-up latency spikes in realtime conversion.
Remove per-chunk allocations and temporary tensors in the SOLA/crossfade hot path to improve throughput and consistency.

Add a throttled keepalive interval controlled via keepalive_ms in **kwargs (default 200 ms) and convert it to keepalive_interval_s, tracking last call with last_keepalive_time.
Introduce _run_silence_keepalive(...) helper that centralizes the periodic background pipeline.voice_conversion(...) call and is invoked from VAD and low-volume early-return paths.
Cache a device-side silence tensor self._silence_template to avoid allocating zeros on each silent chunk and return it for silent fast-paths.
Precompute and reuse sola_norm_kernel and onset_fade_window in VoiceChanger.generate_strength and replace per-request torch.ones(...) and torch.linspace(...) with these cached tensors in the SOLA denominator and onset fade logic.

Optimize realtime silence path and crossfade hot loop

88f32a4

blaisewf added the codex label Apr 13, 2026 — with ChatGPT Codex Connector

Provide feedback