GitHub - HumeAI/tada: Open Source Speech Language Model
github.com/HumeAI/tadaTADA is a unified speech-language model that synchronizes speech and text using 1:1 alignment, achieving high-fidelity synthesis with reduced computational overhead. It leverages a novel tokenizer and architectural design, allowing each autoregressive step to cover one text token, dynamically determining its duration and prosody. TADA supports multilingual speech synthesis and can be used for text-speech continuation.