An Open-Source Project to Unify Audio Processing and Generation
Updated Jan 29, 2026 - Python
MOSS‑TTS Family is an open‑source family of speech and sound generation models from MOSI.AI and the OpenMOSS team. It targets high‑fidelity, highly expressive generation in complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.
[Official Implementation] Acoustic Autoregressive Modeling 🔥
Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.
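To make the "960x downsampling and 8-layer RVQ" in the description above concrete, here is a minimal, illustrative sketch of residual vector quantization in plain Python. It is not taken from the repository: the function names, the random codebooks, the toy codebook size (16) and frame dimension (4), and the 24 kHz sample rate used in the frame-rate arithmetic are all assumptions chosen for illustration.

```python
import random


def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared L2 distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def rvq_encode(frame, codebooks):
    """Residual VQ: each layer quantizes what the previous layers left unexplained."""
    residual = list(frame)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        # Subtract the chosen codeword so the next layer sees only the remainder.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


def rvq_decode(codes, codebooks):
    """Reconstruct a frame by summing the selected codeword from every layer."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


def frames_per_second(sample_rate_hz, downsample_factor=960):
    """Token frames produced per second of audio at a given downsampling factor."""
    return sample_rate_hz / downsample_factor


# Toy configuration (hypothetical values, not the repository's settings).
NUM_LAYERS, CODEBOOK_SIZE, DIM = 8, 16, 4
rng = random.Random(0)
codebooks = [
    [[rng.gauss(0, 1.0 / (layer + 1)) for _ in range(DIM)] for _ in range(CODEBOOK_SIZE)]
    for layer in range(NUM_LAYERS)
]

frame = [rng.gauss(0, 1) for _ in range(DIM)]
codes = rvq_encode(frame, codebooks)      # one code index per RVQ layer
recon = rvq_decode(codes, codebooks)      # approximate reconstruction of the frame

# Assuming 24 kHz input audio, 960x downsampling gives 25 frames per second.
print(len(codes), frames_per_second(24000))
```

With 8 layers, every frame is represented by 8 code indices, so under the assumed 24 kHz input the tokenizer would emit 25 × 8 = 200 codes per second of audio. In a trained tokenizer the codebooks are learned rather than random, and the frames come from an encoder network rather than a random generator.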