Unmute adds ears and vocal cords to your favorite text-based language model. A seamless plug-and-play augmentation with easy personalisation through voice conditioning and text instructions. We will open-source it shortly.
After its preview release last January, Helium 1 now arrives in full, with 2 billion well-used open parameters. 🇧🇬 🇭🇷 🇨🇿 🇩🇰 🇳🇱 🇬🇧 🇪🇪 🇫🇮 🇫🇷 🇩🇪 🇬🇷 🇭🇺 🇮🇪 🇮🇹 🇱🇻 🇱🇹 🇲🇹 🇵🇱 🇵🇹 🇷🇴 🇸🇰 🇸🇮 🇪🇸 🇸🇪
One virtue of open models is that they can be adapted to one’s needs. This is even more impactful when finetuning is data- and compute-efficient, which is something we strive for at Kyutai. Let’s start with Moshi, our groundbreaking multi-stream spoken dialogue model.
🔥🔥🔥 CV folks, I have some news! We're organizing a one-day meeting in central Paris on June 6th, before CVPR, called CVPR@Paris (similar to NeurIPS@Paris) 🥐🍾🥖🍷
Registration is open (it's free) with priority given to authors of accepted papers: cvprinparis.github.io/CVPR2025InPa...
Big 🧵👇 with details!
I wish it were a disgraceful video generated by an unhinged AI. Unfortunately, it is the disgraceful new reality. Shame on Trump and Vance.
www.theguardian.com/us-news/2025...
Pushing testing dedication to the next level.
Simultaneous speech-to-speech translation on mobile is a world first. In the near future, no one will ever be lost in translation (at least for linguistic reasons).
A new step in sharing on our journey towards easy-to-use, fully open models.