Tag: speech-text foundation model
-
Hacker News: Moshi: A speech-text foundation model for real time dialogue
Source URL: https://github.com/kyutai-labs/moshi Source: Hacker News Title: Moshi: A speech-text foundation model for real time dialogue Feedly Summary: Comments AI Summary and Description: Yes Summary: The text describes “Moshi,” a speech-text foundation model that enables real-time dialogue using advanced audio processing techniques. It introduces a new neural audio codec, “Mimi,” which supports fully streaming audio…