Cloud Security Alliance News Clipping Site

Tag: neural audio codec

Hacker News: Moshi: A speech-text foundation model for real time dialogue

Sep 18, 2024, by system automation in Uncategorized

Source URL: https://github.com/kyutai-labs/moshi
Source: Hacker News
Title: Moshi: A speech-text foundation model for real time dialogue
Feedly Summary: Comments
AI Summary and Description: Yes

Summary: The text describes “Moshi,” a speech-text foundation model that enables real-time dialogue using advanced audio processing techniques. It introduces a new neural audio codec, “Mimi,” which supports fully streaming audio…