From 55632edfb2cb309e5883ceb41fe2ffdad4feae83 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?V=C3=A1clav=20Volhejn?= Date: Mon, 7 Jul 2025 16:50:33 +0200 Subject: [PATCH] Plug Unmute --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 8d37560..bd6d97d 100644 --- a/README.md +++ b/README.md @@ -5,6 +5,7 @@ This repo contains instructions and examples of how to run and [Kyutai Text-To-Speech](#kyutai-text-to-speech) models. These models are powered by delayed streams modeling (DSM), a flexible formulation for streaming, multimodal sequence-to-sequence learning. +See also [Unmute](https://github.com/kyutai-labs/unmute), an voice AI system built using Kyutai STT and Kyutai TTS. But wait, what is "Delayed Streams Modeling"? It is a technique for solving many streaming X-to-Y tasks (with X, Y in `{speech, text}`) that formalize the approach we had with Moshi and Hibiki. A pre-print paper is coming soon!