From 22e0b400e8658faf3792f270c0ccfc3a3dbc7a77 Mon Sep 17 00:00:00 2001
From: Vaclav Volhejn
Date: Thu, 11 Sep 2025 13:00:03 +0200
Subject: [PATCH] Replace "pre-print coming soon"

---
 README.md | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/README.md b/README.md
index a618a56..c9a066b 100644
--- a/README.md
+++ b/README.md
@@ -3,12 +3,10 @@
 This repo contains instructions and examples of how to run
 [Kyutai Speech-To-Text](#kyutai-speech-to-text)
 and [Kyutai Text-To-Speech](#kyutai-text-to-speech) models.
-These models are powered by delayed streams modeling (DSM),
-a flexible formulation for streaming, multimodal sequence-to-sequence learning.
 See also [Unmute](https://github.com/kyutai-labs/unmute), an voice AI system built using Kyutai STT and Kyutai TTS.
 
 But wait, what is "Delayed Streams Modeling"? It is a technique for solving many streaming X-to-Y tasks (with X, Y in `{speech, text}`)
-that formalize the approach we had with Moshi and Hibiki. A pre-print paper is coming soon!
+that formalize the approach we had with Moshi and Hibiki. See our [pre-print about DSM](https://arxiv.org/abs/2509.08753).
 
 ## Kyutai Speech-To-Text