Commit Graph

75 Commits

Author SHA1 Message Date
Václav Volhejn
cafac63222
Run pre-commit correctly in CI (#66)
* fix and break

* Remove intentional error
2025-07-08 10:11:52 +02:00
Václav Volhejn
7336d7a3da
Fix instructions on how to install the Rust server (#65)
* Fix instructions on Rust server installation

* Plug Unmute
2025-07-07 18:00:23 +02:00
Laurent Mazare
70500c620e
Add a device argument to the tts pytorch script. (#62) 2025-07-07 08:36:47 +02:00
Chenghao Mou
f8e97aa4f3
fix minor issues with readme commands (#55) 2025-07-07 08:18:05 +02:00
laurent
91a4d120cb Use moshi 0.2.8. 2025-07-07 08:12:16 +02:00
Laurent
bfc200f6ee Use bfloat16 rather than half by default. 2025-07-05 23:02:58 +02:00
laurent
f9739881e6 Typo. 2025-07-03 19:02:42 +02:00
Alexandre Défossez
99599fa408
Update README.md 2025-07-03 16:15:01 +02:00
Pierre-Hugues HUSSON
3a4165a84f
Fix stt_from_file_pytorch (#39)
1. argparse declares in_file, but code reads file
2. text_tokens.numpy().tolist() is a list of list of list of int
instead of the supported list of list of int.
this is a debugging print just drop it

Co-authored-by: Pierre-Hugues Husson <phhusson@freebox.fr>
2025-07-03 15:26:34 +02:00
Alexandre Défossez
e9bac066ea
Update README.md 2025-07-03 15:09:41 +02:00
Alexandre Défossez
eae5e17975
Some updates to the colab and script (#38)
* changing streaming to be robust to repeated generation

* some changes

* plop

* plop

* plop

* plop
2025-07-03 15:06:37 +02:00
Václav Volhejn
c1d248abba
Fix text tokenizer path (#36) 2025-07-03 14:27:06 +02:00
Václav Volhejn
c6f262346f
Don't install moshi from Git (#37)
* Don't install moshi from Git

* Remove commented-out invalid message send in websocket_client
2025-07-03 13:37:38 +02:00
laurent
3573ee90af Oops. 2025-07-03 13:08:00 +02:00
laurent
25574aa104 Fixes for the notebook. 2025-07-03 13:05:00 +02:00
laurent
1cd9529f65 Json fix. 2025-07-03 12:57:22 +02:00
laurent
0ee2354176 Chunk decoding in the pth notebook. 2025-07-03 12:56:00 +02:00
laurent
dc8bffabe0 Remove the dataset bit. 2025-07-03 12:48:04 +02:00
Laurent Mazare
5f8e924176
Streaming output for the pytorch tts example. (#33)
* Streaming output for the pytorch tts example.

* Run the pre-commit hooks.
2025-07-03 11:05:06 +02:00
Laurent Mazare
d3bed09f9a
Pin the moshi_mlx version. (#35) 2025-07-03 09:53:53 +02:00
Václav Volhejn
ef52b8ef0f
Add Rust server usage example (#32)
* Run Ruff on tts_mlx.py

* Add tts_rust_server.py example

* Remove unused HF repo arguments and reset audio output data in TTS server script
2025-07-03 09:47:50 +02:00
Laurent Mazare
d92e4c2695
Use the on_frame callback in the mlx tts example. (#34) 2025-07-03 09:29:04 +02:00
laurent
6c1e9f12cf Mention the MLX quantization. 2025-07-03 07:52:27 +02:00
laurent
236df522b8 Add some links. 2025-07-03 07:47:16 +02:00
laurent
20cf8d7365 Collapsible sections. 2025-07-03 07:43:56 +02:00
Laurent Mazare
ae575a04c6
Handle stdin in the mlx tts example. (#31) 2025-07-02 18:59:04 +02:00
Václav Volhejn
433dca3751
Shorter names for STT scripts (#30) 2025-07-02 18:21:42 +02:00
Václav Volhejn
07ac744609
Add PyTorch notebook and documentation (#29)
* Add example for PyTorch implementation

* Document PyTorch and MLX examples

* Reorganize for TTS

* Remove waitlist signup CTA
2025-07-02 17:51:27 +02:00
Václav Volhejn
96ff217437
Add Pytorch inference for TTS (#28)
* Add tts_pytorch.py

* Add attempt at interactive playback
2025-07-02 17:02:05 +02:00
Laurent Mazare
7294fbcc3a
Add the MLX TTS example. (#27)
* Add the MLX TTS example.

* Audio playback.

* Allow for playing the audio in a streaming way.
2025-07-02 16:52:26 +02:00
Eugene Kharitonov
c4ef93770a
STT example w/ prompting (#26)
* STT example w/ prompting

* Text-audio prompt example into README.md + cutting prompt transcript.

* A line in README

* formatting in README

---------

Co-authored-by: Eugene <eugene@kyutai.org>
2025-07-02 11:23:11 +02:00
Václav Volhejn
395eaeae95
Fix usage examples and a few small things (#24) 2025-07-02 08:58:45 +02:00
Jean-Jerome Levy
4985940aad
fix: update hf-hub dependency to 0.4.3 to resolve URL parsing error (#21)
- Fixes RelativeUrlWithoutBase error when downloading models from fresh cache
- Resolves issue with kyutai/stt-1b-en_fr-candle model download
- Ensures proper URL handling in Hugging Face Hub interactions
2025-06-27 13:18:27 +02:00
Václav Volhejn
0112245ef7
Refactor Rust server examples (#19)
* Rename examples and add pre-commit

* Fix references to scripts, add implementations overview

* Link to colab notebook via github

* Simplify

* Add auth note

* Allow visualizing VAD

* Remove unused variable

* Add audio samples

* Address review comments
2025-06-26 16:51:43 +02:00
Laurent Mazare
96eef33c4c
Add the pre-commit check to the CI. (#18)
* Add the pre-commit check to the CI.

* CI tweaks.

* Add the pre-commit.
2025-06-26 14:41:57 +02:00
Václav Volhejn
7b5a01dfba
Rename examples and add pre-commit (#16)
* Rename examples and add pre-commit

* Fix references to scripts, add implementations overview

* Link to colab notebook via github
2025-06-26 09:26:11 +02:00
Muhammad Ali Abbas
8bd3f59631
Added mic_test and mic_query scripts (#6)
* Added mic_test and mic_query scripts

* Cleanup the mic example, remove the test.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2025-06-25 09:17:27 +02:00
Laurent Mazare
473b179cc8
Configurable api key (#15)
* Make the api key configurable.

* Typo.
2025-06-25 08:23:05 +02:00
Laurent Mazare
2198f1d660
Add a script for using the mlx version with the built-in microphone. (#13) 2025-06-23 11:48:23 +02:00
Laurent Mazare
a3ed93d16b
Add a PR template + contributing. (#9) 2025-06-20 18:07:59 +02:00
eugene-kharitonov
ef864a6f38
Moving over STT inference scripts (#7)
* Adding links to STT example scripts

One script for HF dataset inference; another for retrieving
timestamps.

* Moving inference scripts to the delayed-streams-repo

---------

Co-authored-by: Eugene <eugene@kyutai.org>
2025-06-20 15:53:45 +02:00
laurent
dd5cbcbeef Add some moshi-swift details to the readme. 2025-06-20 08:12:12 +02:00
laurent
d7642ff1e9 Add a comment with the different time horizons. 2025-06-19 15:34:49 +02:00
laurent
d473deddaf Add a VAD example in the rust codebase. 2025-06-19 15:21:52 +02:00
laurent
3282de0559 Add a link to the model collections. 2025-06-19 14:52:01 +02:00
Václav Volhejn
5c37f42ff2
Plug mailing list (#3) 2025-06-19 14:39:37 +02:00
laurent
142a02f6da Bump the version. 2025-06-19 12:28:23 +02:00
laurent
5549b61d1c Cosmetic tweaks. 2025-06-19 12:27:19 +02:00
laurent
35c4ea47d8 Use the audio prefix in the rust inference. 2025-06-19 10:12:03 +02:00
laurent
91fb68acc4 Add timestamps to the rust example. 2025-06-19 09:57:53 +02:00