Alexandre Défossez
0b5a5fbed2
plop
2025-07-03 14:55:24 +02:00
Alexandre Défossez
31f8746881
some changes
2025-07-03 14:52:02 +02:00
Alexandre Défossez
b2416b19dd
changing streaming to be robust to repeated generation
2025-07-03 14:38:25 +02:00
Václav Volhejn
c1d248abba
Fix text tokenizer path ( #36 )
2025-07-03 14:27:06 +02:00
Václav Volhejn
c6f262346f
Don't install moshi from Git ( #37 )
...
* Don't install moshi from Git
* Remove commented-out invalid message send in websocket_client
2025-07-03 13:37:38 +02:00
laurent
3573ee90af
Oops.
2025-07-03 13:08:00 +02:00
laurent
25574aa104
Fixes for the notebook.
2025-07-03 13:05:00 +02:00
laurent
1cd9529f65
Json fix.
2025-07-03 12:57:22 +02:00
laurent
0ee2354176
Chunk decoding in the pth notebook.
2025-07-03 12:56:00 +02:00
laurent
dc8bffabe0
Remove the dataset bit.
2025-07-03 12:48:04 +02:00
Laurent Mazare
5f8e924176
Streaming output for the pytorch tts example. ( #33 )
...
* Streaming output for the pytorch tts example.
* Run the pre-commit hooks.
2025-07-03 11:05:06 +02:00
Laurent Mazare
d3bed09f9a
Pin the moshi_mlx version. ( #35 )
2025-07-03 09:53:53 +02:00
Václav Volhejn
ef52b8ef0f
Add Rust server usage example ( #32 )
...
* Run Ruff on tts_mlx.py
* Add tts_rust_server.py example
* Remove unused HF repo arguments and reset audio output data in TTS server script
2025-07-03 09:47:50 +02:00
Laurent Mazare
d92e4c2695
Use the on_frame callback in the mlx tts example. ( #34 )
2025-07-03 09:29:04 +02:00
laurent
6c1e9f12cf
Mention the MLX quantization.
2025-07-03 07:52:27 +02:00
laurent
236df522b8
Add some links.
2025-07-03 07:47:16 +02:00
laurent
20cf8d7365
Collapsible sections.
2025-07-03 07:43:56 +02:00
Laurent Mazare
ae575a04c6
Handle stdin in the mlx tts example. ( #31 )
2025-07-02 18:59:04 +02:00
Václav Volhejn
433dca3751
Shorter names for STT scripts ( #30 )
2025-07-02 18:21:42 +02:00
Václav Volhejn
07ac744609
Add PyTorch notebook and documentation ( #29 )
...
* Add example for PyTorch implementation
* Document PyTorch and MLX examples
* Reorganize for TTS
* Remove waitlist signup CTA
2025-07-02 17:51:27 +02:00
Václav Volhejn
96ff217437
Add Pytorch inference for TTS ( #28 )
...
* Add tts_pytorch.py
* Add attempt at interactive playback
2025-07-02 17:02:05 +02:00
Laurent Mazare
7294fbcc3a
Add the MLX TTS example. ( #27 )
...
* Add the MLX TTS example.
* Audio playback.
* Allow for playing the audio in a streaming way.
2025-07-02 16:52:26 +02:00
Eugene Kharitonov
c4ef93770a
STT example w/ prompting ( #26 )
...
* STT example w/ prompting
* Text-audio prompt example into README.md + cutting prompt transcript.
* A line in README
* formatting in README
---------
Co-authored-by: Eugene <eugene@kyutai.org>
2025-07-02 11:23:11 +02:00
Václav Volhejn
395eaeae95
Fix usage examples and a few small things ( #24 )
2025-07-02 08:58:45 +02:00
Jean-Jerome Levy
4985940aad
fix: update hf-hub dependency to 0.4.3 to resolve URL parsing error ( #21 )
...
- Fixes RelativeUrlWithoutBase error when downloading models from fresh cache
- Resolves issue with kyutai/stt-1b-en_fr-candle model download
- Ensures proper URL handling in Hugging Face Hub interactions
2025-06-27 13:18:27 +02:00
Václav Volhejn
0112245ef7
Refactor Rust server examples ( #19 )
...
* Rename examples and add pre-commit
* Fix references to scripts, add implementations overview
* Link to colab notebook via github
* Simplify
* Add auth note
* Allow visualizing VAD
* Remove unused variable
* Add audio samples
* Address review comments
2025-06-26 16:51:43 +02:00
Laurent Mazare
96eef33c4c
Add the pre-commit check to the CI. ( #18 )
...
* Add the pre-commit check to the CI.
* CI tweaks.
* Add the pre-commit.
2025-06-26 14:41:57 +02:00
Václav Volhejn
7b5a01dfba
Rename examples and add pre-commit ( #16 )
...
* Rename examples and add pre-commit
* Fix references to scripts, add implementations overview
* Link to colab notebook via github
2025-06-26 09:26:11 +02:00
Muhammad Ali Abbas
8bd3f59631
Added mic_test and mic_query scripts ( #6 )
...
* Added mic_test and mic_query scripts
* Cleanup the mic example, remove the test.
---------
Co-authored-by: Laurent <laurent.mazare@gmail.com>
2025-06-25 09:17:27 +02:00
Laurent Mazare
473b179cc8
Configurable api key ( #15 )
...
* Make the api key configurable.
* Typo.
2025-06-25 08:23:05 +02:00
Laurent Mazare
2198f1d660
Add a script for using the mlx version with the built-in microphone. ( #13 )
2025-06-23 11:48:23 +02:00
Laurent Mazare
a3ed93d16b
Add a PR template + contributing. ( #9 )
2025-06-20 18:07:59 +02:00
eugene-kharitonov
ef864a6f38
Moving over STT inference scripts ( #7 )
...
* Adding links to STT example scripts
One script for HF dataset inference; another for retrieving
timestamps.
* Moving inference scripts to the delayed-streams-repo
---------
Co-authored-by: Eugene <eugene@kyutai.org>
2025-06-20 15:53:45 +02:00
laurent
dd5cbcbeef
Add some moshi-swift details to the readme.
2025-06-20 08:12:12 +02:00
laurent
d7642ff1e9
Add a comment with the different time horizons.
2025-06-19 15:34:49 +02:00
laurent
d473deddaf
Add a VAD example in the rust codebase.
2025-06-19 15:21:52 +02:00
laurent
3282de0559
Add a link to the model collections.
2025-06-19 14:52:01 +02:00
Václav Volhejn
5c37f42ff2
Plug mailing list ( #3 )
2025-06-19 14:39:37 +02:00
laurent
142a02f6da
Bump the version.
2025-06-19 12:28:23 +02:00
laurent
5549b61d1c
Cosmetic tweaks.
2025-06-19 12:27:19 +02:00
laurent
35c4ea47d8
Use the audio prefix in the rust inference.
2025-06-19 10:12:03 +02:00
laurent
91fb68acc4
Add timestamps to the rust example.
2025-06-19 09:57:53 +02:00
Václav Volhejn
957edae092
Make a pass over the Readme ( #2 )
2025-06-19 09:37:48 +02:00
laurent
1b362905f9
Tweaks.
2025-06-19 08:52:48 +02:00
Gabriel de Marmiesse
6f4ef1eae8
Add uv instructions and ignore the sample audio files ( #1 )
...
* Add uv instructions and ignore the sample audio file
* Add french sample
* Clarify real-time
* Remove empty space
2025-06-18 12:45:33 +02:00
laurent
de8202bddc
Merge remote-tracking branch 'refs/remotes/origin/main'
2025-06-18 12:32:21 +02:00
laurent
161fc6fc3b
Minor tweaks.
2025-06-18 12:32:14 +02:00
Laurent Mazare
954e0ad45a
Update config-stt-hf.toml
2025-06-18 12:20:23 +02:00
laurent
e22cb0d59c
Tweaks.
2025-06-18 11:39:27 +02:00
laurent
8889177f19
Use some badges.
2025-06-18 11:37:32 +02:00