Compare commits
5 Commits
main
...
give_uv_in
| Author | SHA1 | Date | |
|---|---|---|---|
|
|
403db09953 | ||
|
|
332b2b9daa | ||
|
|
7c9953187a | ||
|
|
6247aee904 | ||
|
|
e202e4bb0a |
4
.gitignore
vendored
4
.gitignore
vendored
|
|
@ -191,4 +191,6 @@ cython_debug/
|
||||||
# exclude from AI features like autocomplete and code analysis. Recommended for sensitive data
|
# exclude from AI features like autocomplete and code analysis. Recommended for sensitive data
|
||||||
# refer to https://docs.cursor.com/context/ignore-files
|
# refer to https://docs.cursor.com/context/ignore-files
|
||||||
.cursorignore
|
.cursorignore
|
||||||
.cursorindexingignore
|
.cursorindexingignore
|
||||||
|
bria.mp3
|
||||||
|
sample_fr_hibiki_crepes.mp3
|
||||||
|
|
|
||||||
17
README.md
17
README.md
|
|
@ -36,6 +36,12 @@ with version 0.2.5 or later, which can be installed via pip.
|
||||||
python -m moshi.run_inference --hf-repo kyutai/stt-2.6b-en bria.mp3
|
python -m moshi.run_inference --hf-repo kyutai/stt-2.6b-en bria.mp3
|
||||||
```
|
```
|
||||||
|
|
||||||
|
If you have `uv` installed, you can skip the installation step and run directly:
|
||||||
|
```bash
|
||||||
|
uvx --with moshi python -m moshi.run_inference --hf-repo kyutai/stt-2.6b-en bria.mp3
|
||||||
|
```
|
||||||
|
It will install the moshi package in a temporary environment and run the speech-to-text.
|
||||||
|
|
||||||
### MLX implementation
|
### MLX implementation
|
||||||
<a href="https://huggingface.co/kyutai/stt-2.6b-en-mlx" target="_blank" style="margin: 2px;">
|
<a href="https://huggingface.co/kyutai/stt-2.6b-en-mlx" target="_blank" style="margin: 2px;">
|
||||||
<img alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue" style="display: inline-block; vertical-align: middle;"/>
|
<img alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue" style="display: inline-block; vertical-align: middle;"/>
|
||||||
|
|
@ -48,6 +54,12 @@ with version 0.2.5 or later, which can be installed via pip.
|
||||||
python -m moshi_mlx.run_inference --hf-repo kyutai/stt-2.6b-en-mlx bria.mp3 --temp 0
|
python -m moshi_mlx.run_inference --hf-repo kyutai/stt-2.6b-en-mlx bria.mp3 --temp 0
|
||||||
```
|
```
|
||||||
|
|
||||||
|
If you have `uv` installed, you can skip the installation step and run directly:
|
||||||
|
```bash
|
||||||
|
uvx --with moshi-mlx python -m moshi_mlx.run_inference --hf-repo kyutai/stt-2.6b-en-mlx bria.mp3 --temp 0
|
||||||
|
```
|
||||||
|
It will install the moshi package in a temporary environment and run the speech-to-text.
|
||||||
|
|
||||||
### Rust implementation
|
### Rust implementation
|
||||||
<a href="https://huggingface.co/kyutai/stt-2.6b-en-candle" target="_blank" style="margin: 2px;">
|
<a href="https://huggingface.co/kyutai/stt-2.6b-en-candle" target="_blank" style="margin: 2px;">
|
||||||
<img alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue" style="display: inline-block; vertical-align: middle;"/>
|
<img alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue" style="display: inline-block; vertical-align: middle;"/>
|
||||||
|
|
@ -91,8 +103,9 @@ script.
|
||||||
uv run scripts/asr-streaming-query.py bria.mp3
|
uv run scripts/asr-streaming-query.py bria.mp3
|
||||||
```
|
```
|
||||||
|
|
||||||
The script simulates some real-time processing of the audio. Faster processing
|
The script limits the decoding speed to simulates real-time processing of the audio.
|
||||||
can be triggered by setting the real-time factor, e.g. `--rtf 500` will process
|
Faster processing can be triggered by setting
|
||||||
|
the real-time factor, e.g. `--rtf 500` will process
|
||||||
the data as fast as possible.
|
the data as fast as possible.
|
||||||
|
|
||||||
## Text-to-Speech
|
## Text-to-Speech
|
||||||
|
|
|
||||||
Loading…
Reference in New Issue
Block a user