This commit is contained in:
laurent 2025-07-03 19:02:42 +02:00
parent 99599fa408
commit f9739881e6

View File

@ -116,7 +116,7 @@ However, please bear in mind that is an experimental feature and its behavior is
</a> </a>
The Rust implementation provides a server that can process multiple streaming The Rust implementation provides a server that can process multiple streaming
queries in parallel. Dependening on the amount of memory on your GPU, you may queries in parallel. Depending on the amount of memory on your GPU, you may
have to adjust the batch size from the config file. For a L40S GPU, a batch size have to adjust the batch size from the config file. For a L40S GPU, a batch size
of 64 works well and requests can be processed at 3x real-time speed. of 64 works well and requests can be processed at 3x real-time speed.