Merge pull request #75 from huggingface/doc_parallelism

Added doc for setting tokenizers level of parallelism.
2025-08-22 16:25:30 +00:00 · 2020-01-15 09:42:22 -05:00
parent 65b35385f8 11fb79cee8
commit 7d10dd0fd6
1 changed file with 7 additions and 0 deletions
--- a/tokenizers/README.md
+++ b/tokenizers/README.md
@@ -54,3 +54,10 @@ fn main() -> Result<()> {
 	Ok(())
 }
 ```
+
+## Additional information
+
+- tokenizers is designed to leverage CPU parallelism when possible. By default, the level of parallelism is determined
+by the total number of cores/threads your CPU provides, but it can be tuned by setting the `RAYON_RS_NUM_CPUS`
+environment variable. For example, setting `RAYON_RS_NUM_CPUS=4` will allocate a maximum of 4 threads.
+**_Please note that this behavior may evolve in the future._**
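
To make the note above concrete, here is a minimal sketch of how a thread limit taken from `RAYON_RS_NUM_CPUS` could be resolved, falling back to the number of cores/threads the machine reports. The helper name `resolve_thread_limit` and its fallback rules are assumptions made for illustration only; this is not the code tokenizers or Rayon actually runs, and it is not part of this commit.

```rust
use std::env;
use std::thread;

/// Hypothetical helper (not part of tokenizers or Rayon): turn the raw value
/// of an environment variable into a thread limit.
/// A missing, unparsable, or zero value falls back to the parallelism
/// reported by the standard library (or 1 if that cannot be determined).
fn resolve_thread_limit(raw: Option<&str>) -> usize {
    let fallback = thread::available_parallelism()
        .map(|n| n.get())
        .unwrap_or(1);

    match raw.and_then(|s| s.trim().parse::<usize>().ok()) {
        Some(n) if n > 0 => n,
        _ => fallback,
    }
}

fn main() {
    // Variable name taken from the README text in the diff above.
    let raw = env::var("RAYON_RS_NUM_CPUS").ok();
    let limit = resolve_thread_limit(raw.as_deref());
    println!("thread limit: {}", limit);
}
```

Under these assumptions, launching the program with `RAYON_RS_NUM_CPUS=4` set in the environment would report a limit of 4, while leaving the variable unset would report whatever the machine exposes.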