mirror of
https://github.com/mii443/tokenizers.git
synced 2025-08-23 00:35:35 +00:00
Improve readme
This commit is contained in:
21
README.md
21
README.md
@ -1,4 +1,17 @@
|
|||||||
# Tokenizers
|
<p align="center">
|
||||||
|
<br>
|
||||||
|
<img src="https://huggingface.co/landing/assets/tokenizers/tokenizers-logo.png" width="600"/>
|
||||||
|
<br>
|
||||||
|
<p>
|
||||||
|
<p align="center">
|
||||||
|
<a href="https://circleci.com/gh/huggingface/transformers">
|
||||||
|
<img alt="Build" src="https://github.com/huggingface/tokenizers/workflows/Rust/badge.svg">
|
||||||
|
</a>
|
||||||
|
<a href="https://github.com/huggingface/tokenizers/blob/master/LICENSE">
|
||||||
|
<img alt="GitHub" src="https://img.shields.io/github/license/huggingface/tokenizers.svg?color=blue">
|
||||||
|
</a>
|
||||||
|
</p>
|
||||||
|
|
||||||
|
|
||||||
Provides an implementation of today's most used tokenizers, with a focus on performance and
|
Provides an implementation of today's most used tokenizers, with a focus on performance and
|
||||||
versatility.
|
versatility.
|
||||||
@ -14,6 +27,12 @@ versatility.
|
|||||||
original sentence that corresponds to a given token.
|
original sentence that corresponds to a given token.
|
||||||
- Does all the pre-processing: Truncate, Pad, add the special tokens your model needs.
|
- Does all the pre-processing: Truncate, Pad, add the special tokens your model needs.
|
||||||
|
|
||||||
|
<p align="center">
|
||||||
|
<br>
|
||||||
|
<img src="https://huggingface.co/landing/assets/tokenizers/tokenizers-repo-example.png" />
|
||||||
|
<br>
|
||||||
|
<p>
|
||||||
|
|
||||||
## Bindings
|
## Bindings
|
||||||
|
|
||||||
We provide bindings to the following languages (more to come!):
|
We provide bindings to the following languages (more to come!):
|
||||||
|
Reference in New Issue
Block a user