Files
tokenizers/docs/source/pipeline.rst
2020-11-02 17:07:27 -05:00

11 lines
250 B
ReStructuredText

The tokenization pipeline
====================================================================================================
TODO: Describe the tokenization pipeline:
- Normalization
- Pre-tokenization
- Tokenization
- Post-processing
- Decoding