The tokenization pipeline ==================================================================================================== TODO: Describe the tokenization pipeline: - Normalization - Pre-tokenization - Tokenization - Post-processing - Decoding