This is the 5th video in the Introduction to LLMs series. Check out the Table of Contents for more information.
Word-level vs Character-level vs Subword level embeddings
The Byte Pair Encoding Strategy
The Hugging Face Tokenizer
Visualizing the Attentions with the Padding Tokken
Here are the latest articles you may have missed: