This is the 5th video in the Introduction to LLMs series. Check out the Table of Contents for more information.
Word-level vs Character-level vs Subword-level embeddings
The Byte Pair Encoding Strategy
Special Tokens
The Hugging Face Tokenizer
Visualizing the Attentions with the Padding Token