Lab 2: Build a Small Language Model
llms
neural-nets
Build a small language model from scratch to understand how LLMs work.
Overview
Go deeper into Code 3.0 by building a small language model from scratch. Following Andrej Karpathy’s build-it-yourself approach, you’ll implement tokenization, embeddings, and a transformer-style architecture to see how large language models work from first principles.
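To make the first two pieces concrete, here is a minimal sketch of character-level tokenization followed by an embedding lookup. It is illustrative only and not the lab's reference solution: the toy corpus, the names `stoi`, `itos`, `encode`, `decode`, and the embedding size of 4 are all assumptions, and the embedding table is filled with random numbers rather than learned values.

```python
import random

text = "hello world"  # toy corpus, chosen only for illustration

# Character-level vocabulary: one integer id per distinct character.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # string -> id
itos = {i: ch for ch, i in stoi.items()}      # id -> string


def encode(s):
    """Map a string to a list of token ids."""
    return [stoi[ch] for ch in s]


def decode(ids):
    """Map a list of token ids back to a string."""
    return "".join(itos[i] for i in ids)


# Embedding table: one small vector per token id.
# Random here; in a trained model these values are learned.
rng = random.Random(0)
embed_dim = 4  # assumed size, kept tiny for readability
embedding = [[rng.uniform(-1.0, 1.0) for _ in range(embed_dim)] for _ in chars]

ids = encode("hello")
vectors = [embedding[i] for i in ids]  # an embedding lookup is just row indexing
```

Note that the two `l` characters in `hello` map to the same id and therefore to the same vector. Telling repeated tokens apart by position is left to later parts of the architecture.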
Learning Objectives
- Implement tokenization and text encoding
- Build embeddings and understand their role in language models
- Construct and train a small transformer-based model
- Generate text from your trained model
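The core of a transformer-style model is causal self-attention, where each position mixes information from itself and earlier positions only. The sketch below shows that one operation in plain Python under simplifying assumptions: it uses a single head, it reuses the input vectors directly as queries, keys, and values instead of applying learned projections, and the function name `causal_self_attention` is hypothetical. A real model would add learned weight matrices, multiple heads, and training.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(x):
    """Single-head attention over a list of T vectors of dimension d.

    Simplification: x itself serves as queries, keys, and values
    (no learned projections).
    """
    d = len(x[0])
    out = []
    for t, q in enumerate(x):
        # Causal mask: position t may only attend to positions 0..t.
        scores = [
            sum(qi * ki for qi, ki in zip(q, x[s])) / math.sqrt(d)
            for s in range(t + 1)
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the visible vectors.
        out.append([sum(w * x[s][j] for s, w in enumerate(weights)) for j in range(d)])
    return out


x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three toy token vectors
y = causal_self_attention(x)
```

The first output row equals the first input row, because position 0 can only attend to itself. Later rows are weighted averages of everything up to and including their own position, which is what lets the model generate text one token at a time without seeing the future.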
Tasks
To be determined.