2023-12-01

Quickstart on LLMs

This simple index is meant to help you get started with LLMs. The field is evolving quickly, so parts of this index may become outdated soon.

Beginner

If you are new to LLMs, start with this video for an overview: State of GPT by Andrej Karpathy
Building LLM applications for production by Chip Huyen
Don’t teach. Incentivize

Intermediate

A Hackers’ Guide to Language Models by Jeremy Howard
1. “The best way to learn about language models is to use them” (a quote from the video), and the video teaches you to do exactly that
A comprehensive blog post on serving LLMs in production: Optimizing your LLM in production
How are LLMs trained?
1. RLHF: Reinforcement Learning from Human Feedback by Chip Huyen
How to serve LLMs in memory-constrained environments?
1. KV Cache brief explainer
2. Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More
Understanding the basics of LoRA for training LLMs in memory-constrained environments: LoRA explained (and a bit about precision and quantization)
Learn about quantization and flash attention
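
To make the memory-constrained serving resources above more concrete, here is a rough back-of-the-envelope sketch of how much memory a KV cache can take. The function name `kv_cache_bytes` and the model shape (32 layers, 32 heads, head dimension 128, fp16 values, 4096-token context) are illustrative assumptions, not figures from the linked articles. The second call assumes grouped-query attention with 8 key/value heads to show why sharing KV heads shrinks the cache.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_value=2):
    """Estimate KV cache size in bytes (hypothetical helper for illustration).

    The leading factor of 2 accounts for storing one key tensor and one
    value tensor per layer. bytes_per_value=2 assumes fp16/bf16 storage.
    """
    return (2 * n_layers * n_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_value)


GIB = 1024 ** 3

# Assumed shape: every attention head keeps its own keys and values.
full = kv_cache_bytes(n_layers=32, n_kv_heads=32, head_dim=128, seq_len=4096)

# Same assumed shape, but with grouped-query attention using 8 KV heads.
gqa = kv_cache_bytes(n_layers=32, n_kv_heads=8, head_dim=128, seq_len=4096)

print(f"multi-head attention:    {full / GIB:.2f} GiB")
print(f"grouped-query attention: {gqa / GIB:.2f} GiB")
```

Under these assumptions the cache for a single 4096-token sequence is about 2 GiB with full multi-head attention and about 0.5 GiB with 8 KV heads, and it grows linearly with batch size and context length.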

Advanced

vLLM: Serving LLMs faster