..

Quickstart on LLMs

  1. Quickstart on LLMs
    1. Beginner
    2. Intermediate
    3. Advanced

Quickstart on LLMs

This simple index is aimed to help one get started in the domain of LLMs. Obviously, the area is continuously evolving so this index may become irrelevant soon.

Beginner

  1. If new to LLMs, start with this video to get an overview: State of GTP by Andrej Karpathy
  2. Building LLM applications for production by Chip Huyen

Intermediate

  1. A Hackers’ Guide to Language Models by Jeremy Howard
    1. “The best way to learn about language models is to use them” from the video. And this video will teach you to exactly do that
  2. Master blog of how to serve LLMs in production: Optimizing your LLM in production
  3. How LLMs are trained?
    1. RLHF: Reinforcement Learning from Human Feedback by Chip Huyen
  4. How to serve LLMs in memory-constrained environments?
    1. KV Cache brief explainer
    2. Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More
  5. Understanding basics of LoRa for training LLMs in memory-constrained environments: LoRA explained (and a bit about precision and quantization)
  6. Learn about quantization and flash attention

Advanced

  1. vLLMs: Serving LLMs faster