Skip to main content

Large Language Model

Resources

  • Understanding large language models (HN)
    • Top 10-ish papers to understand the design, constraints and evolution of LLMs
    • Development of LLMs: Attention weighted encodings, transformer, BERT, GPT, BART
    • Improving the efficiency of LLMs: FlashAttention, Cramming, finetuning methods, Chinchilla model, InstructGPT, and more on reinforcement learning with human feedback (RLHF)
  • What we know about LLMs (Primer) | Will Thompson
    • A simple explainer of what is considered an LLM, what we knew about LLMs and what are the ongoing research
    • Includes a lot of links to other resources. A few concepts introduced include LLMs' capability to generalize knowledge, power law in LLMs' performance, reinforcement learning via human feedback (RLHF), etc.
  • LLM Visualization
    • 3D graphics visualizing parameters of a LLM model at each stage from tokenization to the output
  • LLM Course | GitHub @mlabonne
    • Resources from mathematics, to Python, to neural networks, to NLP
  • Spreadsheets are all you need (HN)
    • Understand GPT with Excel Spreadsheet