Sunday, December 21, 2025

2 Types of Large Language Models (LLM)

Base LLM:

  • Predicts the next word, based on text training data.


Instruction Tuned LLM:

  • Tries to follow instructions.
  • Fine-tune on instructions and good attempts at following those instructions.
  • RLHF: Reinforcement Learning with Human Feedback
  • Trained to be helpful, honest and harmless.

No comments:

Chapter 1: Types and Components of Computer Systems

Dialogue-based interface: No need for a driver to take their hands off the steering wheel. It's useful for people with disabilities. It ...