Sunday, December 21, 2025

Two Types of Large Language Models (LLMs)

Base LLM:

  • Predicts the next word, based on text training data.
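As a rough illustration of next-word prediction, the sketch below counts which word follows which in a tiny made-up corpus and predicts the most frequent follower. This is a toy bigram model, not how a real LLM works internally (real models use neural networks over tokens); the function names `train_bigram` and `predict_next` and the corpus are assumptions made up for this example.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Count, for every word, how often each other word follows it."""
    words = text.lower().split()
    counts = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical training text, standing in for "text training data".
corpus = "the cat sat on the mat . the cat ate the fish ."
model = train_bigram(corpus)

print(predict_next(model, "the"))  # prints "cat" ("cat" follows "the" twice)
print(predict_next(model, "sat"))  # prints "on"
```

The model only continues text with what it has seen most often; it has no notion of following an instruction, which is the gap instruction tuning is meant to close.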


Instruction Tuned LLM:

  • Tries to follow instructions.
  • Fine-tuned on instructions and good attempts at following those instructions.
  • Often further refined with RLHF: Reinforcement Learning from Human Feedback.
  • Trained to be helpful, honest and harmless.
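The fine-tuning data described above can be pictured as pairs of an instruction and a good attempt at following it. The sketch below shows one possible way to turn such pairs into training text. The template, the function name `build_training_text`, and the example pair are all assumptions for illustration; actual instruction-tuned models use their own formats.

```python
# Hypothetical prompt template; real models define their own conventions.
PROMPT_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n"


def build_training_text(instruction, response):
    """Return (prompt, full_text) for one instruction/response pair.

    The model is fine-tuned to produce `response` when given `prompt`,
    so the full training text is the prompt followed by the response.
    """
    prompt = PROMPT_TEMPLATE.format(instruction=instruction)
    return prompt, prompt + response


# Made-up example of an instruction and a good attempt at following it.
examples = [
    {
        "instruction": "What is the capital of France?",
        "response": "The capital of France is Paris.",
    },
]

for ex in examples:
    prompt, full_text = build_training_text(ex["instruction"], ex["response"])
    print(full_text)
```

Training is still next-word prediction, but on text shaped like this, so the model learns to answer the instruction rather than merely continue it.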
