Quick index of official model cards / system cards for major frontier (closed) and open(-weights) model families. Each entry includes the standard BibTeX citation following our BibTeX guide. Models are grouped by provider, and different model generations have separate entries. When citing a foundation model in a paper (e.g., as a baseline or backbone), use the canonical citation from this page — see the paper writing guide for more details.
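As a worked example of the intended workflow (the file name `references.bib` below is illustrative, and `natbib` is just one of several citation packages you could use), copy an entry from this page into your bibliography file and cite it by its key:

```latex
% Preamble (natbib shown; biblatex works analogously)
\usepackage{natbib}

% In the body: cite the canonical entry for each model you use
We use GPT-4 \citep{openai2023gpt4} as the backbone and compare
against Llama 2 \citep{touvron2023llama2} as an open-weights baseline.

% At the end of the document
\bibliographystyle{plainnat}
\bibliography{references}  % entries copied from this page live here
```

Entries with an `eprint`/`archivePrefix` pair render as arXiv citations; web-only entries (model cards and blog posts) instead carry a `url` and an access-date `note`.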
Foundation Models
OpenAI
GPT-3 — Language Models are Few-Shot Learners
Scaled autoregressive LM to 175B parameters, demonstrating strong few-shot performance across NLP tasks. Seminal because it revealed emergent in-context learning from scale, shifting the NLP paradigm from fine-tuning to prompting
@inproceedings{brown2020language,
title = {{Language Models are Few-Shot Learners}},
author = {Brown, T. and Mann, B. and Ryder, N. and Subbiah, M. and Kaplan, J. and Dhariwal, P. and others},
booktitle = {{Advances in Neural Information Processing Systems (NeurIPS)}},
year = {2020}
}
InstructGPT — Training Language Models to Follow Instructions with Human Feedback
Introduced RLHF to align GPT-3 with human intent, forming the basis for ChatGPT. Seminal because it established the instruction-following alignment paradigm adopted by virtually every subsequent chat model
@inproceedings{ouyang2022training,
title = {{Training Language Models to Follow Instructions with Human Feedback}},
author = {Ouyang, L. and Wu, J. and Jiang, X. and Almeida, D. and Wainwright, C. and Mishkin, P. and others},
booktitle = {{Advances in Neural Information Processing Systems (NeurIPS)}},
year = {2022}
}
GPT-4 — GPT-4 Technical Report
Multimodal large language model achieving human-level performance on professional and academic benchmarks. Seminal because it defined the frontier multimodal LLM benchmark and catalyzed mainstream enterprise and developer adoption of LLMs
@misc{openai2023gpt4,
title = {{GPT-4 Technical Report}},
author = {OpenAI},
year = {2023},
eprint = {2303.08774},
archivePrefix = {arXiv}
}
GPT-4o — GPT-4o System Card
Omnimodal model natively processing text, audio, image, and video; also covers GPT-4o mini
@misc{openai2024gpt4o,
title = {{GPT-4o System Card}},
author = {OpenAI},
year = {2024},
eprint = {2410.21276},
archivePrefix = {arXiv}
}
o1 — OpenAI o1 System Card
Large reasoning model (LRM) trained with large-scale reinforcement learning to perform an extended chain of thought before responding
@misc{openai2024o1,
title = {{OpenAI o1 System Card}},
author = {OpenAI},
year = {2024},
eprint = {2412.16720},
archivePrefix = {arXiv}
}
o3-mini — OpenAI o3-mini System Card
Smaller reasoning model with adjustable reasoning effort; no arXiv paper
@misc{openai2025o3mini,
title = {{OpenAI o3-mini System Card}},
author = {OpenAI},
year = {2025},
url = {https://openai.com/index/o3-mini-system-card/},
note = {Accessed February 9, 2026}
}
GPT-4.5 — GPT-4.5 System Card
OpenAI's largest pre-trained model, focused on broad knowledge and reduced hallucinations; a research preview emphasizing the scale of unsupervised learning
@misc{openai2025gpt45,
title = {{GPT-4.5 System Card}},
author = {OpenAI},
year = {2025},
url = {https://openai.com/index/gpt-4-5-system-card/},
note = {Accessed February 9, 2026}
}
o3 — OpenAI o3 and o4-mini System Card
OpenAI's most powerful reasoning model at release, with full tool use (browsing, code, images); first system card published under the Preparedness Framework v2
@misc{openai2025o3,
title = {{OpenAI o3 and o4-mini System Card}},
author = {OpenAI},
year = {2025},
url = {https://openai.com/index/o3-o4-mini-system-card/},
note = {Accessed February 9, 2026}
}
o4-mini — OpenAI o3 and o4-mini System Card
Cost-efficient reasoning model excelling at math and coding; achieves 99.5% on AIME 2025 with tool use
@misc{openai2025o4mini,
title = {{OpenAI o3 and o4-mini System Card}},
author = {OpenAI},
year = {2025},
url = {https://openai.com/index/o3-o4-mini-system-card/},
note = {Accessed February 9, 2026}
}
GPT-5 — GPT-5 System Card
Unified system with a real-time router dispatching across fast (main) and deep-reasoning (thinking) sub-models, replacing GPT-4o and o3; significant reduction in hallucinations and sycophancy
@misc{openai2025gpt5,
title = {{GPT-5 System Card}},
author = {OpenAI},
year = {2025},
eprint = {2601.03267},
archivePrefix = {arXiv}
}
GPT-5.2 — GPT-5.2 System Card
OpenAI's most capable model for professional knowledge work; achieves 100% on AIME 2025 competition math
@misc{openai2025gpt52,
title = {{GPT-5.2 System Card}},
author = {OpenAI},
year = {2025},
url = {https://openai.com/index/introducing-gpt-5-2/},
note = {Accessed February 9, 2026}
}
Meta
LLaMA — LLaMA: Open and Efficient Foundation Language Models
Open-weights LLM family (7B--65B) competitive with much larger proprietary models. Seminal because it kicked off the open-weights movement, enabling the entire ecosystem of community fine-tuning and open LLM research
@misc{touvron2023llama,
title = {{LLaMA: Open and Efficient Foundation Language Models}},
author = {Touvron, H. and Lavril, T. and Izacard, G. and Martinet, X. and Lachaux, M. and Lacroix, T. and others},
year = {2023},
eprint = {2302.13971},
archivePrefix = {arXiv}
}
Llama 2 — Llama 2: Open Foundation and Fine-Tuned Chat Models
Open-weights 7B--70B models with RLHF chat variants, widely adopted for fine-tuning
@misc{touvron2023llama2,
title = {{Llama 2: Open Foundation and Fine-Tuned Chat Models}},
author = {Touvron, H. and Martin, L. and Stone, K. and Albert, P. and Almahairi, A. and Babaei, Y. and others},
year = {2023},
eprint = {2307.09288},
archivePrefix = {arXiv}
}
Llama 3.1 — The Llama 3 Herd of Models
Canonical paper for the Llama 3 family (8B, 70B, 405B); covers pre-training, post-training, multimodal, and safety
@misc{grattafiori2024llama,
title = {{The Llama 3 Herd of Models}},
author = {Grattafiori, A. and Dubey, A. and Jauhri, A. and Pandey, A. and Kadian, A. and Al-Dahle, A. and others},
year = {2024},
eprint = {2407.21783},
archivePrefix = {arXiv}
}
Llama 3.2 — Llama 3.2 Model Card
Lightweight text models (1B, 3B) and multimodal vision-language models (11B, 90B); no standalone paper, cite the Herd paper or model card
@misc{meta2024llama32,
title = {{Llama 3.2: Revolutionizing Edge AI and Vision with Open, Customizable Models}},
author = {{AI@Meta}},
year = {2024},
url = {https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/},
note = {Accessed February 9, 2026}
}
Llama 3.3 — Llama 3.3 Model Card
70B instruction-tuned model matching Llama 3.1 405B quality at lower cost; no standalone paper
@misc{meta2024llama33,
title = {{Llama 3.3 Model Card}},
author = {{AI@Meta}},
year = {2024},
url = {https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md},
note = {Accessed February 9, 2026}
}
Llama 4 — Llama 4 Model Card
Natively multimodal MoE models (Scout 17Bx16E, Maverick 17Bx128E); no technical paper published
@misc{meta2025llama4,
title = {{Llama 4 Model Card}},
author = {{AI@Meta}},
year = {2025},
url = {https://github.com/meta-llama/llama-models/blob/main/models/llama4/MODEL_CARD.md},
note = {Accessed February 9, 2026}
}
Anthropic
Claude 3 — The Claude 3 Model Family: Opus, Sonnet, Haiku
Three-tier multimodal model family achieving state-of-the-art on GPQA, MMLU, and MMMU
@misc{anthropic2024claude3,
title = {{The Claude 3 Model Family: Opus, Sonnet, Haiku}},
author = {Anthropic},
year = {2024},
url = {https://www.anthropic.com/news/claude-3-family},
note = {Accessed February 9, 2026}
}
Claude 3.5 — The Claude 3.5 Model Family Addendum
Updated Claude 3 model card with Claude 3.5 Sonnet and Haiku evaluations
@misc{anthropic2024claude35,
title = {{The Claude 3.5 Model Family Addendum}},
author = {Anthropic},
year = {2024},
url = {https://www.anthropic.com/news/claude-3-5-sonnet},
note = {Accessed February 9, 2026}
}
Claude 3.7 Sonnet — Claude 3.7 Sonnet System Card
First hybrid reasoning model from Anthropic with configurable extended thinking (up to 128K tokens); visible chain-of-thought and dual-mode operation
@misc{anthropic2025claude37,
title = {{Claude 3.7 Sonnet System Card}},
author = {Anthropic},
year = {2025},
url = {https://www.anthropic.com/claude-3-7-sonnet-system-card},
note = {Accessed February 9, 2026}
}
Claude Opus 4 — Claude 4 System Card
Anthropic's most powerful model at release, capable of autonomous multi-hour workflows; deployed under the AI Safety Level 3 Standard
@misc{anthropic2025opus4,
title = {{Claude 4 System Card}},
author = {Anthropic},
year = {2025},
url = {https://www.anthropic.com/claude-4-system-card},
note = {Accessed February 9, 2026}
}
Claude Sonnet 4 — Claude 4 System Card
General-purpose successor to Claude 3.7 Sonnet with improved coding and hybrid thinking; deployed under the AI Safety Level 2 Standard
@misc{anthropic2025sonnet4,
title = {{Claude 4 System Card}},
author = {Anthropic},
year = {2025},
url = {https://www.anthropic.com/claude-4-system-card},
note = {Accessed February 9, 2026}
}
Claude Opus 4.5 — Claude Opus 4.5 System Card
State-of-the-art for coding, agents, and computer use; strong at real-world software engineering, deep research, and agentic workflows
@misc{anthropic2025opus45,
title = {{Claude Opus 4.5 System Card}},
author = {Anthropic},
year = {2025},
url = {https://www.anthropic.com/claude-opus-4-5-system-card},
note = {Accessed February 9, 2026}
}
Google / DeepMind
PaLM 2 — PaLM 2 Technical Report
Compute-optimal multilingual model powering Bard/Gemini with improved reasoning and coding
@misc{anil2023palm,
title = {{PaLM 2 Technical Report}},
author = {Anil, R. and Dai, A. and Firat, O. and Johnson, M. and Lepikhin, D. and Passos, A. and others},
year = {2023},
eprint = {2305.10403},
archivePrefix = {arXiv}
}
Gemini 1.0 — Gemini: A Family of Highly Capable Multimodal Models
First natively multimodal frontier model family (Ultra, Pro, Nano). Seminal because it pioneered training multimodality from scratch rather than bolting vision onto a text model, setting the direction for the field
@misc{geminiteam2023gemini,
title = {{Gemini: A Family of Highly Capable Multimodal Models}},
author = {{Gemini Team} and Anil, R. and Borgeaud, S. and Alayrac, J. and Yu, J. and Soricut, R. and others},
year = {2023},
eprint = {2312.11805},
archivePrefix = {arXiv}
}
Gemini 1.5 — Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context
Long-context MoE model supporting up to 10M tokens with near-perfect recall
@misc{geminiteam2024gemini15,
title = {{Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context}},
author = {{Gemini Team} and Reid, M. and Savinov, N. and Teplyashin, D. and Lepikhin, D. and Lillicrap, T. and others},
year = {2024},
eprint = {2403.05530},
archivePrefix = {arXiv}
}
Gemini 2.0 — Gemini 2.0 Blog
Agentic multimodal model with native tool use and multimodal output; no standalone technical report
@misc{google2024gemini2,
title = {{Gemini 2.0: Our New AI Model for the Agentic Era}},
author = {{Google DeepMind}},
year = {2024},
url = {https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/},
note = {Accessed February 9, 2026}
}
Gemma — Gemma: Open Models Based on Gemini Research and Technology
Open-weights 2B/7B models derived from Gemini research
@misc{gemmateam2024gemma,
title = {{Gemma: Open Models Based on Gemini Research and Technology}},
author = {{Gemma Team} and Mesnard, T. and Hardin, C. and Dadashi, R. and Bhupatiraju, S. and Pathak, S. and others},
year = {2024},
eprint = {2403.08295},
archivePrefix = {arXiv}
}
Gemma 2 — Gemma 2: Improving Open Language Models at a Practical Size
Knowledge-distilled 2B/9B/27B models with improved efficiency
@misc{gemmateam2024gemma2,
title = {{Gemma 2: Improving Open Language Models at a Practical Size}},
author = {{Gemma Team} and Riviere, M. and Pathak, S. and Sessa, P. and Hardin, C. and Bhupatiraju, S. and others},
year = {2024},
eprint = {2408.00118},
archivePrefix = {arXiv}
}
Gemma 3 — Gemma 3 Technical Report
Multimodal 1B--27B models with 128K context, hybrid attention, and vision understanding
@misc{gemmateam2025gemma3,
title = {{Gemma 3 Technical Report}},
author = {{Gemma Team} and Kamath, A. and Ferret, J. and Pathak, S. and Vieillard, N. and Ramé, A. and others},
year = {2025},
eprint = {2503.19786},
archivePrefix = {arXiv}
}
Gemini 2.5 Pro — Gemini 2.5 Technical Report
State-of-the-art thinking model with sparse MoE architecture, 1M token context, and native multimodal support; excels at coding, reasoning, and complex multi-source problems
@misc{geminiteam2025gemini25,
title = {{Gemini 2.5 Technical Report}},
author = {{Gemini Team} and others},
year = {2025},
eprint = {2507.06261},
archivePrefix = {arXiv}
}
Gemini 2.5 Flash — Gemini 2.5 Flash Model Card
Hybrid reasoning model with controllable thinking budget for cost-efficient deployment; part of the 2.5 family
@misc{geminiteam2025gemini25flash,
title = {{Gemini 2.5 Flash Model Card}},
author = {{Google DeepMind}},
year = {2025},
url = {https://blog.google/products/gemini/gemini-2-5-model-family-expands/},
note = {Accessed February 9, 2026}
}
DeepSeek
DeepSeek-V2 — DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
236B MoE model (21B active) with Multi-head Latent Attention for efficient inference
@misc{deepseek2024v2,
title = {{DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model}},
author = {{DeepSeek-AI} and Liu, A. and Feng, B. and Wang, B. and Wang, B. and Liu, B. and others},
year = {2024},
eprint = {2405.04434},
archivePrefix = {arXiv}
}
DeepSeek-Coder-V2 — DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Code-specialized MoE model competitive with GPT-4 Turbo on coding benchmarks
@misc{zhu2024deepseek,
title = {{DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence}},
author = {Zhu, Q. and Guo, D. and Shao, Z. and Yang, D. and Wang, P. and Xu, R. and others},
year = {2024},
eprint = {2406.11931},
archivePrefix = {arXiv}
}
DeepSeek-V3 — DeepSeek-V3 Technical Report
671B MoE model (37B active) trained on 14.8T tokens for only 2.8M H800 GPU hours, rivaling frontier closed models
@misc{deepseek2024v3,
title = {{DeepSeek-V3 Technical Report}},
author = {{DeepSeek-AI} and Liu, A. and Feng, B. and Xue, B. and Wang, B. and Wu, B. and others},
year = {2024},
eprint = {2412.19437},
archivePrefix = {arXiv}
}
DeepSeek-R1 — DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Large reasoning model trained with pure RL, matching OpenAI o1 on math and code benchmarks. Seminal because it showed reinforcement learning alone can elicit chain-of-thought reasoning, opening the open-source reasoning-model paradigm
@misc{deepseek2025r1,
title = {{DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning}},
author = {{DeepSeek-AI} and Guo, D. and Yang, D. and Zhang, H. and Song, J. and Zhang, R. and others},
year = {2025},
eprint = {2501.12948},
archivePrefix = {arXiv}
}
Mistral AI
Mistral 7B — Mistral 7B
Compact 7B model outperforming Llama 2 13B on all benchmarks using sliding window attention and GQA. Seminal because it proved small, well-engineered models could rival much larger ones and launched the European open-weights ecosystem
@misc{jiang2023mistral,
title = {{Mistral 7B}},
author = {Jiang, A. and Sablayrolles, A. and Mensch, A. and Bamford, C. and Chaplot, D. and de las Casas, D. and others},
year = {2023},
eprint = {2310.06825},
archivePrefix = {arXiv}
}
Mixtral 8x7B — Mixtral of Experts
Sparse MoE with 8 experts (12.9B active of 46.7B total), matching or beating GPT-3.5 and Llama 2 70B. Seminal because it brought sparse Mixture-of-Experts into the open-weights mainstream, making MoE the go-to efficiency architecture
@misc{jiang2024mixtral,
title = {{Mixtral of Experts}},
author = {Jiang, A. and Sablayrolles, A. and Roux, A. and Mensch, A. and Savary, B. and Bamford, C. and others},
year = {2024},
eprint = {2401.04088},
archivePrefix = {arXiv}
}
Mixtral 8x22B — Mixtral 8x22B Model Card
Scaled MoE model (39B active of 141B total); no standalone paper, cite Mixtral paper or model card
@misc{mistral2024mixtral8x22b,
title = {{Cheaper, Better, Faster, Stronger -- Mixtral 8x22B}},
author = {{Mistral AI}},
year = {2024},
url = {https://mistral.ai/news/mixtral-8x22b/},
note = {Accessed February 9, 2026}
}
Mistral Large 2 — Mistral Large 2 Blog
123B dense model with 128K context, strong on code and multilingual tasks; no technical paper
@misc{mistral2024large2,
title = {{Mistral Large 2}},
author = {{Mistral AI}},
year = {2024},
url = {https://mistral.ai/news/mistral-large-2407/},
note = {Accessed February 9, 2026}
}
Mistral Small 3 — Mistral Small 3 Blog
Efficient 24B model balancing latency and quality for edge/on-device use; no technical paper
@misc{mistral2025small3,
title = {{Mistral Small 3}},
author = {{Mistral AI}},
year = {2025},
url = {https://mistral.ai/news/mistral-small-3/},
note = {Accessed February 9, 2026}
}
Alibaba / Qwen
Qwen — Qwen Technical Report
First generation of Qwen models (1.8B--72B) with strong multilingual and tool-use capabilities
@misc{bai2023qwen,
title = {{Qwen Technical Report}},
author = {Bai, J. and Bai, S. and Chu, Y. and Cui, Z. and Dang, K. and Deng, X. and others},
year = {2023},
eprint = {2309.16609},
archivePrefix = {arXiv}
}
Qwen2 — Qwen2 Technical Report
Second generation (0.5B--72B) with GQA and expanded multilingual support
@misc{yang2024qwen2,
title = {{Qwen2 Technical Report}},
author = {Yang, A. and Yang, B. and Hui, B. and Zheng, B. and Yu, B. and Zhou, C. and others},
year = {2024},
eprint = {2407.10671},
archivePrefix = {arXiv}
}
Qwen2.5 — Qwen2.5 Technical Report
Flagship open-weights family (0.5B--72B + MoE), trained on 18T tokens, with the 72B model competitive with Llama 3 405B
@misc{qwen2024qwen25,
title = {{Qwen2.5 Technical Report}},
author = {{Qwen Team} and Yang, A. and Yang, B. and Hui, B. and Zheng, B. and Yu, B. and others},
year = {2024},
eprint = {2412.15115},
archivePrefix = {arXiv}
}
Qwen2.5-Coder — Qwen2.5-Coder Technical Report
Code-specialized series trained on 5.5T code tokens, matching GPT-4o on coding tasks
@misc{hui2024qwen25coder,
title = {{Qwen2.5-Coder Technical Report}},
author = {Hui, B. and Yang, J. and Cui, Z. and Yang, J. and Liu, D. and Zhang, L. and others},
year = {2024},
eprint = {2409.12186},
archivePrefix = {arXiv}
}
QwQ — QwQ: Reflect Deeply on the Boundaries of the Unknown
32B reasoning model derived from Qwen2.5 with extended chain-of-thought; no standalone paper
@misc{qwen2024qwq,
title = {{QwQ: Reflect Deeply on the Boundaries of the Unknown}},
author = {{Qwen Team}},
year = {2024},
url = {https://qwenlm.github.io/blog/qwq-32b-preview/},
note = {Accessed February 9, 2026}
}
Microsoft
Phi-1 — Textbooks Are All You Need
1.3B code model trained on synthetic "textbook-quality" data, achieving strong coding performance. Seminal because it proved data quality can substitute for scale, pioneering the synthetic-data paradigm that influenced nearly every small-model effort since
@misc{gunasekar2023textbooks,
title = {{Textbooks Are All You Need}},
author = {Gunasekar, S. and Zhang, Y. and Aneja, J. and Mendes, C. and Del Giorno, A. and Gopi, S. and others},
year = {2023},
eprint = {2306.11644},
archivePrefix = {arXiv}
}
Phi-1.5 — Textbooks Are All You Need II: phi-1.5 Technical Report
1.3B model extending synthetic data approach to commonsense reasoning
@misc{li2023textbooks,
title = {{Textbooks Are All You Need II: phi-1.5 Technical Report}},
author = {Li, Y. and Bubeck, S. and Eldan, R. and Del Giorno, A. and Gunasekar, S. and Lee, Y. and others},
year = {2023},
eprint = {2309.05463},
archivePrefix = {arXiv}
}
Phi-3 — Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
3.8B model rivaling Mixtral 8x7B, trained on heavily filtered web data and synthetic data
@misc{abdin2024phi3,
title = {{Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone}},
author = {Abdin, M. and Jacobs, S. and Awan, A. and Aneja, J. and Awadallah, A. and Awadalla, H. and others},
year = {2024},
eprint = {2404.14219},
archivePrefix = {arXiv}
}
Phi-4 — Phi-4 Technical Report
14B model surpassing its GPT-4 teacher on STEM via strategic synthetic data throughout training
@misc{abdin2024phi4,
title = {{Phi-4 Technical Report}},
author = {Abdin, M. and Aneja, J. and Behl, H. and Bubeck, S. and Eldan, R. and Gunasekar, S. and others},
year = {2024},
eprint = {2412.08905},
archivePrefix = {arXiv}
}
Cohere
Command R — Command R Model Card
RAG-optimized 35B model with 128K context and strong tool-use; no technical paper
@misc{cohere2024commandr,
title = {{Command R: Retrieval-Augmented Generation at Scale}},
author = {Cohere},
year = {2024},
url = {https://cohere.com/blog/command-r},
note = {Accessed February 9, 2026}
}
Aya 23 — Aya 23: Open Weight Releases to Further Multilingual Progress
Open-weights 8B/35B multilingual models covering 23 languages
@misc{aryabumi2024aya,
title = {{Aya 23: Open Weight Releases to Further Multilingual Progress}},
author = {Aryabumi, V. and Dang, J. and Talupuru, D. and Dash, S. and Cairuz, D. and Lin, H. and others},
year = {2024},
eprint = {2405.15032},
archivePrefix = {arXiv}
}
Aya Expanse — Aya Expanse: Connecting Our World
Successor to Aya 23 covering the same 23 languages with improved multilingual performance
@misc{dang2024aya,
title = {{Aya Expanse: Connecting Our World}},
author = {Dang, J. and Aryabumi, V. and Talupuru, D. and Dash, S. and Cairuz, D. and Lin, H. and others},
year = {2024},
eprint = {2412.04261},
archivePrefix = {arXiv}
}
Command A — Command A Model Card
111B parameter model with 256K context for complex agentic tasks; supports 23 languages and replaces Command R+
@misc{cohere2025commanda,
title = {{Command A}},
author = {Cohere},
year = {2025},
url = {https://cohere.com/blog/command-a},
note = {Accessed February 9, 2026}
}
AI21 Labs
Jamba — Jamba: A Hybrid Transformer-Mamba Language Model
Novel hybrid architecture combining Transformer and Mamba (SSM) layers with MoE. Seminal because it was the first production hybrid Transformer-SSM model, opening a new architectural design axis beyond pure Transformers
@misc{lieber2024jamba,
title = {{Jamba: A Hybrid Transformer-Mamba Language Model}},
author = {Lieber, O. and Lenz, B. and Bata, H. and Cohen, G. and Osin, J. and Dalmedigos, I. and others},
year = {2024},
eprint = {2403.19887},
archivePrefix = {arXiv}
}
Jamba 1.5 — Jamba 1.5: Hybrid Transformer-Mamba Models at Scale
Scaled hybrid SSM-Transformer models (Mini 12B active, Large 94B active) with 256K context
@misc{team2024jamba,
title = {{Jamba 1.5: Hybrid Transformer-Mamba Models at Scale}},
author = {{Jamba Team} and Bata, H. and Cohen, G. and Daoulas, I. and Dalmedigos, I. and Gera, A. and others},
year = {2024},
eprint = {2408.12570},
archivePrefix = {arXiv}
}
xAI
Grok-1 — Grok-1 Model Card
314B MoE model open-sourced under Apache 2.0; no technical paper published
@misc{xai2024grok1,
title = {{Grok-1}},
author = {{xAI}},
year = {2024},
url = {https://x.ai/blog/grok/model-card},
note = {Accessed February 9, 2026}
}
Grok-2 — Grok-2 Blog
Frontier-class model with strong reasoning and vision capabilities; no technical paper
@misc{xai2024grok2,
title = {{Grok-2}},
author = {{xAI}},
year = {2024},
url = {https://x.ai/blog/grok-2},
note = {Accessed February 9, 2026}
}
Grok-3 — Grok-3 Blog
Trained on 200K H100 GPUs (10x Grok-2 compute) with reasoning modes (Think, Big Brain, DeepSearch); outperforms GPT-4o and Gemini 2 Pro on AIME and GPQA
@misc{xai2025grok3,
title = {{Grok-3}},
author = {{xAI}},
year = {2025},
url = {https://x.ai/news/grok-3},
note = {Accessed February 9, 2026}
}
Grok-4 — Grok-4 Model Card
Advanced reasoning model with native tool use and real-time search; 128K context with deep domain knowledge across finance, healthcare, law, and science
@misc{xai2025grok4,
title = {{Grok-4 Model Card}},
author = {{xAI}},
year = {2025},
url = {https://x.ai/news/grok-4},
note = {Accessed February 9, 2026}
}
01.AI
Yi — Yi: Open Foundation Models by 01.AI
Bilingual (English/Chinese) 6B/34B models trained on 3T tokens with strong reasoning
@misc{young2024yi,
title = {{Yi: Open Foundation Models by 01.AI}},
author = {Young, A. and Chen, B. and Li, C. and Huang, C. and Zhang, G. and Zhang, G. and others},
year = {2024},
eprint = {2403.04652},
archivePrefix = {arXiv}
}
Technology Innovation Institute (TII)
Falcon — The Falcon Series of Open Language Models
Open-weights 7B/40B/180B models trained on curated web data (RefinedWeb)
@misc{almazrouei2023falcon,
title = {{The Falcon Series of Open Language Models}},
author = {Almazrouei, E. and Alobeidli, H. and Alshamsi, A. and Cappelli, A. and Cojocaru, R. and others},
year = {2023},
eprint = {2311.16867},
archivePrefix = {arXiv}
}
Falcon 2 — Falcon 2: An 11 Billion Parameter Large Language Model
11B model with vision variant, competitive with much larger open models
@misc{malartic2024falcon2,
title = {{Falcon 2: An 11 Billion Parameter Large Language Model}},
author = {Malartic, Q. and Chowdhury, N. and Cojocaru, R. and others},
year = {2024},
eprint = {2407.14885},
archivePrefix = {arXiv}
}
Stability AI
Stable LM 2 — Stable LM 2 1.6B Technical Report
Efficient 1.6B model competitive with larger models on downstream tasks
@misc{bellagente2024stable,
title = {{Stable LM 2 1.6B Technical Report}},
author = {Bellagente, M. and Tow, J. and Mahan, D. and Phang, J. and others},
year = {2024},
eprint = {2402.17834},
archivePrefix = {arXiv}
}
NVIDIA
Nemotron-4 — Nemotron-4 340B Technical Report
340B model with a synthetic data generation pipeline for alignment, designed primarily for generating training data for smaller models
@misc{adler2024nemotron,
title = {{Nemotron-4 340B Technical Report}},
author = {Adler, B. and Agarwal, N. and Aithal, A. and Anh, D. and Bhatt, P. and Choi, J. and others},
year = {2024},
eprint = {2406.11704},
archivePrefix = {arXiv}
}
Databricks
DBRX — DBRX Blog
Fine-grained 132B MoE model (36B active) outperforming Llama 2 70B and Mixtral; no arXiv paper
@misc{databricks2024dbrx,
title = {{Introducing DBRX: A New State-of-the-Art Open LLM}},
author = {Databricks},
year = {2024},
url = {https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm},
note = {Accessed February 9, 2026}
}
Amazon
Nova Pro — Amazon Nova Pro Model Card
Highly capable multimodal model for text, image, and video with strong accuracy-speed-cost balance; available through Amazon Bedrock
@misc{amazon2024novapro,
title = {{Amazon Nova: A New Generation of Foundation Models}},
author = {Amazon},
year = {2024},
url = {https://aws.amazon.com/ai/generative-ai/nova/},
note = {Accessed February 9, 2026}
}
Nova Lite — Amazon Nova Lite Model Card
Low-cost multimodal model optimized for fast processing of image, video, and text inputs
@misc{amazon2024novalite,
title = {{Amazon Nova: A New Generation of Foundation Models}},
author = {Amazon},
year = {2024},
url = {https://aws.amazon.com/ai/generative-ai/nova/},
note = {Accessed February 9, 2026}
}