Institution

OpenAI

An AI research and deployment company behind GPT, CLIP, DALL·E, and other frontier systems.

GPT-3 Explained: When the Prompt Became the Programming Interface

GPT-3 is a 175B-parameter autoregressive language model that performs translation, QA, and reasoning tasks from a few in-prompt examples, with no gradient updates or task-specific fine-tuning.

Alignment · OpenAI

InstructGPT: How RLHF Beat a Model 100x Its Size

OpenAI's InstructGPT used human feedback to align GPT-3, and evaluators preferred its 1.3B model over the 175B GPT-3 — more helpful with 100x fewer parameters.

Alignment · OpenAI

PPO Explained: The Clipped Objective Behind RLHF

PPO keeps policy-gradient RL stable with a clipped surrogate objective — almost as well-behaved as TRPO but far simpler — which made it the default RL engine behind RLHF for ChatGPT and InstructGPT.

Speech Recognition · OpenAI

Whisper: 680,000 Hours of Weak Supervision for Robust ASR

OpenAI's Whisper trains a single sequence-to-sequence model on 680,000 hours of web audio. It matches fully supervised systems zero-shot — no fine-tuning — and adds translation and language ID.

Multimodal Models · OpenAI

CLIP: Learning Visual Models From Natural Language Supervision

CLIP trains paired image and text encoders on 400 million internet image-text pairs, then matches the original ResNet-50's ImageNet accuracy zero-shot — without using any of its 1.28M labeled examples.

Text-to-Image · OpenAI

DALL·E 2 (unCLIP): Text-to-Image via CLIP Image Latents

DALL·E 2, called unCLIP in the paper, generates a CLIP image embedding from text with a prior, then renders it with a diffusion decoder — buying more diversity at almost no cost to photorealism or caption match.

Multimodal Models · OpenAI

GPT-4 Technical Report Explained: Benchmarks, Not Blueprints

OpenAI's GPT-4 report is a measurement document, not a recipe. It hits human-level scores on professional and academic exams — bar exam ~top 10% — yet discloses no architecture, data, or compute.