Overview
OpenAI's ecosystem is the most commercially mature of the three major AI companies. Unlike Anthropic (focused, developer-first) or Google (sprawling, integration-heavy), OpenAI has evolved into a full platform company - models, consumer products, developer APIs, coding agents, media generation, and an app store. They move fast and retire things fast. GPT-4o, GPT-4.1, GPT-4.1 mini, o4-mini, GPT-5 Instant, and GPT-5 Thinking were all retired from ChatGPT on February 13, 2026 (yesterday). The current default is GPT-5.2.
The ecosystem breaks down into roughly these areas:
- Models - GPT-5.2 family (Instant, Thinking, Pro), GPT-5.3-Codex, GPT-OSS open weights
- ChatGPT Products - Chat, Deep Research, Operator, Custom GPTs, GPT Store, Sora, Advanced Voice
- Coding Tools - Codex (cloud, CLI, IDE extension, app), AGENTS.md
- Developer Platform - Responses API, Agents SDK, Realtime API, Chat Completions API, built-in tools (web search, file search, computer use)
- Media Generation - GPT Image 1.5 (images), Sora 2 (video), Advanced Voice / gpt-realtime (audio)
- Open Weight Models - GPT-OSS-120b and GPT-OSS-20b (Apache 2.0)
- Pricing & Plans - Free, Go ($8/mo), Plus ($20/mo), Pro ($200/mo), Business ($25/user/mo), Enterprise (custom), Edu
The Models
GPT-5.2 - Current flagship. Released late 2025 / early 2026. Comes in three modes:
- Instant - Fast everyday model. Good for general chat, writing, info-seeking. Available to all users including free tier.
- Thinking - Reasoning mode for harder problems. Configurable thinking time (Light, Standard, Extended). Best for code, math, analysis, complex multi-step work. Available on Plus and above.
- Pro - Maximum compute. Extended reasoning for the hardest problems. Available only on Pro ($200/mo) plan.
GPT-5.2 is described as OpenAI's best model for professional knowledge work. Outperforms industry professionals on 70.9% of GDPval knowledge work tasks. Improvements in spreadsheets, presentations, code, image perception, long context, tool use, and multi-step projects. As of Feb 13, 2026, GPT-5.2 is the default model in ChatGPT - all older models retired.
GPT-5.3-Codex - Released February 5, 2026. Most capable coding model. First to combine Codex + GPT-5 training stacks. ~25% faster than GPT-5.2-Codex. State of the art on SWE-Bench Pro (multi-language) and Terminal-Bench 2.0. Available everywhere you use Codex (app, CLI, IDE extension, web). Moving "beyond writing code to using it as a tool to operate a computer and complete work end to end."
GPT-5.3-Codex-Spark - Released February 12, 2026 (research preview). Smaller version of GPT-5.3-Codex optimized for real-time coding. First model from OpenAI's Cerebras partnership - runs on Cerebras Wafer-Scale Engine at 1,000+ tokens/second. Available in Codex app, CLI, and VS Code extension for Pro users only. Not available via API at launch - API workflows should continue using gpt-5.2-codex.
GPT-5 - Released August 7, 2025. The "unified system" model that merged the GPT and o-series approaches. Smart router that decides when to respond quickly vs think longer. 400k token context window. $1.25/$10 per million tokens (input/output). Still available in API.
GPT-OSS - OpenAI's first open-weight models since GPT-2 (2019). Released August 5, 2025.
- GPT-OSS-120b - Large model. Runs in 80GB memory (MXFP4 quantized). Comparable to frontier models on safety benchmarks.
- GPT-OSS-20b - Small model. Runs in 16GB memory. $0.03/million input tokens (cheapest OpenAI model).
- Apache 2.0 license (maximally permissive). Available on Hugging Face, Ollama, LM Studio. Day-one support from Nvidia, AMD, Azure, AWS.
- Positions OpenAI in the open model race alongside DeepSeek, Qwen, Llama.
Legacy models still in API (retired from ChatGPT): GPT-4o, GPT-4.1, GPT-4.1 mini, o4-mini. API access remains for now but expect eventual sunset.
ChatGPT Products
ChatGPT - Consumer chat interface. Web, iOS, Android, macOS, Windows. The most widely used AI product globally. Now defaults to GPT-5.2 across all tiers.
Deep Research - Asynchronous research agent. Powered by a version of o3 optimized for web browsing and data analysis. Searches, reads, analyzes hundreds of sources, produces comprehensive cited reports. As of Feb 2026: connects to any MCP or app, can restrict web searches to trusted sites, real-time progress tracking, follow-up prompts mid-research. Pro users get 250 queries/month, Plus/Team/Enterprise get 25, Free gets 5.
Operator - Web browsing agent that can take real-world actions (bookings, form filling, purchases). Uses Computer Use tool under the hood.
Custom GPTs - No-code custom versions of ChatGPT with specific instructions, knowledge files, and capabilities. GPT Store for discovery. 3+ million GPTs created. Categories include productivity, education, programming, lifestyle. As of Jan 2026, most live models available for GPTs with custom actions. Apps integration for Business/Enterprise.
GPT Store - Marketplace for custom GPTs. Search, leaderboard, categories. Available to all paid tiers. Revenue sharing being tested with select builders.
Advanced Voice - Real-time voice conversations. Natural, picks up on non-verbal cues (speed, emotion). Available on Plus and above.
Sora 2 - Text-to-video generation with synchronized audio. Launched September 30, 2025. Social iOS/Android app. Characters feature (upload yourself into scenes). Storyboards for frame-by-frame control. 10-25 second videos at 1080p. Disney partnership (reportedly valued at $1B) for 200+ licensed character generation. Pro users get storyboards and longer durations. Video API available (v1/videos endpoint). Primary rival is Google's Veo 3 - Sora 2 wins on character consistency, Veo 3 wins on duration (up to 60s vs Sora's 25s).
ChatGPT Images - Powered by GPT Image 1.5. Inline image generation and editing in chat. Iterative edits, transparent backgrounds, precise style control.
Canvas - Side-by-side writing and code editing surface within ChatGPT.
Projects - Organize conversations, files, and instructions into persistent workspaces.
Tasks - Schedule ChatGPT to do things at specific times (reminders, research, recurring reports).
Memory - ChatGPT remembers details across conversations. Longer memory on paid plans.
Coding Tools
OpenAI has consolidated around "Codex" as the brand for all coding tools. The distinction matters:
Codex Web (Cloud) - The original. Cloud-based agent accessible through ChatGPT sidebar. Each task runs in isolated container with your repo. 1-30 minutes per task. No internet access during execution (security). Can read/edit files, run tests, linters, type checkers. Commits changes, creates PRs. Powered by GPT-5.3-Codex.
Codex CLI - Open-source terminal agent. Runs locally on your machine. Interactive coding, research, scripting. Supports MCP for custom integrations. Can be used as an MCP server itself (for Agents SDK orchestration). AGENTS.md support for repo-specific instructions. Approval modes for human-in-the-loop. Install via npm i -g @openai/codex or brew install --cask codex. Sign in with ChatGPT account.
Codex IDE Extension - VS Code, Cursor, Windsurf, JetBrains (Rider, IntelliJ, PyCharm, WebStorm). Brings Codex into your editor.
Codex App - macOS desktop app released February 2, 2026. Windows alpha testing started. "Command center for agentic coding." Uses virtual git worktrees so agents code on separate branches without locking your file system or touching uncommitted changes - this is the technical magic that enables parallel execution. Built-in cloud environments. Skills system for going beyond code (prototyping, documentation, code understanding). Automations for unprompted work (issue triage, alert monitoring, CI/CD).
Key Codex concepts:
- AGENTS.md - Repo-level instruction file (like .cursorrules but for Codex). Tells the agent about your project structure, conventions, and constraints.
- Skills - Pre-defined capabilities (similar to Claude Code skills / Gemini CLI extensions).
- Automations - Codex works unprompted on triggers (e.g., new issue filed, CI failure).
- Aardvark - OpenAI's security research agent. First offering in the Codex Security product suite. Free codebase scanning for open-source projects including Next.js. $10M in API credits committed for cyber defense.
Honest take: Codex is arguably the most mature coding agent ecosystem. CLI + cloud + IDE + desktop app covers every workflow. But it's also the most confusing - four surfaces, and state varies by surface. The CLI is your terminal researcher (local files, logs, scripting). The App is your project manager (async background tasks, parallel agents on worktrees). The IDE extension is your pair programmer (quick inline edits). Don't use the IDE extension for big refactors - use the App. Don't use the App for "quick fix this function" - use the IDE.
The Developer Platform
Responses API - The primary API for building agents. Replaces/supersedes Chat Completions API. Combines simplicity of completions with tool-use capabilities. Single call can use multiple built-in tools:
- Web search - Real-time cited answers from the web
- File search - RAG over document repositories with metadata filtering
- Computer use - Agent controls browser, fills forms, navigates software
- Conversations API for durable threads and replayable state
- MCP connectors for external context and actions
Chat Completions API - Still supported, still receiving updates. But Responses API is the future. Assistants API being deprecated mid-2026.
Agents SDK - Open-source Python/TypeScript framework for multi-agent workflows. Production-ready upgrade of Swarm. Provider-agnostic (works with 100+ LLMs, not just OpenAI). Key primitives:
- Agents (LLMs with instructions + tools)
- Handoffs (transfer control between agents)
- Guardrails (input/output validation)
- Sessions (automatic conversation history)
- Tracing (visualization, debugging, monitoring)
- Realtime agents for voice
- GitHub: openai/openai-agents-python
Realtime API - Low-latency speech-to-speech for voice agents. gpt-realtime model.
Video API - v1/videos endpoint for Sora 2 programmatic access.
Evals & Fine-tuning - Built-in evaluation tools, graders, reinforcement fine-tuning. Measure -> improve -> ship loop. Inference discounts if you enable data sharing during fine-tuning.
Batch API - Lower prices with higher latency for non-time-sensitive bulk processing.
Flex Processing - Lower prices with higher latency tier.
Priority Processing - Faster processing service tier for time-sensitive requests.
Media Generation
GPT Image 1.5 - Image generation and editing via API. Higher fidelity than earlier models. Available as a built-in tool in the Responses API.
Sora 2 / Sora 2 Pro - Video generation with synchronized audio (dialogue, sound effects, music). Launched September 30, 2025. Text-to-video, image-to-video, video remix. Up to 25 seconds at 1080p. Character cameos. Available via API (v1/videos), Sora app, and Azure. Disney character partnership (reportedly $1B) for licensed content.
Advanced Voice / gpt-realtime - Speech-to-speech. Real-time voice conversations with emotion, interruption detection, context management. API available for building voice agents.
DALL.E - Still accessible via dedicated GPT. Mostly superseded by GPT Image 1.5 for new work.
Open Weight Models
GPT-OSS (Released August 2025)
- GPT-OSS-120b - Frontier-comparable. 80GB memory (MXFP4). Strong on safety benchmarks.
- GPT-OSS-20b - Efficient. 16GB memory. Cheapest OpenAI model ($0.03/M input tokens).
- Apache 2.0 license. Hugging Face, Ollama, LM Studio. Reference implementations for PyTorch and Apple Metal.
- Harmony response format (structured prompt interface with separate reasoning and output channels).
- $500k Red Teaming Challenge for community safety testing.
- Strategically positioned against Chinese open models (DeepSeek, Qwen) and Meta's Llama.
GPT-OSS-Safeguard (Released October 29, 2025)
- gpt-oss-safeguard-120b and gpt-oss-safeguard-20b - Fine-tuned from GPT-OSS for safety classification.
- "Bring your own policy" design - provide your written safety policy at inference time, model classifies content against it with full chain-of-thought reasoning.
- Use cases: content moderation, Trust & Safety pipelines, LLM input/output filtering, policy enforcement.
- Developed with Discord, SafetyKit, and ROOST (Robust Open Online Safety Tools).
- Apache 2.0 license. Available on Hugging Face, Ollama, LM Studio.
- Why it matters for builders: Run your own safety guardrails on-prem without sending data to an external moderation API. Configurable reasoning effort (low/medium/high).
Documentation & Learning
Official docs:
- OpenAI API Documentation - Full API reference
- OpenAI Developers Portal - Guides, cookbooks, code samples
- Codex Documentation - CLI, app, IDE extension, API guides
- Agents SDK Docs - Multi-agent framework
- OpenAI Cookbook - Jupyter notebooks and examples
- Model Release Notes - Ongoing changelog
- ChatGPT Release Notes - Product changelog
Courses & learning:
- OpenAI Academy - Official learning platform
- Developer cookbooks on API usage, prompting, fine-tuning, agents
- Build hours and video guides on Agents SDK, Codex, tool calling
- DeepLearning.AI courses - Andrew Ng's platform has several OpenAI-focused courses
Official GitHub Repos
OpenAI has 230+ repositories. Key ones for builders:
- openai/codex - Codex CLI. Lightweight terminal coding agent.
- openai/openai-agents-python - Agents SDK (Python). Multi-agent workflows.
- openai/openai-agents-js - Agents SDK (TypeScript/JavaScript).
- openai/openai-python - Official Python SDK.
- openai/openai-node - Official Node.js SDK.
- openai/openai-cookbook - Example code and guides.
- openai/tiktoken - Tokenizer library (BPE).
- openai/whisper - Speech recognition model.
- openai/CLIP - Vision-language model.
- openai/evals - Evaluation framework.
- openai/preparedness - Safety benchmarks and evals.
Custom GPTs & Extensions Ecosystem
Custom GPTs - No-code customization of ChatGPT. Instructions, knowledge files, capabilities (web search, image gen, code interpreter, data analysis). Can connect to third-party APIs via custom actions. 3+ million created.
GPT Store - Discovery marketplace. Categories: productivity, education, programming, lifestyle. Community leaderboard. Featured GPTs from partners (AllTrails, Consensus, Khan Academy, Canva, Zapier). Revenue sharing in testing with select builders.
Apps - Newer integration method for Business/Enterprise. Connect approved external apps inside GPTs. Replaces custom actions for enterprise use. Apps-enabled GPTs limited to company GPT store (not public).
MCP Support - Codex supports MCP servers and can run as an MCP server. Deep Research connects to MCP/apps as of Feb 2026. Growing compatibility with the broader MCP ecosystem.
Honest take: The GPT Store hasn't become the "App Store for AI" that OpenAI pitched in 2023. Discovery is limited, monetization is unclear, and most useful GPTs are simple instruction wrappers. The real power is in enterprise custom GPTs with API actions and the newer Apps integration. For builders, the Agents SDK + Responses API is where the actual platform extensibility lives.
Community
- OpenAI Developer Community Forum - Official forum. API discussions, bug reports, feature requests. Active.
- r/ChatGPT - Largest ChatGPT subreddit. Millions of members. Consumer-focused.
- r/OpenAI - General OpenAI discussion. Large community.
- OpenAI Discord - Unofficial but large community Discord.
- OpenAI Blog - Official announcements, research, product launches.
- OpenAI Developers Blog - Developer-focused updates.
- @OpenAI on X/Twitter - Official account. Major announcements.
Note: OpenAI does not have an official Discord server. The community is more fragmented than Anthropic's (which has a strong official Discord) or Google's (official Gemini Discord). The Developer Community Forum is the closest thing to an official gathering place.
Pricing
ChatGPT Consumer Plans
- Free - GPT-5.2 Instant (limited). Basic image gen, web browsing, data analysis. Ad-supported (testing).
- Go - $8/mo. 10x more messages/uploads/image gen vs free. Longer memory. GPT-5.2 Instant unlimited (abuse guardrails). May have ads.
- Plus - $20/mo. GPT-5.2 Thinking. Access to Codex. Sora. Deep Research (25/mo). Legacy model access. No ads.
- Pro - $200/mo. GPT-5.2 Pro (maximum compute). 250 Deep Research/mo. Max memory/context. Early feature previews. No ads.
Business Plans
- Business - $25/user/mo. Workspace features, admin console, SSO, data privacy (no training on your data). GPT-5.2 across all modes.
- Enterprise - Custom pricing. Unlimited access, SOC 2 compliant, HIPAA eligible (with BAA), extended context, dedicated support, custom legal terms, data residency options.
- Edu - Affordable plan for universities. Enhanced security/privacy.
- Nonprofits - Up to 75% discount on Business or Enterprise.
API Pricing (per million tokens, approximate)
- GPT-5.2 - $2.50 input / $10.00 output (estimated from GPT-5 pricing)
- GPT-5.2 Pro - Premium pricing (~$120/M output for deepest reasoning)
- GPT-4o - $2.50 input / $10.00 output
- GPT-OSS-20b - $0.03 input / $0.12 output (cheapest)
- GPT-OSS-120b - $0.30 input / $1.20 output
- Batch API and Flex Processing tiers for lower prices
- Priority Processing for faster responses
- Fine-tuning available with inference discounts for data sharing
Article Framing Notes
Angle for builders: OpenAI is the most complete platform but also the most complex. More products, more models, more surfaces than anyone else. The key value for builders is that OpenAI has the most polished consumer product (ChatGPT), the most mature coding agent (Codex across four surfaces), and the deepest API platform (Responses API + Agents SDK + built-in tools). The tradeoff is complexity - keeping track of which model is current, which API to use, which Codex surface to pick.
Key distinctions to make clear:
- Codex CLI (terminal, local) vs Codex Web (cloud, async) vs Codex App (desktop, parallel agents) vs Codex IDE extension (in-editor)
- Responses API (new, agent-native) vs Chat Completions API (legacy, still supported) vs Assistants API (being deprecated mid-2026)
- GPT-5.2 Instant (fast) vs Thinking (reasoning) vs Pro (max compute)
- Custom GPTs (no-code) vs Agents SDK (code) vs Responses API (API)
Honest takes to include:
- GPT Store hasn't lived up to the "App Store for AI" hype
- The model retirement pace is aggressive - things you build today may need migration in months
- Codex is powerful but confusing with four different surfaces
- Pricing is getting more complex with Go/Plus/Pro/Business/Enterprise + API tiers + credits system
- Open weight models (GPT-OSS) are a genuine strategic shift but still behind Chinese open models in community adoption
- The gpt-oss-safeguard models are genuinely useful for enterprise builders who need on-prem safety classification without external API calls
- Sora 2 is impressive but the Disney deal and social app angle feel more consumer than builder
- The Agents SDK being provider-agnostic (100+ LLMs) is genuinely useful - you're not locked in
Voice/structure should match Anthropic and Google articles: Builder-focused, skip consumer fluff, honest about limitations, inline links throughout.
Sources
- OpenAI official blog and product pages
- OpenAI API documentation and pricing pages
- OpenAI Help Center (model release notes, ChatGPT release notes)
- GitHub repositories (openai org)
- OpenAI Developers Portal and blog
- Third-party analysis (IntuitionLabs, TechCrunch, VentureBeat, IEEE Spectrum, MIT Technology Review)
- Wikipedia (Sora, OpenAI Codex)