Back to Blog
6 min read

OpenAI Previews GPT-5.6 Sol, Terra and Luna Model Family

OpenAI previewed GPT-5.6 on 26 June 2026 with Sol, Terra, and Luna across three price points, Sol scoring 88.8% on TerminalBench 2.1 with a new Sol Ultra sub-agent mode.

OpenAI Previews GPT-5.6 Sol, Terra and Luna Model Family

OpenAI Previews GPT-5.6 Sol, Terra and Luna on 26 June 2026

On 26 June 2026, OpenAI began a limited preview of GPT-5.6, its most capable model family to date, introducing three distinct models named Sol, Terra, and Luna. Sol is the flagship reasoning model for frontier agentic workloads, Terra is the balanced everyday model, and Luna is the fastest and least expensive option for high-volume routine tasks. Access at launch is restricted to approximately 20 partner organisations, with general availability expected in the coming weeks. The rollout follows OpenAI's standard practice of sharing models with the US government before public launch, a step described as ensuring robust safety review before wider deployment.

Three Models, Three Price Points

GPT-5.6 introduces a deliberately tiered pricing structure. Sol, the frontier model, is priced at five dollars per million input tokens and thirty dollars per million output tokens. Terra is available at two dollars and fifty cents per million input tokens and fifteen dollars per million output tokens, benchmarking comparably to GPT-5.5 while delivering results at roughly half the cost. Luna, the fastest and most affordable tier, is priced at one dollar per million input tokens and six dollars per million output tokens, suited to high-throughput workflows where cost efficiency matters more than peak capability.

By offering Sol, Terra, and Luna from a single release, OpenAI is positioning GPT-5.6 to compete simultaneously across the full enterprise AI market. Sol targets workflows currently served by frontier reasoning models from Anthropic and xAI. Terra targets the mid-tier everyday use cases where GPT-5.5 has been the standard. Luna competes with the fastest low-cost models from Google and Mistral on price and throughput.

Sol Ultra: Sub-Agent Mode and the Max Reasoning Tier

Sol introduces two new operating modes above standard usage. A new maximum reasoning effort setting instructs the model to apply more computation to each response at proportionally higher cost. Sol Ultra is a mode in which the model invokes sub-agents internally to tackle complex multi-step tasks, allocating separate reasoning chains to different parts of a problem before synthesising a final answer. This architecture is particularly relevant for long-horizon agentic workflows — automated code generation, research synthesis, and multi-document analysis — where the ability to parallelise reasoning steps within a single model call reduces total time to result.

OpenAI has also announced a partnership with Cerebras to run Sol at speeds of up to 750 tokens per second, which it describes as making frontier intelligence available at unprecedented speed. The Cerebras deployment is targeted for availability in July 2026. New prompt caching behaviour accompanies GPT-5.6, including support for explicit cache breakpoints and a minimum cache lifetime of thirty minutes, improving cost efficiency for applications that repeatedly send similar long system prompts.

TerminalBench 2.1 and Where GPT-5.6 Sits Competitively

The most widely cited benchmark for GPT-5.6 at launch is TerminalBench 2.1, which measures performance on long-horizon agentic and coding tasks. Sol scores 88.8 per cent on TerminalBench 2.1, compared to 88.0 per cent for Anthropic's Claude Mythos 5, 84.3 per cent for both Terra and Claude Fable 5, 83.4 per cent for GPT-5.5, 82.5 per cent for Luna, and 78.9 per cent for Claude Opus 4.8. Sol's 0.8 percentage point lead over Claude Mythos 5 is narrow, and these scores reflect the workloads OpenAI chose for evaluation rather than an independent audit. Competitive positioning at the frontier is measured in single-digit percentage points, and the benchmark landscape shifts as each company releases model updates.

Safety, Access, and the US Government Review

OpenAI described GPT-5.6 Sol as launching with its most robust safety stack to date, including targeted hardening against higher-risk activity, sensitive cybersecurity requests, and patterns indicative of repeated misuse. OpenAI spent multiple weeks testing the model against real-world attack patterns before the preview launch. The initial rollout to approximately 20 partner organisations, after sharing models and release plans with the US government, reflects an unusual degree of gated access for a commercial AI product and is consistent with the national security sensitivity surrounding frontier model launches in the first half of 2026.

What GPT-5.6 Means for Indian Teams Building on OpenAI

For Indian product teams and software companies already running workloads on the OpenAI API, GPT-5.6 introduces immediate model selection decisions. Terra's competitive performance at half the cost of GPT-5.5 makes it a natural candidate for teams looking to reduce API spend on everyday tasks. Sol is the right choice for teams building agentic systems where benchmark performance at the frontier translates directly to product quality, particularly in legal document analysis, complex code generation, and research automation workflows.

Luna's pricing at one dollar per million input tokens opens new product architectures for Indian teams building high-volume applications, including customer service automation, content moderation, and form-filling workflows where response latency and unit economics matter more than peak reasoning depth. The Cerebras-accelerated Sol at 750 tokens per second is also relevant for consumer-facing AI products where response latency is a key metric. The thirty-minute prompt cache lifetime is particularly useful for Indian enterprise teams building retrieval-augmented generation systems, since it reduces repeated token costs on the same base context across a session, a pattern common in multi-turn customer support and document review tools.

The Bottom Line

OpenAI previewed GPT-5.6 on 26 June 2026, introducing Sol, Terra, and Luna. Sol is the frontier reasoning model at five dollars per million input tokens and thirty dollars output, scoring 88.8 per cent on TerminalBench 2.1, narrowly ahead of Claude Mythos 5 at 88.0 per cent. Terra matches Claude Fable 5 at 84.3 per cent on TerminalBench 2.1 at roughly half the cost of GPT-5.5. Luna is the fastest and cheapest tier at one dollar input and six dollars output per million tokens. Sol Ultra adds an internal sub-agent reasoning mode for complex agentic workflows, and a Cerebras partnership will deliver Sol at 750 tokens per second from July. General availability is expected in the coming weeks. For Indian teams on the OpenAI API, Terra is the clearest upgrade from GPT-5.5 on cost, Sol is the choice for agentic quality benchmarks, and Luna opens architectures for high-volume cost-sensitive use cases.

Frequently Asked Questions

What are the three GPT-5.6 models and how are they priced?+

GPT-5.6 introduces three models: Sol, Terra, and Luna. Sol is the flagship reasoning model priced at five dollars per million input tokens and thirty dollars per million output tokens. Terra is the balanced everyday model at two dollars and fifty cents input and fifteen dollars output per million tokens, benchmarking comparably to GPT-5.5 at roughly half the cost. Luna is the fastest and most affordable tier at one dollar input and six dollars output per million tokens, suited to high-volume workflows where cost efficiency matters more than peak capability.

What is Sol Ultra in GPT-5.6 and what workloads is it designed for?+

Sol Ultra is an operating mode within GPT-5.6 Sol in which the model invokes internal sub-agents to tackle complex multi-step tasks. Rather than processing the entire problem in a single reasoning chain, Sol Ultra allocates separate reasoning chains to different components of the task before synthesising a final answer. This architecture is designed for long-horizon agentic workflows such as automated code generation across multiple files, research synthesis requiring information from multiple sources, and complex multi-document analysis. A new maximum reasoning effort setting is also available for tasks where applying more computation per response justifies the additional cost.

How does GPT-5.6 Sol compare to Claude Mythos 5 on TerminalBench 2.1?+

On TerminalBench 2.1, which measures performance on long-horizon agentic and coding tasks, GPT-5.6 Sol scores 88.8 per cent compared to 88.0 per cent for Anthropic's Claude Mythos 5 — a margin of 0.8 percentage points. Terra and Claude Fable 5 are tied at 84.3 per cent, GPT-5.5 scores 83.4 per cent, Luna scores 82.5 per cent, and Claude Opus 4.8 scores 78.9 per cent. These scores reflect workloads chosen by OpenAI rather than an independent third-party audit, and competitive positions at this performance tier shift as each company releases model updates.

When will GPT-5.6 be generally available and what should Indian teams do now?+

OpenAI has indicated general availability of GPT-5.6 Sol, Terra, and Luna in the coming weeks following the limited preview that began on 26 June 2026 with approximately 20 partner organisations. Indian teams currently using GPT-5.5 should evaluate whether Terra's comparable benchmark performance at half the cost justifies migrating everyday workloads once the model is generally available. Teams running agentic coding or research pipelines should assess Sol or Sol Ultra for quality-critical tasks. Teams with high-volume cost-sensitive applications should evaluate Luna, which at one dollar per million input tokens opens product architectures that were previously uneconomical at GPT-5.5 pricing.

TT

Written by

TechPillow Team

Sharing insights on technology, product development, and the Indian tech ecosystem.

Ready to Build Something Extraordinary?

From ideation to launch, we're your end-to-end technology partner.

Book a Free Strategy Call