AI Intel: OpenAI Goes Full Superapp, Pentagon Can't Quit Claude, NVIDIA's $1T Vera Rubin Bet
OpenAI just confirmed it's merging ChatGPT, Codex, and its web browser into a single desktop "superapp." It's the clearest signal yet that the company sees its future not in chatbots, but in becoming the operating system for AI-powered work. Meanwhile, the Pentagon is discovering that dumping Claude is a lot harder than ordering it — and NVIDIA thinks the next two years of AI infrastructure are worth a cool trillion dollars.
OpenAI Merges Everything Into a Desktop Superapp
On Thursday, OpenAI confirmed a Wall Street Journal report: the company is folding its ChatGPT desktop app, Codex coding platform, and its web browser (codenamed Atlas) into a single unified application. Fidji Simo, OpenAI's CEO of Applications, is leading the effort alongside President Greg Brockman.
"Companies go through phases of exploration and phases of refocus; both are critical," Simo wrote on X. "But when new bets start to work, like we're seeing now with Codex, it's very important to double down on them and avoid distractions."
The timing is telling. Earlier this month, Simo held an all-hands meeting where she said OpenAI is "orienting aggressively" toward high-productivity use cases. Translation: the consumer chatbot era is over. OpenAI wants to be where the money is — developer tools and enterprise workflows.
This matters for developers because it signals where OpenAI's engineering resources are going. Codex isn't a side project anymore; it's getting promoted to the main stage. If you're building on OpenAI's APIs, expect the developer tooling to improve fast. If you're using competing coding agents like Claude Code, expect OpenAI to come after that market hard.
The superapp approach also means tighter integration between browsing, coding, and chat. Think: ask ChatGPT to research a library, have Codex implement it, and test it in the built-in browser — all without switching windows. That's the pitch, anyway. Whether it ships clean or turns into a bloated Electron app remains to be seen.
The Pentagon Wants to Dump Claude. Its Own Staff Won't Let It.
The Anthropic-Pentagon saga keeps getting messier. Here's where things stand: Defense Secretary Pete Hegseth labeled Anthropic a "supply chain risk" last month and ordered the department to stop using Claude. Anthropic sued. The Justice Department fired back this week, calling Anthropic "unacceptable" for national security.
But here's the part that matters: according to Reuters, Pentagon staff are "slow-rolling" the replacement. Developers inside the DoD have built entire workflows on Claude — data analysis pipelines, automated briefing generators, intelligence processing chains — and they don't want to throw them away. One Pentagon technologist told Reuters that shifting to new AI agents would mean losing months of work.
The kicker: the DoD's own filing admits that Claude is "currently the only AI model cleared for use" on classified systems. They're scrambling to deploy alternatives from Google, OpenAI, and xAI, but that takes time. You can't just swap one LLM for another when your workflows depend on specific model behaviors.
For developers, this is a case study in vendor lock-in at the highest possible stakes. The Pentagon built critical infrastructure on a single AI provider and is now stuck. The lesson: if you're building anything that matters, design for model portability from day one. Use OpenAI-compatible API formats so you can switch providers without rewriting your stack. Services like KissAPI exist specifically for this — one endpoint, multiple models, swap with a parameter change.
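The portability idea above can be sketched in a few lines. This is a hypothetical example, not Pentagon code: every backend speaks the same OpenAI-compatible chat-completions request shape, so switching providers is a config lookup rather than a rewrite. The base URLs and model names here are illustrative placeholders.

```python
# Hypothetical provider registry: all backends expose the OpenAI-compatible
# chat-completions format, so the call site never changes.
from dataclasses import dataclass

@dataclass(frozen=True)
class Provider:
    base_url: str   # OpenAI-compatible endpoint (illustrative)
    model: str      # default model name at that endpoint (illustrative)

PROVIDERS = {
    "openai":  Provider("https://api.openai.com/v1", "gpt-4o"),
    "gateway": Provider("https://gateway.example.com/v1", "claude-sonnet"),
}

def chat_request(provider_name: str, prompt: str) -> dict:
    """Build a request payload in the shared chat-completions shape."""
    p = PROVIDERS[provider_name]
    return {
        "url": f"{p.base_url}/chat/completions",
        "json": {
            "model": p.model,
            "messages": [{"role": "user", "content": prompt}],
        },
    }

# Swapping providers changes only the lookup key, never the call site.
req = chat_request("gateway", "Summarize this briefing.")
```

Because the request shape is identical across providers, the switch the Pentagon is struggling with becomes a one-line config change instead of months of rework.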
NVIDIA's Vera Rubin: A $1 Trillion Bet on AI Infrastructure
At GTC 2026 this week, Jensen Huang did what Jensen Huang does: he held up a chip and told the world it would change everything. This time it's Vera Rubin, NVIDIA's next-generation AI platform shipping in H2 2026.
But the real headline was the number: Huang expects combined purchase orders for Blackwell and Vera Rubin to hit $1 trillion through 2027. That's not revenue — that's demand. The AI infrastructure buildout is accelerating, not slowing down.
Vera Rubin isn't just a faster GPU. Tom's Hardware broke down the platform as seven chips working together — a complete rethinking of how AI compute is structured. NVIDIA Cloud Partners have now deployed over 1 million GPUs in AI factories globally, representing 1.7 gigawatts of AI capacity. For context, that's roughly the power output of a nuclear reactor, dedicated entirely to running AI models.
What this means for developers: compute costs should eventually come down as supply catches up with demand, but "eventually" might be 2028. In the meantime, the smart play is optimizing your token usage. Use the right model for the right task — Haiku for classification, Sonnet for daily coding, Opus only when you need it. Every token you don't send is money you don't spend.
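The "right model for the right task" play can be made concrete with a small routing table. This is a sketch under stated assumptions: the tier names follow the Haiku/Sonnet/Opus split mentioned above, but the per-token prices are made-up placeholders, not published rates.

```python
# Hypothetical cost-tier router: send each task class to the cheapest
# model that can handle it. Prices are illustrative, not real rates.
TIERS = {
    "classification": {"model": "haiku",  "usd_per_1k_tokens": 0.001},
    "coding":         {"model": "sonnet", "usd_per_1k_tokens": 0.010},
    "hard_reasoning": {"model": "opus",   "usd_per_1k_tokens": 0.050},
}

def pick_model(task: str) -> str:
    """Return the model tier assigned to a task class."""
    return TIERS[task]["model"]

def estimated_cost(task: str, tokens: int) -> float:
    """Rough spend estimate for a call totalling `tokens` tokens."""
    return TIERS[task]["usd_per_1k_tokens"] * tokens / 1000
```

Even with placeholder prices, the ratio is the point: at a 50x spread between tiers, routing a 10k-token classification job away from the top model is the difference between fractions of a cent and real money at scale.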
MiniMax M2.7: The Coding Dark Horse Nobody Expected
MiniMax, the Chinese AI lab that's been quietly shipping competitive models, dropped M2.7 this week. The headline number: 56.22% on SWE-Pro (a multi-language programming benchmark), matching GPT-5.3-Codex. On GDPval-AA, it scored an ELO of 1495 — the highest among open-source models.
MiniMax is calling M2.7 "self-evolving," claiming it can perform 30-50% of reinforcement learning research workflows autonomously. That's a bold claim, and the benchmarks from independent testers on r/LocalLLaMA are mixed — Kilo Code's team tested it against Qwen 3.5-plus, GLM-5, and Kimi K2.5 with competitive but not dominant results.
The real story isn't whether M2.7 is the best model. It's that the gap between frontier and open-source keeps shrinking. A year ago, matching GPT-level coding performance from a Chinese lab would have been front-page news. Now it's Tuesday. The commoditization of AI capability is happening faster than anyone predicted, and that's good news for anyone paying API bills.
Quick Hits
- Claude Code prompt engineering is having a moment on Reddit. The top tip making the rounds: use `CLAUDE.md` files aggressively to set project context, and structure your prompts as "here's what exists, here's what I want changed" rather than open-ended requests. Specific beats vague, every time.
- OpenAI's pricing overhaul continues to leak. A $100/month "Pro Lite" tier surfaced this week, sitting between the $20 Plus and $200 Pro plans. OpenAI previously admitted its subscription model was "accidental"; expect more restructuring as they chase enterprise revenue.
- Qwen 3.5 fine-tuning is getting serious traction in the open-source community. The 397B parameter model is becoming the go-to base for custom deployments, especially for teams that need multilingual support without paying frontier prices.
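The `CLAUDE.md` tip above is easier to apply with a shape in hand. This is a hypothetical example file, not an official template; the project name, commands, and conventions are invented for illustration.

```markdown
# CLAUDE.md (hypothetical project context file)

## What exists
- FastAPI service in `app/`, tests in `tests/`, run with `pytest -q`.
- Postgres access goes through `app/db.py`; no raw SQL elsewhere.

## Conventions
- Type hints everywhere; format with `ruff format` before committing.
- Prefer small, reviewable diffs over sweeping rewrites.

## How to ask for changes
State what exists, then the delta: "The `/orders` endpoint returns
unsorted results; change it to sort by `created_at` descending."
```

The last section mirrors the Reddit tip directly: describe the current state, then the specific change, instead of an open-ended request.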
Access Every Model Through One API
Claude, GPT-5, Qwen, MiniMax, and 200+ models. One endpoint, one API key, pay-as-you-go. No subscriptions, no regional blocks.
Try KissAPI Free →