EPISODE · WEDNESDAY, MARCH 18, 2026

GPT-5.4 Mini Costs Less Than a Stamp, Google Now Reads Your Gmail, and Why Faster Code Is Making Teams Slower

OpenAI releases GPT-5.4 mini ($0.75/M tokens) and nano ($0.20/M tokens) — production models designed for high-volume workflows. Google expands Personal Intelligence across Search, Gemini, and Chrome, pulling from Gmail and Photos to personalise results. AI overviews cut click-through rates by 58%, but AI-referred visitors convert at higher rates. Stripe's coding agents push 1,300 PRs/week using blueprint hybrid workflows. And the Atrophy Paradox: heavy AI use reduces cognitive engagement and makes output converge.

Ready to Play

0:00 / 7:10

In This Episode

  • • GPT-5.4 nano: $0.20 per million input tokens — 10 novels of processing for 20 cents
  • • GPT-5.4 mini: $0.75 input / $4.50 output per million tokens — production-grade speed
  • • Google Personal Intelligence pulls from Gmail and Photos to personalise Search results
  • • AI overviews cut click-through rates by 58% — but AI-referred visitors convert at higher rates
  • • AEO strategy question: which content to feed AI crawlers, which to block?
  • • Stripe's coding agents push 1,300 PRs/week using blueprint hybrid workflows
  • • Blueprint pattern: deterministic nodes + agent loops = production-ready AI workflows
  • • Theory of Constraints: AI speeds up code writing but non-bottleneck stages create inventory
  • • Atrophy Paradox: heavy AI use reduces cognitive engagement and makes output converge
  • • Ahrefs builds world's 34th fastest supercomputer for AI-native SEO infrastructure
  • • LinkedIn dual algorithm: retrieval decides visibility, ranking orders by predicted interest

Show Notes

  • Introduction — GPT-5.4, Google Personal Intelligence, AEO
  • GPT-5.4 Mini and Nano — Production Models at Stamp Prices
  • Google Personal Intelligence — Search Becomes a Memory Layer
  • AEO and the 58% CTR Drop — Reframe Traffic as Conversion
  • Stripe's 1,300 PRs/Week — Blueprint Hybrid Workflow Pattern
  • AI Code Is Making Teams Slower — Theory of Constraints
  • The Atrophy Paradox — Routine vs Judgment Tasks
  • Quick Hits: Ahrefs Supercomputer, LinkedIn Dual Algorithm
  • GEO Tactic: AEO Content Audit — Top 20 Pages

Transcript

[0:00] Introduction

Welcome to the AI Daily Digest for Wednesday, March 18th, 2026.

Today we have OpenAI dropping two new models at prices that make the old pricing look like a joke. Google quietly turning Search into a memory layer that reads your Gmail. A data point that should reset your entire content strategy — AI overviews are cutting click-through rates by 58%. Stripe's coding agents pushing 1,300 pull requests a week. And a concept called the Atrophy Paradox that explains why heavy AI use might actually be making your team worse.

Let's go.

[0:40] GPT-5.4 Mini and Nano

OpenAI has released GPT-5.4 mini and GPT-5.4 nano. These are not research previews. They are production models designed for workflows where response time shapes the product.

The pricing: mini at $0.75 per million input tokens and $4.50 per million output tokens. Nano at $0.20 input and $1.25 output.

To put that in context — one million tokens is roughly 750,000 words, or about 10 full-length novels. You can process that for 20 cents with nano.

The strategic read: OpenAI is not just competing with Anthropic and Google anymore. It's competing with the cost of doing nothing. At nano pricing, the argument for not using AI in a workflow becomes very hard to make.

The catch: these are capability-limited models. Mini and nano are not GPT-5.4 full. They are optimised for speed and cost, not for the most complex reasoning tasks. But for the majority of production use cases, that trade-off is exactly right.

[1:50] Google Personal Intelligence

Google is expanding Personal Intelligence across Search, the Gemini app, and Chrome in the US. The system pulls from your Gmail, Google Photos, and browsing history to shape answers around your personal context.

This is not a feature update. This is a structural shift in what Search is.

Search is becoming a memory layer. Instead of returning the same results for the same query to every user, Google is now returning results shaped by who you are, what you've bought, where you've been, and what you've written in your email.

The GEO implication: traditional keyword optimisation assumes a universal query. If the same query now returns different results for different users based on personal context, the concept of ranking for a keyword becomes less meaningful.

[2:50] AEO and the 58% CTR Drop

This is the number that should reset your content strategy. AI overviews are slashing click-through rates by 58%.

Here's the counterintuitive finding: people who find companies via AI search convert at higher rates. The traffic volume is lower, but the intent is higher. AI search is filtering out the casual browsers and sending you the serious buyers.

The strategic reframe: stop measuring success by traffic volume. Start measuring by conversion rate and revenue per visitor. If AI search sends you half the traffic but those visitors convert at twice the rate, your revenue is the same — and your content costs are lower.

The AEO question every content team needs to answer: which content should be fed to AI crawlers, and which should be blocked? There is no universal answer.

[3:50] Stripe's 1,300 PRs Per Week

Stripe's coding agents are pushing 1,300 pull requests per week. They built something called Toolshed — an MCP server with 500 tools — and they're using hybrid workflows that mix deterministic nodes with agent loops.

The framework they call blueprints is worth understanding. A blueprint is not a pure agent loop where the AI decides everything. It's a structured workflow where some steps are hardcoded and deterministic, and other steps are handed to an agent.

This is the production-ready AI workflow pattern. Pure agent loops are too unpredictable for financial infrastructure. Pure deterministic workflows are too rigid for complex tasks. Blueprints give you both.

[4:40] AI Code Is Making Teams Slower

Here's the uncomfortable counterpoint to the Stripe story. A detailed analysis this week argues that increasing code writing speed with AI is actively worsening software delivery.

The argument is based on the Theory of Constraints. In any system, there is one bottleneck that limits throughput. If you speed up a non-bottleneck stage, you don't improve overall throughput — you just create inventory that piles up at the real bottleneck.

In most software teams, the bottleneck is not writing code. It's unclear requirements, wait times for review and approval, and fear of releasing. AI does not fix any of those.

The lesson: before you add AI to your development workflow, identify your actual bottleneck. If it's writing code, AI helps. If it's everything else — and it usually is — AI just moves the pile.

[5:30] The Atrophy Paradox

The Atrophy Paradox is a concept worth sitting with. Heavy AI use reduces cognitive engagement. When you outsource thinking to AI, you practice thinking less. Over time, your independent judgment atrophies.

The second effect: output converges. When everyone uses the same AI tools with similar prompts, the outputs start to look the same.

The prescription: distinguish between routine tasks and judgment tasks. Routine tasks — formatting, summarising, drafting boilerplate — automate those. Judgment tasks — strategic decisions, creative direction, relationship management — practice those.

This is the skill that will separate the people who thrive in the AI era from the people who become dependent on it.

[6:10] Quick Hits

Ahrefs has built the world's 34th fastest supercomputer. They're using it for keyword intent decoding, SEO content at scale, brand tracking in chatbots, automating technical SEO, detecting AI content, and fast localisation.

LinkedIn has two algorithms, not one. Retrieval decides if your post is shown at all. Ranking then orders it based on predicted interest. This explains why some posts go viral days after publication.

[6:50] GEO Tactic: AEO Content Audit

This week's tactic: run an AEO content audit. Go through your top 20 pages. For each one, ask three questions. First: is this content structured so an AI can extract a direct answer? Second: does this content answer a specific use-case question, or is it optimised for a keyword? Third: would blocking this from AI crawlers protect a competitive advantage, or would it just make you invisible?

That's the AI Daily Digest for Wednesday, March 18th, 2026. GPT-5.4 nano at 20 cents per million tokens is the price point that changes the calculation for every workflow. The 58% CTR drop is the number that changes the calculation for every content team. And the Atrophy Paradox is the concept that should change how you think about which tasks to automate and which to protect.

Subscribe on Apple Podcasts, Spotify, or YouTube. See you tomorrow. Stay citation-worthy.

Related Episodes

More episodes you might enjoy based on similar topics and categories.

AI Models & Research

Wednesday, March 25, 2026

Siri Becomes an Agent, OpenAI Kills Sora, and Why 97% of AI Content Fails After 3 Months

Apple is testing a standalone Siri app for iOS 27 and macOS 27 that transforms Siri from a voice assistant into a system-wide AI agent — unveiling expected at WWDC June 8. OpenAI has shut down Sora just months after launch; the real reason is compute economics: $13B revenue vs $100B projected spend. A 16-month experiment shows 97% of AI-generated pages fell out of the top 100 after 3 months without authority signals. Plus: Claude Code auto mode, MiniMax M2.7 engineering agent, the SaaS revenue durability crisis, and the end of free trials.

OpenAIGoogleAgentsAnthropic

9:10

AI Models & Research

Tuesday, March 24, 2026

Claude Takes Control of Your Mac, OpenAI's Capital Crunch, and Why Agent Interviews Beat Single Prompts

Anthropic's Claude Computer Use lands on macOS for Pro/Max users — full keyboard, cursor, browser, and file control with permission safeguards. OpenAI is offering private equity 17.5% guaranteed returns, a signal that frontier AI economics are harder than the public narrative suggests. The agent interview technique produces dramatically better AI output than single prompts. Plus: 53% of agency owners see AI as a significant threat, Google Personal Intelligence goes free for all US users, and ChatGPT citation data confirms cluster-based content wins.

OpenAIAgentsProductivityAnthropic

8:13

AI Models & Research

Saturday, March 21, 2026

Grok 4.20 Dethrones Perplexity, Google's Citation Pool Expands Beyond Page One, and WordPress Gets AI Agents

Grok 4.20 achieves a 22% hallucination rate on the AA-Omniscience benchmark — the lowest ever recorded — dethroning Perplexity as the top AI search recommendation. Google's AI Overviews now pull 62% of citations from beyond page-one results, driven by fan-out queries that expand citation eligibility to deep sub-topic content. WordPress adds 19 write operations to its MCP integration, giving AI agents control over 43% of the internet. Andrej Karpathy declares 'Code is dead, agents are everything.' Plus: Palantir's Maven AI becomes a Pentagon program of record and the White House releases a federal AI framework.

GoogleSEOAgentsLLMsResearch

7:00

AI Models & Research

Friday, March 13, 2026

GPT-5.4 Cites Brand Sites 56% of the Time, Google Maps Gets AI, and the Responses API Changes Everything

GPT-5.4 now cites brand sites 56% of the time — up 7x from GPT-5.3. Pricing pages cited 34x more. Google launches Ask Maps with Gemini. OpenAI Responses API gives agents full computer access. Meta's Avocado model delayed to May. Anthropic study: AI impacts knowledge workers most.

OpenAIGoogleSEOGEOMeta

5:48