Marketing Homepage

Marketing Homepage

AI Models & Research

THURSDAY, MARCH 5, 2026

Gemini 3.1 Flash Lite Beats GPT-5 Mini, Claude Code Goes Voice, and Bing Makes GEO Official

Google launches Gemini 3.1 Flash Lite — 2.5x faster than previous version, outperforming GPT-5 mini and Claude 4.5 Haiku at $0.25/1M tokens. Claude hits $2.5B run-rate and overtakes ChatGPT in App Store. Apple's MacBook Neo launches at $599 with A18 Pro chip. Bing officially adds GEO to webmaster guidelines with opt-out mechanism. DeepSeek V4 drops with 1 trillion parameters and 1M context window.

Ready to Play

0:00 / 8:10

In This Episode

  • • Gemini 3.1 Flash Lite is 2.5x faster than Gemini 2.5 Flash at just $0.25/1M input tokens
  • • Gemini 3.1 Flash Lite outperforms GPT-5 mini and Claude 4.5 Haiku on 6 of 11 benchmarks
  • • Claude hits $2.5B annual run-rate with weekly users doubled since January
  • • Claude overtakes ChatGPT in App Store rankings
  • • Claude Code Voice Mode rolling out to 5% of users — speak coding commands directly
  • • MacBook Neo launches at $599 with A18 Pro chip — 50% faster, 16-hour battery
  • • Bing officially adds GEO to webmaster guidelines — first major search engine to do so
  • • DeepSeek V4: 1 trillion parameters, 1M context window, sparse MoE architecture

Transcript

[00:00] Introduction

Welcome to the Daily AI Digest for Thursday, March 5th, 2026.

Today: Gemini 3.1 Flash Lite outperforms GPT-5 mini at a fraction of the price. Claude hits $2.5B run-rate and overtakes ChatGPT in the App Store. Apple's MacBook Neo launches at $599. Bing officially adds GEO to its webmaster guidelines. And DeepSeek drops a one-trillion parameter model.

[00:45] Gemini 3.1 Flash Lite

Google launched Gemini 3.1 Flash Lite — 2.5x faster than Gemini 2.5 Flash, priced at $0.25/1M input tokens and $1.50/1M output tokens.

It outperforms GPT-5 mini and Claude 4.5 Haiku on 6 of 11 standard benchmarks — significant cost reduction without sacrificing capability.

Google also launched a Workspace CLI for Drive, Gmail, Calendar, Sheets, Docs, Chat, and Admin — built with AI agents in mind.

[01:45] Claude $2.5B Run-Rate & Voice Mode

Anthropic's Claude is running at a $2.5 billion annual revenue run-rate. Weekly users doubled since January. Claude has overtaken ChatGPT in App Store rankings.

Claude Code Voice Mode is now rolling out to 5% of users — speak commands like 'refactor the authentication middleware' and Claude executes them directly.

OpenAI updated ChatGPT to GPT-5.3 Instant — fewer hallucinations, fewer refusals, better web search. Full GPT-5.3 'Garlic' API launch expected soon.

[03:00] MacBook Neo at $599

Apple launched the MacBook Neo at $599 — the first Mac with the A18 Pro chip (same as iPhone 16 Pro). 50% faster than Intel Ultra 5, 16-hour battery life.

At $599, it undercuts the MacBook Air by $400 and brings Apple Silicon's Neural Engine to a much broader consumer segment.

Google Play Store fees also dropped to 20% (15% for new installs), with third-party app stores now able to apply for streamlined Android installation.

[03:45] Bing Makes GEO Official

Bing added Generative Engine Optimization to its official webmaster guidelines — the first major search engine to formally acknowledge GEO as a distinct practice.

Site owners can now opt out of AI training through Bing's webmaster tools.

GEO citation lift data: adding statistics lifts AI citations by up to 33.9%, quoting named experts adds up to 32%, citing sources adds up to 30.3%.

Context: AI Overviews appear on 88–99% of informational searches. CTR drops up to 61% when AI Overviews appear. Only 374 clicks per 1,000 US searches reach the open web. 85% of AI Overview citations come from sources with 3+ E-E-A-T signals.

[05:30] DeepSeek V4

DeepSeek V4: 1 trillion parameters, sparse Mixture-of-Experts architecture, 1 million token context window.

The sparse MoE design keeps inference costs manageable despite the enormous parameter count — decoupling parameter count from inference cost.

[06:00] New Tools

Base44: describe any app in plain English, it builds UI, database, auth, and hosting automatically.

Wispr Flow: voice dictation with 89% of messages sent with zero edits, 4x faster than typing.

Bolt.new: added built-in AI image generation with transparent backgrounds directly in the coding environment.

nexos.ai: no-code AI agents connecting to Google Drive, Gmail, and Slack with deep research capabilities.

OpenAI Codex is now available as a native Windows app via the Microsoft Store.

[06:45] What This Means For You

Test Gemini 3.1 Flash Lite against your current model — at $0.25/1M input tokens, cost savings on high-volume workloads could be substantial.

Audit your most important content pages for E-E-A-T signals: statistics, named expert quotes, and cited sources. 85% of AI citations go to high E-E-A-T content.

If you use Claude Code, watch for the voice mode rollout — currently at 5% of users and expanding.

[07:30] Closing

That's your Daily AI Digest for Thursday, March 5th, 2026. Gemini 3.1 Flash Lite reshapes the cost curve for AI APIs. Claude overtakes ChatGPT in the App Store. Bing formalizes GEO. And DeepSeek pushes the parameter frontier to one trillion.

Stay informed, stay ahead — and we'll see you tomorrow.

Free ResourceUpdated from 5 episodes

GEO Tactics That Actually Work

Compiled citation lift data from every episode — statistics (+33.9%), expert quotes (+32%), cited sources (+30.3%), and a 20-item implementation checklist. Everything you need to get cited by AI engines.

+33.9%from statistics
+32%from expert quotes
20 tacticsin the checklist

Never Miss an Episode

Subscribe to receive notifications when new episodes are published. Stay updated with the latest AI, SEO, and GEO insights.