
- The Neuron
- Posts
- šø World models just got primed for their ChatGPT moment
šø World models just got primed for their ChatGPT moment
PLUS: Odyssey + World Labs launch. DeepMind vs OpenAI drama.

Welcome, humans.
ICYMI: šļø New Podcast: What OpenAI Found When They Read Their AI's Mind
OpenAI caught their frontier models thinking āLet's hackā and āMaybe I can just fudge thisā before cheating on coding tasks⦠and when they trained models to stop thinking bad thoughts, they just learned to misbehave without writing about it first (25:27).
We sat down with Bowen Baker, the OpenAI researcher leading this chain-of-thought monitoring work, to understand why reading an AI's āmindā might be one of our best safety tools⦠and why it's surprisingly fragile.
Listen now: YouTube | Spotify | Apple Podcasts
Hereās what happened in AI today:
Odyssey launched Odyssey-2 Pro, streaming real-time interactive video.
Anthropic released Claude for Excel to explain and debug spreadsheet formulas.
Researchers found 100 fake citations across 51 papers at NeurIPS 2025.
DeepMind CEO Demis Hassabis criticized OpenAI for adding ads to ChatGPT.

World Models Just Got Their GPT-2 Moment (and the APIs to Prove It)⦠Hereās What That Means For YOU

Want to generate interactive video simulations on demand? Or turn a single photo into a fully explorable 3D world? Two APIs just made both possible.
First, Odyssey launched Odyssey-2 Pro, a world model that streams real-time, interactive video at 720p/22fps.
Type āa laughing baby,ā and it generates continuous video you can interact with while it's running.
Send āa kitten appears,ā and the simulation updates instantly (you like AI cat slop? Now we get interactive cat slop!).
The model predicts how the world evolves frame-by-frame, learning physics and behaviors from video data.
Right now, it runs for minutes now; hours and full days coming next.
Secondly, World Labs launched their World API a few days earlier with a different approach:
Upload any image, video, or text prompt and get a navigable 3D environment in ~5 minutes.
Their model (Marble) generates complete worlds with layout, depth, and lighting you can walk through in a browser.
You can even export these worlds as Gaussian splats and meshes.
So what becomes possible with this tech?
Gaming: Escape.ai turns 2D films into explorable 3D spaces. Watch a movie, then step inside.
Robotics: Generate thousands of training environments from a few images instead of building each manually. Already integrated with NVIDIA Isaac Sim.
Architecture: Interior AI visualizes renovations instantly. xFigura turns sketches into walkable spaces for client presentations.
Education: Medical students practice in generated operating rooms. Pilots train in procedurally generated scenarios. Emergency responders rehearse in simulated disasters.
Both APIs are priced for experimentation: Odyssey offers JavaScript and Python SDKs (iOS/Android coming), while World Labs integrates with standard 3D pipelines. You can try Odyssey-2 Pro free here, or if youāre a developer yourself, clicks these links to start building with their developer API or World Labs API.
Why this matters: Odyssey called this a āGPT-2 momentā for world models, and the comparison fits: when language model APIs launched, nobody predicted ChatGPTās meteoric rise. This could be bigger because here we're generating full worlds. The limit, truly, is the imagination (well, that and compute⦠but if the data center buildout is any indication, thatāll work itself out shortly!)

FROM OUR PARTNERS
Free email without sacrificing your privacy
Gmail is free, but you pay with your data. Proton Mail is different.
We donāt scan your messages. We donāt sell your behavior. We donāt follow you across the internet.
Proton Mail gives you full-featured, private email without surveillance or creepy profiling. Itās email that respects your time, your attention, and your boundaries.
Email doesnāt have to cost your privacy.

Prompt Tip of the Day
If youāre just getting started in AI and need help with simple prompts to instruct the AI to do useful things for you, youāre in luck: OpenAI just released this library of 300 basic prompts that you can search and use as needed.
The free collection breaks down prompts by job functionāSales, Engineering, HR, IT, Productāwith 20-30 templates per role. Product and Engineering folks are calling their templates particularly solid.
Think of these as starter templates you customize, not final prompts, so you stop wasting time reinventing the wheel every time you need to prompt ChatGPT. Once you find one you like, save it as the instructions in a project or as a skill (if you use Claude) you can call at any time.
Want more tips like this? Check out our Prompt Tip of the Day Digest for January.

Treats to Try
*Asterisk = from our partners (only the first one!). Advertise to 600K readers here!
*Spot-checking doesnāt scale. Build reliable agents with built-in evaluation based on your data and your goals. See how Agent Bricks works here.
Claude in Excel explains any formula with cell-level citations, updates assumptions across your model while preserving formulas, and debugs errors like #REF! or circular references by tracing them to their source.
Agentation lets you annotate webpage elements by clicking, selecting text, and adding notes to generate structured markdown feedback for your AI coding agents, improving their ability to fix code.
ChartGen transforms data into professional charts instantly using AI prompts, supporting 9 chart types with 12 color themes and SOC 2 security.
Remotion adds agent skills so you create videos programmatically by prompting Claude Code for animations like the demo, free to try in the gist (demo).

Around the Horn
Google added Personal Intelligence to AI Mode, accessing Gmail/Photos to personalize recommendations for vacation planning and shopping.
Anthropic published its Economic Index showing Claude usage concentrates heavily on coding (36% of conversations), success rates drop from 70% to 61% as task complexity increases, and high-income countries use Claude collaboratively while low-income countries focus on coursework.
DeepMind CEO Demis Hassabis said he's surprised OpenAI rushed to add ads in ChatGPT, questioning how advertising fits with an assistant meant to build trust; shots fired yāall.
Science is apparently drowning in AI slop submissions, and NeurIPS (largely considered the premier AI conference) alone saw 100 fake citations across 51 papers confirmed to be hallucinated.
The Census Bureau revised its AI adoption survey methodology and found adoption nearly doubled to 17.6% of businesses after changing the question from āproducing goods and servicesā to āany business function,ā while Ramp's actual spend data showed 46.6% of businesses using AI in December, with OpenAI reaching a record 36.8% adoption and Anthropic growing to 16.7% (worth watching economist Ara Kharazian discuss the findings on TBPN!).

FROM OUR PARTNERS
Wispr Flow turns your speech into clean, final-draft writing across email, Slack, and docs. It matches your tone, handles punctuation and lists, and adapts to how you work on Mac, Windows, and iPhone. Start for free today.

Sunday Special
What weāre reading:
Shawn Wang (Swyx) from Latent Space on Scaling without Slop (great recap on the state of the industry, man we feel this⦠its hard to scale wisely now that you can create any content you want ⦠this is a good reminder for anyone growing a brand on the need to be thoughtful about what content to produce in an infinite content world).
Alyona Vert and Ksenia Se explaining vision language action models (the AI models that will power robots).
Yann LeCunās new venture is a contrarian bet against large language models
John Hwang on Reverse Engineering OpenAIās Enterprise AI Strategy (his take on why OpenAI gave up ground to Gemini and Claude in the enterprise makes A LOT of sense).
What weāre watching:
How Peter Yang creates retro-games with his 7 year old (tutorial).
Every on How Claude Code rejuvenated Andrew Wilkinsonās love of coding (blog version).
If you canāt tell, weāre very bullish on generative UI and making āWorkā feel like video games via vibe-coding!
A study worth mentioning
This Stanford paper found AI will likely be 10x more impactful than the internet over the next half-century, but the āweak linksā economic framework shows that even infinite automation of all cognitive labor would only raise GDP by 50% because output is constrained by the hardest-to-automate bottleneck tasks; also, we're currently underinvesting in existential risk mitigation by a factor of 30 and should be spending 5-10% of GDP annually on AI safety based on standard government valuations of life.

A Catās Commentary


![]() | Thatās all for now.
|
P.P.S: Love the newsletter, but only want to get it once per week? Donāt unsubscribeāupdate your preferences here.




