- The Neuron
- Posts
- đ¸ The Top 10 WILDEST GPT-5.2 Demos
đ¸ The Top 10 WILDEST GPT-5.2 Demos
PLUS: Test GPT-5.2, Opus 4.5, and Gemini 3 with us LIVE!

Welcome, humans.
đ´ Weâre LIVE RIGHT NOW to go fully hands on with GPT-5.2 and compare it to Gemini 3.0 and Opus 4.5.
Thanks to everyone who came out to yesterdayâs marketing tool round-up (and an extra special thank you to the folks who stayed a whole extra hour to mess around w/ GPT 5.2 in the playground)!
Weâve turned both yesterdayâs stream AND our stream with Andrew Hsu of Speak into blogs for you to skim through here:
Hereâs what happened in AI today:
We break down the wildests demos of GPT 5.2
Trump signed an AI executive order establishing national AI standards.
DeepMind signed a UK government deal for frontier AI applications.
Google released its Deep Research agent for you to use in your own apps.

Check out these WILD GPT-5.2 Demos
So yesterday Sam Altman and company unveiled GPT-5.2 as âthe smartest generally-available model in the world,â and within hours X filled up with wild demos, leaderboards, and benchmark takes that alternated between âthis is unrealâ and âthis is overhyped.â
The craziest GPT-5.2 demos so far fall into two camps: things that look like magic, and things that look like magic but fall apart under scrutiny.
It one-shot a full 3D graphics engine. Pietro Schirano asked GPT-5.2 Thinking (on high reasoning effort) to build a 3D graphics engine and got a single-file program with interactive camera controls and 4K export in one shotâno iterative prompt-chaining required.
Then it turned into a glowing physics toy. Building on that, Flavio Adamo had GPT-5.2 generate the âHex Bounceâ sceneâglowing balls ricocheting inside a rotating hexagon in three.jsâthat you can poke at in the Hex Bounce Glow playground, which has quickly become a go-to stress test for AI-written physics code.
It reads entire game scripts without losing the plot. To test long-context, Hangsiin fed GPT-5.2 an entire video-game script and reported that it stayed coherent and useful when answering detailed questions, outperforming earlier âlong-contextâ models that tended to forget key details.
It plays PokĂŠmon Crystal live on stream. In a very literal âagenticâ demo, Clad3815 wired GPT-5.2 up to play PokĂŠmon Crystal (Hard Mode) on stream, turning the model into a game-playing agent whose decisions you can watch in real time on Twitch.
It helps scientists think through real problems. Derya Unutmaz, an immunologist, says one of his first tests with GPT-5.2 Pro was asking it for the most innovative, first-principles questions needed to understand the immune systemâusing the model as a collaborator to surface deep research directions he might not have phrased the same way himself.
It can derive nontrivial math conditions on the fly. In a detailed thread, Sebastien Bubeck calls GPT-5.2 OpenAIâs best science model yet, pointing to scores like 92.4% on GPQA and 52.9% on ARC-AGI-2âand showing it independently derive the optimal
1.75/Lstep-size condition for smooth convex optimization, even using code to search for counterexamples.It built a 5,000-cell financial model⌠that valued a lemonade stand at $2.7B. On the âlooks amazing, still wrongâ side, Linas BeliĹŤnas asked GPT-5.2 to build his entire financial model: 30 reasoning tokens, 5,000+ cells, 18 interconnected sheets, modularized projections, dynamic scenarios, sensitivity tables, and pretty charts. As he put it in his LinkedIn write-up, ânone of the numbers added upââthe DCF priced his lemonade stand at $2.7B.
It makes genuinely beautiful, long-context writingâand keeps up over huge inputs. Elie Bakouch shared charts showing GPT-5.2 dramatically outperforming GPT-5.1 on long-context benchmarks, while other testers highlighted literary prompts where 5.2 sustains style and plot over multi-page inputs instead of degrading into repetition.
And yes, it still generates mesmerizing coding toys. Beyond the flagship demos, posts from builders like Jeremy Mack and Flavio Adamo show GPT-5.2 Purple and its siblings generating ball-physics sandboxes and neon-lit three.js scenes that feel closer to interactive art pieces than âhello worldâ tutorials.
So GPT-5.2 LOOKS LIKE the most capable generally-available model today⌠but then again, so does Gemini 3 and Opus 4.5. So weâre going to test all three of them on todayâs livestream to find out how capable they really are⌠and which is the best for your use case. Come join!

FROM OUR PARTNERS
Agents that donât suck
Are your agents working? Most agents never reach production.
Agent Bricks helps you build high-quality agents grounded in your data. We mean âhigh-qualityâ in the practical sense: accurate, reliable and built for your workflows.
Generic benchmarks donât cut it. Agent Bricks measures performance on the tasks that matter to your business.
Evaluate agents automatically, and keep improving accuracy with human feedback. With research-backed techniques for building, evaluating and optimizing, you can turn your business data into production agents faster â with governance built in from day one.

Prompt Tip of the Day
Use GPT-5.2 as your workflow architect, not just your answer machine.
Instead of asking, âHelp me with X,â try this two-step meta-prompt:
Step 1 â Design the workflow
âYouâre my AI workflow architect. I want to <goal> (e.g. âreview long contractsâ / âdebug large codebasesâ / âplan experimentsâ).
List 3 different workflows I could use with GPT-5.2 to do this reliably.
For each, include: the core loop (what I send you each step), which tools/context to attach, and where a human should verify or override the AI.
Then recommend the one you think I should start with and explain why in 3 sentences.â
Step 2 â Turn it into reusable prompts
âGreat, letâs implement Workflow #<n>.
Write me a single âmaster promptâ I can save and reuse for this workflow.
Then write 3â5 short follow-up prompts (âcheck for errorsâ, âsummarizeâ, âturn this into codeâ, etc.) that I can paste in as buttons/macros.â
The move here is simple: stop improvising new prompts every time, and make GPT-5.2 design and document a repeatable system for your use caseâcomplete with built-in verification steps so you donât end up with a $2.7B lemonade stand.

Treats to Try
*Asterisk = from our partners (only the first one!). Advertise to 600K readers here!
*Ideas move fast; typing slows them down. Wispr Flow matches your tone, handles punctuation and lists, and adapts to how you work on Mac, Windows, and iPhone. No start-stop fixing, no reformatting, just thought-to-text that keeps pace with you. When writing stops being a bottleneck, work flows.
Give your hands a break â start flowing for free today.
Gemini Deep Research conducts autonomous web research for youâit searches, finds gaps, searches again, then delivers a comprehensive report with citationsâpaid only rn ($2 per million input tokens).
Worktrace observes your teamâs day-to-day work and suggests concrete automations you can deploy, instead of making you guess where to use AI (raised $9M).
Roboflow Rapid makes you a computer vision model in minutes; just upload a video, describe what you care about, and spin up a working vision model and API in a few minutes instead of hand-labeling images for hours (demo).
Stripeâs Agentic Commerce Suite gives you the power to upload a catalog once and sell through many AI agents with unified discovery, checkout, and fraud protection.
AutoGLM is an open-source agent that understands Android phone workflowsâlike navigating maps or batching notifications (code, HuggingFace).
Ampâs âLook At Thisâ tool sends big PDFs and images to a helper model and returns only the relevant bits, saving your main coding agentâs context windowâno pricing details shared.

Around the Horn
Salesforce is hiking fees on apps that sync or copy customer data out of its platform, squeezing data-integration providers such as Fivetran.
US President Trump signed an executive order directing federal agencies to establish a single national AI regulatory framework and challenge state AI laws
Google DeepMindâs UK government deal will apply frontier AI to better public services, faster science, and tougher national security.
Agility Robotics will roll out its Digit humanoid robots in a Mercado Libre warehouse in San Antonio, Texas under a year-long exclusive deal.
Anthropic is now taking applications for its May and July 2026 AI safety Fellows cohorts, offering four-month paid stints focused on alignment and safety work (they also have a security track too!).

FROM OUR PARTNERS
Turn AI Into Your Income Stream
The AI economy is booming, and smart entrepreneurs are already profiting. Subscribe to Mindstream and get instant access to 200+ proven strategies to monetize AI tools like ChatGPT, Midjourney, and more. From content creation to automation services, discover actionable ways to build your AI-powered income. No coding required, just practical strategies that work.

Friday Trivia
We didnât have time to fit Thursday Trivia in the newsletter yesterday (emergency NL, OpenAI has a new model, yâknow how it is), so today youâre getting it on Friday!
A.

B.

Which is AI, and which is real?The answer is below, but place your vote to see how your guess everyone else (no cheating now!) |

A Catâs Commentary

Also, can I just say? Shout out to this couple, who totally DOMINATES the first row of results for âselfie couple horizontal instagramâ
![]() | Thatâs all for today!
|






