• The Neuron
  • Posts
  • 😺 Two free 3D world models dropped this week

😺 Two free 3D world models dropped this week

PLUS: the "trust battery" method for AI employees.

Welcome, humans.

Good news! You can now hire your robot dog to take your actual dog for a walk!

IDK why but this is giving 90s sled dog rescue movie Balto, which slaps

Bad news… you might never see either of them again… follow the northern lights home, Balto!

Moving on! Time for a check in: What tools are y’all using? Has your “daily driver” (the AI you use every day for your typical AI use cases) changed lately, or stayed the same? Fill out the poll below and let us know, and we’ll share everyone’s answers in tomorrow’s blurb.

What AI tools are you using the most right now?

Pick the option below that you use THE MOST, or if not listed, choose "other" and tell us what you're working with!

Login or Subscribe to participate in polls.

A side benefit to answering this poll? If we know what tools you all are actually using on a daily basis, it’ll help guide us on what we decide to cover more or less often!

Here’s what happened in AI today:

  • 😼 Two Chinese giants and NVIDIA shipped world models the same week; only two of them open-sourced theirs.

  • 📰 Factory raised $150M at a $1.5B valuation for autonomous coding agents.

  • 📰 OpenAI launched GPT-Rosalind, its first frontier reasoning model built for life sciences.

  • 🍪 NVIDIA's Lyra 2.0 turns a single image into a 3D world you can fly through.

  • 🎓 The "trust battery" method for giving your AI employee escalating autonomy.

P.S: Want to reach 675,000 AI-hungry readers? Click here to advertise with us.

P.P.S: Love robots? We’re starting a new robotics newsletter! Sign up early here.

😼 Two Open World Models Shipped This Week. Alibaba Waitlisted Theirs.

Three AI giants shipped navigable 3D worlds this week. Each one picked a different strategy:

Tencent open-sourced HY-World 2.0 on Hugging Face (the place to download and run your own AI models) with a full commercial license; take it, build on it, sell it. NVIDIA dropped Lyra 2.0 there too, but stamped it "research-only." And Alibaba, previously known for its popular open-source models, launched its new system behind a waitlist.

Here's what happened:

  • Tencent open-sourced HY-World 2.0, a multi-modal 3D world model that turns text, images, or video into editable 3D scenes (meshes and 3D Gaussian splats, a rendering method that uses fuzzy 3D "blobs" to represent scenes) you can drop straight into Unity, Unreal, or Blender.

    • Available w/ full commercial license. Code, weights, and a Gradio demo are all live on GitHub.

  • NVIDIA dropped Lyra 2.0 on Hugging Face the same week, a 14B framework that turns a single 480×832 image into a persistent, explorable 3D Gaussian scene you can fly through in real time (paper, project page).

    • License caveat: research use only, no production, no commercial output.

  • Alibaba launched Happy Oyster the same day Tencent open-sourced HY-World 2.0. It has two modes (Directing steers a generated scene in real time for up to 3 minutes at 480p or 720p; Wandering lets you move through a generated world for up to 1 minute via WASD).

    • Catch: limited early access, not open weights. Alibaba wants to monetize world-model compute through its cloud.

  • If it means something to ya, HuggingFace's Merve Noyan paired HY-World 2.0 and Lyra 2.0 as the week's two most important open-weight 3D drops.

Why this matters: “World models”, meaning in this case AI systems that understand 3D space, physics, and object permanence well enough to generate navigable environments, are becoming foundational infrastructure for robotics, game development, VR, and autonomous vehicles.

Until this week, the best work was locked behind Google DeepMind's Genie paywall, Fei-Fei Li’s World Labs API, or NVIDIA's enterprise tooling. Now game studios, indie devs, and robotics researchers can pull commercial-use weights off Hugging Face for free. The cost of entry dropped by several orders of magnitude in 48 hours.

Our take: The real signal here is the open-vs-closed divide:

  • Tencent went fully open (commercial license, run it anywhere) because it's playing the long game: commoditize the layer, profit on the applications.

  • NVIDIA went research-only because it wants researchers experimenting freely without cannibalizing its Omniverse business.

  • Alibaba went closed-early-access because its goal is Alibaba Cloud revenue, the same playbook it's running with Happy Horse video.

We believe open tools will win on sheer adoption, allowing commercial applications to be built on top of it. But more importantly, we believe world models like these will be the foundation for generating consistent, coherent, 3D worlds for games and eventually movies, too. Love to see the progress!

FROM OUR PARTNERS

More than 50 companies are backing Arm’s move into silicon — spanning cloud, chip design and software.

From AWS, Google and Microsoft to NVIDIA, Samsung, SK hynix and TSMC, the ecosystem signals a broader shift in how AI infrastructure is being built.

Rather than isolated components, the stack is becoming more tightly integrated — from architecture through to deployment.

Explore how this is taking shape: Arm Everywhere keynote

🎓 AI Skill of the Day: Give Your AI Agent a "Trust Battery" Before Handing It Real Autonomy

Builder Nityesh just shared the cleanest framework we've seen for scaling how much you let an AI agent do unsupervised. He calls it a "trust battery," borrowed from Shopify CEO Tobi Lütke's concept for human work.

The core idea: Every new AI agent starts at 20%. The battery charges with clean execution and thoughtful anticipation; it drains every time you repeat yourself. Only hand the agent tasks that match its current charge:

  1. 20-40% (Pair mode): You review every output. For money, clients, or permanent side effects.

  2. 40-60% (Async with checkpoints): Agent works alone, stops at key decisions.

  3. 60-80% (Full delegation with audit): Agent runs end-to-end; you spot-check the next morning.

  4. 80%+ (Autonomous): Agent decides when to ask.

The part that makes it work: run a nightly reflection against everything the agent did that day.

Review today's task outputs. For each one: (1) rate execution quality 1-5,(2) flag any repeated corrections from me, (3) propose one update to your CLAUDE.md or system prompt that would prevent the correction next time, and (4) decide whether your trust battery should go up, down, or flat. Output a single JSON summary I can paste into our ops doc tomorrow.

Drop it in a Claude Routine scheduled for midnight. Every morning you'll see whether your agent earned more autonomy overnight, and the self-updating CLAUDE.md loop means it keeps improving without you rewriting prompts.

Total AI beginner? Start here.

Have a specific skill you want to learn? Request it here. 

Replay: We Tested Claude Opus 4.7 LIVE

On Thursday morning, hours after Anthropic shipped Opus 4.7, Grant and his dev friend Kyle went live on YouTube to stress-test the new model in real time. They pushed the upgraded vision through a Final Fantasy Tactics sprite redesign, dug into the suspicious benchmark drops that back Nick Saave's theory that 4.7 is a distilled Mythos Preview, and caught Claude Code silently farming work to Haiku sub-agents. Learn Kyle's three-model workflow for squeezing more intelligence out of fewer tokens, and watch out for the long-context regression nobody's talking about.

🎧 Watch the replay on: YouTube | LinkedIn | Read the blog companion here.

🍪 Treats to Try

*Asterisk = from our partners (only the first one!). Advertise to 700K+ readers here!

  1. Nous Portal's Tool Gateway bundles 300+ models plus web scraping, browser automation, image generation, cloud terminal, and text-to-speech into one subscription with no separate API keys —paid only (Nous Portal subscription; trial available).

  2. Mozilla Thunderbolt is an open-source, self-hosted enterprise AI client letting organizations chat, search, research, and automate workflows across any models or data pipelines with full sovereignty, on web plus native apps —free and open source.

  3. QuiverAI lets you sketch, prompt, or describe a design and get editable vector output back (SVGs you can actually modify, not static bitmaps) —free public beta, open signup.

  4. Sam is a voice-first AI companion that sits in a senior's home doing daily check-ins, holding smart conversations, and sending real-time alerts to family when something seems off; pitched as safety plus connection instead of another gadget —paid (check site for pricing).

  5. CodeBurn is an interactive TUI cost dashboard for Claude Code, Codex, and Cursor that shows exactly where your AI coding tokens go at the task level —free, open source.

  6. MacMind is a complete single-layer transformer (embeddings, positional encoding, self-attention, backpropagation, gradient descent) implemented entirely in HyperTalk, the scripting language Apple shipped with HyperCard in 1987, running on a 1989 Mac —free, open source, absurdly nostalgic.

📰 Around the Horn

  • Proximal Labs launched FrontierSWE, an open ultra-long-horizon coding benchmark with 20-hour budgets on real tasks like optimizing video-rendering libraries; even GPT-5.4 and Opus 4.6 rarely finish in time.

  • Lucy Shi shared π0.7, a new steerable generalist robot model showing emergent compositional generalization from diverse training data: zero-shot shirt folding on unseen UR5e arms, operating an air fryer from verbal coaching alone (blog).

  • Adobe data shared via TechCrunch shows AI traffic to US retail sites rose 393% in Q1 2026 (up 269% in March alone), and AI-sourced shoppers convert better and spend more than non-AI visitors.

  • Roblox's AI assistant got agentic tools: Planning Mode turns prompts into editable action plans, Mesh Generation and Procedural Model Generation build assets, and self-correcting playtesting agents close the loop for end-to-end game creation.

FROM OUR PARTNERS

Are you tracking agent views on your docs?

AI agents already outnumber human visitors to your docs — now you can track them.

🌟 Sunday Special: The Week in Five Stories

Catching up after a week of touching grass? Here's what mattered.

  1. Anthropic shipped Opus 4.7. OpenAI countered hours later. Vision jumped to 82.1%, SWE-bench Pro to 64.3%, and a hidden new tokenizer quietly raised your Claude bill up to 35%.

  2. OpenAI's Codex overhaul landed the same afternoon with Mac computer use, 111 plugins, and scheduled automations (Full breakdown).

  3. Canva declared itself an AI platform with design tools. COO Cliff Obrecht said the quiet part aloud onstage: "Until now, Canva has been a design platform with AI tools. Now we become an AI platform with design tools." 

  4. Same day, Anthropic CPO Mike Krieger quit Figma's board ahead of Anthropic shipping competing design software (more on that tomorrow).

  5. OpenAI launched GPT-Rosalind, a frontier reasoning model built for biology, drug discovery, and translational medicine, deployed under trusted-access terms to Moderna, Amgen, the Allen Institute, and Thermo Fisher. The biggest shift since OpenAI started naming models after historical figures.

A Cat’s Commentary

That’s all for now.

What'd you think of today's email?

Login or Subscribe to participate in polls.

P.P.S: Love the newsletter, but only want to get it once per week? Don’t unsubscribe—update your preferences here.