This demo shows how different LLM strategies work by analogy: a Roomba navigating a room with obstacles, carpet (slow zones), and a target to reach.
How to play (Human mode):
- Explore — Drag away from the Roomba and release to send a ghost scouting the room
- Aim & Shoot — After exploring, drag again to aim the Roomba and release to launch
AI modes to compare against:
- Standard LLM — fires once and hopes for the best
- Reasoning (CoT) — thinks first (ghost explores), then takes a calculated shot
- Agentic — moves, pauses to re-aim, and course-corrects iteratively
Battery = token budget. Carpet slows you down. Fog hides the room until explored. Can you beat the AI?
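
The three AI modes can be sketched as a toy simulation. This is an illustration only, not the demo's actual code: the grid room, the Manhattan distance, the `THINK_COST` charge for each ghost probe or re-aim, the burst length, and every function name below are assumptions made for the example.

```python
# Toy grid room (hypothetical layout): '#' obstacle, '~' carpet, S start, T target.
ROOM = [
    ".....~~..T",
    "..........",
    "..........",
    "..........",
    "S.#.......",
]
START, TARGET = (4, 0), (0, 9)
DIRS = {"N": (-1, 0), "S": (1, 0), "E": (0, 1), "W": (0, -1)}
THINK_COST = 1  # assumed battery cost per ghost probe or re-aim


def dist(pos):
    """Manhattan distance from pos to the target."""
    return abs(pos[0] - TARGET[0]) + abs(pos[1] - TARGET[1])


def roll(pos, d, battery, max_steps=99):
    """Roll from pos in direction d; return (end position, battery left)."""
    dr, dc = DIRS[d]
    for _ in range(max_steps):
        r, c = pos[0] + dr, pos[1] + dc
        if not (0 <= r < len(ROOM) and 0 <= c < len(ROOM[0])) or ROOM[r][c] == "#":
            break  # hit a wall or an obstacle
        cost = 2 if ROOM[r][c] == "~" else 1  # carpet drains more battery
        if battery < cost:
            break  # out of battery
        pos, battery = (r, c), battery - cost
        if pos == TARGET:
            break
    return pos, battery


def standard(battery):
    """Fire once along the larger axis toward the target, with no exploration."""
    dr, dc = TARGET[0] - START[0], TARGET[1] - START[1]
    if abs(dc) >= abs(dr):
        d = "E" if dc > 0 else "W"
    else:
        d = "S" if dr > 0 else "N"
    return roll(START, d, battery)


def reasoning(battery):
    """Ghost-probe every direction first, then take the single best shot."""
    battery -= THINK_COST * len(DIRS)  # thinking spends battery up front
    best = min(DIRS, key=lambda d: dist(roll(START, d, battery)[0]))
    return roll(START, best, battery)


def agentic(battery, burst=3):
    """Move in short bursts, re-aiming after each one."""
    pos = START
    while pos != TARGET and battery > THINK_COST:
        battery -= THINK_COST  # each re-aim spends battery
        best = min(DIRS, key=lambda d: dist(roll(pos, d, battery, burst)[0]))
        new_pos, new_battery = roll(pos, best, battery, burst)
        if dist(new_pos) >= dist(pos):
            break  # no burst gets closer; stop rather than wander
        pos, battery = new_pos, new_battery
    return pos, battery


for name, strategy in [("standard", standard), ("reasoning", reasoning), ("agentic", agentic)]:
    end, left = strategy(20)
    print(f"{name}: ends at {end}, distance {dist(end)}, battery left {left}")
```

In this particular room, with a battery of 20, the standard shot stops against the obstacle 12 cells from the target, the reasoning shot ends 9 cells away, and the agentic run reaches the target with 2 battery left. The ordering depends on the layout: the agentic loop here is greedy, so a room with a dead end could trap it.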