News
publication
Apr 26, 2026
War Games: A Frame-Based RL Environment for Long-Horizon Agents in Complex Real-Time Worlds
War Games measures whether frontier models can sustain long-horizon planning and adaptation in complex, non-stationary real-time games, with human performance as the benchmark.
research
Mar 25, 2026
Introducing Computer World Models
A theoretical framework for a learned supervisory runtime that infers machine state from continuous observation and produces two output streams: a rendered pixel surface and a guarded action stream grounded in a real host execution boundary, with Linux as the first concrete ABI.