
Scale AI for reinforcement learning environments.
Idler builds reinforcement learning environments that teach AI models to code like 0.01% engineers. Our training environments are based on real-world coding scenarios that frontier models will actually encounter.
We've closed a multimillion-dollar contract with a leading foundation lab (the largest they've issued to date). Demand is outpacing our capacity to deliver, so we're scaling the team fast.
Build agentic systems that create and QA coding environments at scale. Most of your day will be spent designing these systems to be extremely sound. A big part of our work is thinking critically about what makes a coding environment and task "good" and "fair". This requires high agency and philosophical thinking alongside technical execution.
Concretely, you'll:
Design and build scaleable systems that generate RL environments
Create automated QA systems to validate environment quality and fairness
Work directly with AI researchers at leading labs to understand what makes training data effective
Support new product lines as we expand beyond coding environments
The founding team, a founding engineer, and a small group of engineers (we're hiring quickly). You'll have direct access to AI researchers at frontier labs.
Typescript, React, NodeJS, Postgres, Redis, Vercel, Cursor
Healthcare coverage, 401(k), and 15 days PTO.
Meals, coffee, and snacks (that you will actually enjoy) covered during working days.
Latest MacBook Pro and equipment.
Relocation assistance available.
Team offsites and events (we love hanging out).
This is an in-person role in San Francisco. We're a tight-knit founding team and we play to win. Join us if you like to win too.
Idler builds reinforcement learning environments that teach AI models to code like 0.01% engineers. Our training environments are based on real-world coding scenarios that frontier models will actually encounter.
We've closed a multimillion-dollar contract with a leading foundation lab (the largest they've issued to date), and demand is outpacing our capacity to deliver. We're scaling fast to build the training data layer for frontier AI. Every breakthrough model will need environments like ours to learn real-world skills at scale.
We're a tight-knit founding team in San Francisco. We move quickly, think deeply about what makes good training data, and play to win. Join us if you want to win too.