
Aptura AI
We build evaluation datasets and RL environments that make AI reliable where mistakes are expensive: finance, healthcare, and legal. We design expert-curated training data, calibrated rubrics, and RL environments for frontier AI labs and startups pushing the frontier of what models can do. We are a small London-based team running multiple active projects, shipping work directly to labs, startups, and internal domain experts. Recent work includes SpreadsheetBench v2 and ultra long-horizon task research.