Blog

Training Strong Models to Verify IMO Solutions Using Unstructured Internet Data with Tinker

Training models to accurately grade International Mathematical Olympiad problems using only unstructured internet data and reinforcement learning.