According to Beating, Prime Intellect has open-sourced general-agent, a self-evolving agent training environment that uses dual-player task generation mechanics. The system automatically generated 4,504 tasks and over 8,000 unique tools by alternating between a task synthesizer and solver, categorizing challenges into five difficulty tiers through nine strategies including constraint conditions, noisy instructions, and cross-entity coupling.
In testing, fine-tuning a 30B parameter model on 4,400+ trajectories from the environment improved tool-calling accuracy from 18.9% to 52.3% on the BFCL benchmark, demonstrating the framework's ability to generate semantically verified training data without reliance on manually annotated static datasets.