V4-Pro Achieves 67% Coding Pass Rate in Internal Dogfooding Test, Approaching Opus 4.5 Performance

Gate News message, April 24 — V4 has publicly disclosed internal dogfooding data for its V4-Pro model. The company collected approximately 200 real-world engineering tasks from over 50 engineers, covering feature development, bug fixes, refactoring, and diagnostics across tech stacks including PyTorch, CUDA, Rust, and C++. After rigorous filtering, 30 tasks were retained for the benchmark evaluation.

V4-Pro-Max achieved a 67% coding pass rate on these 30 tasks, well ahead of Sonnet 4.5 (47%) and Haiku 4.5 (13%) and close to Opus 4.5 (70%). It still trails Opus 4.5 Thinking (73%) and Opus 4.6 Thinking (80%).
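To put these percentages in perspective, the short Python sketch below converts each reported pass rate into an implied number of passed tasks. It assumes every model was graded pass/fail on the same 30 retained tasks, which the disclosure as summarized here does not state explicitly. The name `implied_passes` is illustrative only.

```python
# Assumption: each model was scored pass/fail on the same 30 retained tasks.
TOTAL_TASKS = 30

# Pass rates (percent) as reported in the article.
reported_pass_rates = {
    "Haiku 4.5": 13,
    "Sonnet 4.5": 47,
    "V4-Pro-Max": 67,
    "Opus 4.5": 70,
    "Opus 4.5 Thinking": 73,
    "Opus 4.6 Thinking": 80,
}


def implied_passes(rate_percent, total=TOTAL_TASKS):
    """Return the whole number of tasks closest to the reported rate."""
    return round(rate_percent * total / 100)


implied = {model: implied_passes(rate) for model, rate in reported_pass_rates.items()}

for model, passed in implied.items():
    print(f"{model}: about {passed} of {TOTAL_TASKS} tasks")
```

Under that assumption, 67% corresponds to about 20 of 30 tasks and 70% to about 21, so the gap between V4-Pro-Max and Opus 4.5 comes down to roughly one task. With a sample this small, differences of a few percentage points should be read with caution.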

In an internal survey of 85 respondents, all reported using V4-Pro for agentic coding in their daily workflows. Of these, 52% endorsed V4-Pro as their default primary coding model, 39% leaned toward approval, and fewer than 9% expressed disapproval. Reported issues included low-level errors, misinterpretation of ambiguous prompts, and occasional overthinking.
