🔥SemiAnalysis Actual test: GPT-5.5 returns to the forefront, but SWE-bench Pro is surpassed by Opus 4.7


Semiconductor and AI analysis organization SemiAnalysis releases a comparative evaluation of programming assistants, covering GPT-5.5, Opus 4.7, and DeepSeek V4. GPT-5.5 is based on the new pre-training codenamed "Spud," marking OpenAI's first return to the cutting edge of programming models in half a year, with SemiAnalysis engineers switching between Codex and Claude Code. Actual tests show division of strengths: Claude excels at planning new projects, while Codex is stronger in reasoning-intensive bug fixes. But the article reveals that Ope…
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pin