Best SWE-bench Verified Score

Highest score at autonomously resolving real GitHub issues, a core software engineering capability

93.9%

Trend YoY growth is +17.2%, slowing by 1.9 pp/month over the last 2 years. Latest YoY growth: +21.2%, which is 4.0 pp above trend (a 0.7σ deviation).
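The deviation figures in the summary can be checked with a few lines of arithmetic. The sketch below uses only the three numbers quoted above. The page does not state the standard deviation itself, so `implied_sigma_pp` is a back-of-envelope inference from the rounded 0.7σ figure, not a reported value.

```python
# Figures quoted in the summary line (YoY growth, in percent).
latest_yoy = 21.2   # latest YoY growth
trend_yoy = 17.2    # trend YoY growth
reported_z = 0.7    # deviation expressed in standard deviations (rounded)

# Deviation from trend, in percentage points.
deviation_pp = latest_yoy - trend_yoy

# Assumption: z = deviation / sigma, so sigma can be backed out approximately.
implied_sigma_pp = deviation_pp / reported_z

print(f"deviation from trend: {deviation_pp:+.1f} pp")   # +4.0 pp
print(f"implied sigma: about {implied_sigma_pp:.1f} pp")  # about 5.7 pp
```

Because both 4.0 pp and 0.7σ are rounded, the implied standard deviation is only approximate (somewhere between roughly 5.3 and 6.2 pp).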

Chart views: Level, YoY Change, Deviation from trend, Forecast

YoY trend line: y = 54.0% - 1.9 pp/mo · t

Rolling 1Y trend forecast

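| Forecast made in | May '25 | Jun '25 | Jul '25 | Aug '25 | Sep '25 | Oct '25 | Nov '25 | Dec '25 | Jan '26 | Feb '26 | Mar '26 | Apr '26 | May '26 | Jun '26 | Jul '26 | Aug '26 | Sep '26 | Oct '26 | Nov '26 | Dec '26 | Jan '27 | Feb '27 | Mar '27 | Apr '27 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| May '25 | +46.1% | +45.8% | +45.5% | +45.2% | +44.8% | +44.5% | +44.2% | +43.9% | +43.5% | +43.2% | +42.9% | +42.6% | +42.3% | +41.9% | +41.6% | +41.3% | +41.0% | +40.7% | +40.3% | +40.0% | +39.7% | +39.4% | +39.1% | +38.7% |
| Jun '25 | | +43.9% | +43.3% | +42.6% | +42.0% | +41.4% | +40.7% | +40.1% | +39.5% | +38.8% | +38.3% | +37.6% | +37.0% | +36.4% | +35.8% | +35.1% | +34.5% | +33.9% | +33.2% | +32.6% | +32.0% | +31.3% | +30.7% | +30.1% |
| Jul '25 | | | +43.3% | +42.6% | +42.0% | +41.4% | +40.7% | +40.1% | +39.5% | +38.8% | +38.3% | +37.6% | +37.0% | +36.4% | +35.8% | +35.1% | +34.5% | +33.9% | +33.2% | +32.6% | +32.0% | +31.3% | +30.7% | +30.1% |
| Aug '25 | | | | +42.6% | +42.0% | +41.4% | +40.7% | +40.1% | +39.5% | +38.8% | +38.3% | +37.6% | +37.0% | +36.4% | +35.8% | +35.1% | +34.5% | +33.9% | +33.2% | +32.6% | +32.0% | +31.3% | +30.7% | +30.1% |
| Sep '25 | | | | | +37.0% | +35.8% | +34.6% | +33.4% | +32.1% | +30.9% | +29.8% | +28.5% | +27.3% | +26.1% | +24.9% | +23.6% | +22.4% | +21.2% | +20.0% | +18.8% | +17.5% | +16.3% | +15.2% | +13.9% |
| Oct '25 | | | | | | +33.0% | +31.2% | +29.5% | +27.7% | +25.9% | +24.3% | +22.5% | +20.8% | +19.0% | +17.2% | +15.5% | +13.7% | +11.9% | +10.1% | +8.4% | +6.6% | +4.8% | +3.2% | +1.4% |
| Nov '25 | | | | | | | +26.4% | +23.9% | +21.4% | +18.8% | +16.5% | +14.0% | +11.5% | +9.0% | +6.5% | +4.0% | +1.4% | -1.1% | -3.6% | -6.1% | -8.6% | -11.1% | -13.4% | -16.0% |
| Dec '25 | | | | | | | | +23.1% | +20.4% | +17.7% | +15.3% | +12.6% | +10.0% | +7.3% | +4.6% | +1.9% | -0.8% | -3.4% | -6.1% | -8.7% | -11.4% | -14.1% | -16.6% | -19.3% |
| Jan '26 | | | | | | | | | +20.4% | +17.7% | +15.3% | +12.6% | +10.0% | +7.3% | +4.6% | +1.9% | -0.8% | -3.4% | -6.1% | -8.7% | -11.4% | -14.1% | -16.6% | -19.3% |
| Feb '26 | | | | | | | | | | +15.1% | +12.2% | +9.1% | +6.0% | +2.8% | -0.2% | -3.4% | -6.6% | -9.6% | -12.8% | -15.9% | -19.1% | -22.2% | -25.1% | -28.3% |
| Mar '26 | | | | | | | | | | | +11.9% | +8.8% | +5.8% | +2.7% | -0.3% | -3.4% | -6.5% | -9.5% | -12.6% | -15.6% | -18.7% | -21.8% | -24.6% | -27.7% |
| Apr '26 | | | | | | | | | | | | +15.0% | +12.7% | +10.3% | +8.0% | +5.7% | +3.3% | +1.0% | -1.3% | -3.6% | -6.0% | -8.4% | -10.5% | -12.9% |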
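
Each row of the forecast table falls by a nearly constant amount per month, which is what a straight-line extrapolation of YoY growth would produce. The page does not describe how the forecasts are computed, so the linear reading is an assumption. Under it, the slope each forecast implies can be recovered with an ordinary least-squares fit over the row's values. The helper name `least_squares_slope` is illustrative, and the two rows are copied from the table above.

```python
def least_squares_slope(values):
    """Slope of the least-squares line through (0, v0), (1, v1), ...

    Returned in the units of `values` per step (here: pp per month).
    """
    n = len(values)
    mean_t = (n - 1) / 2
    mean_v = sum(values) / n
    cov = sum((t - mean_t) * (v - mean_v) for t, v in enumerate(values))
    var = sum((t - mean_t) ** 2 for t in range(n))
    return cov / var


# Forecast rows copied from the table (YoY growth, in percent).
may_25 = [46.1, 45.8, 45.5, 45.2, 44.8, 44.5, 44.2, 43.9, 43.5, 43.2, 42.9, 42.6,
          42.3, 41.9, 41.6, 41.3, 41.0, 40.7, 40.3, 40.0, 39.7, 39.4, 39.1, 38.7]
nov_25 = [26.4, 23.9, 21.4, 18.8, 16.5, 14.0, 11.5, 9.0, 6.5, 4.0, 1.4, -1.1,
          -3.6, -6.1, -8.6, -11.1, -13.4, -16.0]

for label, row in [("May '25", may_25), ("Nov '25", nov_25)]:
    print(f"Forecast made in {label}: {least_squares_slope(row):+.2f} pp/month")
```

Comparing the two rows shows how much steeper the projected slowdown became between the May '25 and Nov '25 forecasts. The month-to-month steps within a row are not perfectly equal (partly because the values are rounded), so the fitted slope is an approximation of whatever the dashboard uses.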