<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0">
    <channel>
        <title>Benchmark - 标签 - Simi Studio</title>
        <link>/tags/benchmark/</link>
        <description>Benchmark - 标签 - Simi Studio</description>
        <generator>Hugo -- gohugo.io</generator><language>zh-CN</language><managingEditor>simi@simi.studio (Simi)</managingEditor>
            <webMaster>simi@simi.studio (Simi)</webMaster><lastBuildDate>Sun, 22 Feb 2026 10:00:00 &#43;0800</lastBuildDate><atom:link href="/tags/benchmark/" rel="self" type="application/rss+xml" /><item>
    <title>2026 年编程 Agent Benchmark：Claude Code vs Cursor vs Copilot vs Devin</title>
    <link>/posts/coding-agent-benchmark/</link>
    <pubDate>Sun, 22 Feb 2026 10:00:00 &#43;0800</pubDate>
    <author>simi@simi.studio (Simi)</author>
    <guid>/posts/coding-agent-benchmark/</guid>
    <description><![CDATA[2026 年初，各家编程 Agent 能力对比。用同一套测试题测试：完成率、代码质量、速度、成本，给一个客观横评。]]></description>
</item>
<item>
    <title>AI 编程智能评估：2026 年初各模型真实能力对比</title>
    <link>/posts/ai-code-intelligence-benchmarks/</link>
    <pubDate>Sun, 18 Jan 2026 10:00:00 &#43;0800</pubDate>
    <author>simi@simi.studio (Simi)</author>
    <guid>/posts/ai-code-intelligence-benchmarks/</guid>
    <description><![CDATA[2026 年初，Claude 3.7、GPT-4o、o3-mini、Gemini 2.0 各有高低。这篇文章给一个客观的编程能力横向对比，不吹不黑。]]></description>
</item>
</channel>
</rss>
