<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0">
    <channel>
        <title>Reasoning Models - Tag - Simi Studio</title>
        <link>/en/tags/reasoning-models/</link>
        <description>Reasoning Models - Tag - Simi Studio</description>
        <generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>simi@simi.studio (Simi)</managingEditor>
            <webMaster>simi@simi.studio (Simi)</webMaster><lastBuildDate>Thu, 08 Jan 2026 14:15:00 &#43;0800</lastBuildDate><atom:link href="/en/tags/reasoning-models/" rel="self" type="application/rss+xml" /><item>
    <title>o3 Real Performance on Engineering Tasks: Not Every Problem Is Worth the Wait</title>
    <link>/en/posts/openai-o3-reasoning-analysis/</link>
    <pubDate>Thu, 08 Jan 2026 14:15:00 &#43;0800</pubDate>
    <author>simi@simi.studio (Simi)</author>
    <guid>/en/posts/openai-o3-reasoning-analysis/</guid>
    <description><![CDATA[o3 launched with viral coverage. But honestly, not every scenario is worth o3's price. This article gives an objective evaluation of o3's engineering capability.]]></description>
</item>
<item>
    <title>Gemini Reasoner: First Model to Surpass Human Average on Complex Reasoning</title>
    <link>/en/posts/gemini-reasoner-analysis/</link>
    <pubDate>Mon, 05 Jan 2026 10:00:00 &#43;0800</pubDate>
    <author>simi@simi.studio (Simi)</author>
    <guid>/en/posts/gemini-reasoner-analysis/</guid>
    <description><![CDATA[On January 5, 2026, Google DeepMind released Gemini Reasoner—the first model to systematically outperform human average on complex cross-modal reasoning tasks including scientific hypothesis generation, causal inference, and long-horizon planning.]]></description>
</item>
</channel>
</rss>
