rLLM and DeepSWE release

mananroongta · mananroongta · commit 71d6d01a4cf6 · 2025-07-02T17:50:56.000-07:00
diff --git a/blog.html b/blog.html
@@ -38,6 +38,38 @@ <h1>Archive</h1>
     <div class="year-label">2025
     <div class="posts-list">
 
+    <article class="blog-post">
+      <div class="date">
+        <h2>July<sup>1</sup></h2>
+      </div>
+      <div class="post-content">
+        <h3>
+          <a href="https://pretty-radio-b75.notion.site/rLLM-A-Framework-for-Post-Training-Language-Agents-21b81902c146819db63cd98a54ba5f31">
+            rLLM: Reinforcement Learning for Language Agents
+          </a>
+        </h3>
+        <p class="post-meta">
+          Date: July 1, 2025 | Estimated Reading Time: 10 min | Author: Sijun Tan, Michael Luo, Colin Cai
+        </p>
+      </div>
+    </article>
+
+    <article class="blog-post">
+      <div class="date">
+        <h2>July<sup>2</sup></h2>
+      </div>
+      <div class="post-content">
+        <h3>
+          <a href="https://pretty-radio-b75.notion.site/DeepSWE-Training-a-Fully-Open-sourced-State-of-the-Art-Coding-Agent-by-Scaling-RL-22281902c1468193aabbe9a8c59bbe33">
+            DeepSWE: Training a Fully Open-sourced, State-of-the-Art Coding Agent by Scaling RL
+          </a>
+        </h3>
+        <p class="post-meta">
+          Date: July 1, 2025 | Estimated Reading Time: 20 min | Author: Agentica x Together AI
+        </p>
+      </div>
+    </article>
+
     <article class="blog-post">
       <div class="date">
         <h2>April<sup>1</sup></h2>
@@ -65,15 +97,6 @@ <h3><a href="https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Previ
           <p class="post-meta">Date: February 10, 2025 | Estimated Reading Time: 10 min | Author: Michael Luo, Sijun Tan</p>
         </div>
       </article>
-      <!-- <article class="blog-post">
-        <div class="date">
-          <h2>July<sup>1</sup></h2>
-        </div>
-        <div class="post-content">
-          <h3>I love Teedy, my favorite dog.</h3>
-          <p class="post-meta">Date: January 1, 2025 | Estimated Reading Time: 1200 min | Author: Teedy</p>
-        </div>
-      </article> -->
     </div>
   </section>
 
diff --git a/images/people/manan_roongta.jpg b/images/people/manan_roongta.jpg
diff --git a/index.html b/index.html
@@ -36,7 +36,7 @@ <h1 class="logo-text">Agentica</h1>
     <div class="hero-text">
       <h2>Welcome to the Agentica Project! 👋 </h2>
       <p class="main-paragraph">
-        We are an open-source initiative to democratize reinforcement learning (RL) techniques and develop scalable systems for large language models (LLMs) and agents.
+        We are a open-source initiative spawning from Berkeley Sky Computing Lab to democratize reinforcement learning (RL) techniques and develop scalable systems for large language models (LLMs) and agents.
       </p>
       <div class="social-icons">
         <a href="mailto:agenticaproject@gmail.com">
@@ -57,12 +57,25 @@ <h2>Welcome to the Agentica Project! 👋 </h2>
 
   <!-- ========== PROJECT SECTION ========== -->
   <section id="project">
+    <a href="https://pretty-radio-b75.notion.site/rLLM-A-Framework-for-Post-Training-Language-Agents-21b81902c146819db63cd98a54ba5f31" rel="noopener noreferrer">
+      <h3>rLLM: Reinforcement Learning for Language Agents</h3>
+      <p class="main-paragraph">
+        We release rLLM, an open-source framework for post-training language agents via reinforcement learning. With rLLM, you can easily build their custom agents and environments, train them with reinforcement learning, and deploy them for real-world workloads.
+      </p>
+      <div class="post-meta">Date: July 1, 2025 | Estimated Reading Time: 10 min | Author: Sijun Tan, Michael Luo, Colin Cai </div>
+    </a>
+    <a href="https://pretty-radio-b75.notion.site/DeepSWE-Training-a-Fully-Open-sourced-State-of-the-Art-Coding-Agent-by-Scaling-RL-22281902c1468193aabbe9a8c59bbe33" rel="noopener noreferrer">
+      <h3>DeepSWE: Training a Fully Open-sourced, State-of-the-Art Coding Agent by Scaling RL</h3>
+      <p class="main-paragraph">
+        We release DeepSWE-Preview, a 32B software engineering agent (SWE) trained with purely RL that achieves 59% on SWEBench-Verified with test-time scaling,(42.2% Pass@1), topping the SWEBench leaderboard for open-weight models.      </p>
+      <div class="post-meta">Date: July 1, 2025 | Estimated Reading Time: 20 min | Author: Agentica x Together AI </div>
+    </a>
     <a href="https://pretty-radio-b75.notion.site/DeepCoder-A-Fully-Open-Source-14B-Coder-at-O3-mini-Level-1cf81902c14680b3bee5eb349a512a51" rel="noopener noreferrer">
       <h3>DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level</h3>
       <p class="main-paragraph">
         We release DeepCoder-14B-Preview, a code reasoning model finetuned from Deepseek-R1-Distilled-Qwen-14B via distributed RL. It achieves an impressive 60.6% Pass@1 accuracy on LiveCodeBench (+8% improvement), matching the performance of o3-mini-2025-01-031 (Low) and o1-2024-12-17 with just 14B parameters.
       </p>
-      <div class="post-meta">Date: April 8, 2025 | Estimated Reading Time: 15 min | Author: Agentica x Together AI </div>
+      <div class="post-meta">Date: July 1, 2025 | Estimated Reading Time: 15 min | Author: Agentica x Together AI </div>
     </a>
     <a href="https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2" rel="noopener noreferrer">
       <h3>DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL</h3>