Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks

Source: Venture Beat | Published: December 12, 2025, 5:00 am | Read Original

Strong Bullish 83.8

Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks

The Allen Institute for AI (Ai2) recently released what it calls its most powerful family of models yet, Olmo 3. But the company kept iterating on the models, expanding its reinforcement learning (RL) runs, to create Olmo 3.1.The new Olmo 3.1 models focus on efficiency, transparency, and control for enterprises. Ai2 updated two of the three versions of Olmo 2: Olmo 3.1 Think 32B, the flagship model optimized for advanced research, and Olmo 3.1 Instruct 32B, designed for instruction-following, multi-turn dialogue, and tool use. Olmo 3 has a third version, Olmo 3-Base for programming, comprehension, and math. It also works well for continue fine-tuning. Ai2 said that to upgrade Olmo 3 Think 32B to Olmo 3.1, its researchers extended its best RL run with a longer training schedule. “After the

Read Source Login to use Pulse AI

Pulse AI Analysis

Pulse analysis not available yet. Click "Get Pulse" above.

This analysis was generated using Pulse AI, Glideslope's proprietary AI engine designed to interpret market sentiment and economic signals. Results are for informational purposes only and do not constitute financial advice.

Pulse AI Analysis

Related Insights

More Like This

Jury says Johnson & Johnson owes $40M to 2 cancer patients who used talcum powders

Engine failure forces United Airlines flight to return to Washington, D.C.-area airport

Why Disney’s $1B licensing deal with OpenAI makes sense. 💰

Flight Returns to Dulles After Engine Failure During Takeoff, F.A.A. Says

Reddit Sues Australia Over New Social Media Ban For Kids, They Want To Control Kids

The Securities and Exchange Commission publishes crypto custody guide

Market & Industry Analysis Straight to Your Inbox

My Notes