AIVentureBeatabout 2 hours ago

Weibo's 3B model matches giant AI reasoning scores

12 min read

Sina Weibo researchers published a paper claiming their 3-billion-parameter VibeThinker-3B model matches or exceeds reasoning performance of much larger systems from Google DeepMind, OpenAI, Anthropic, and DeepSeek. VibeThinker-3B scored 94.3 on the AIME 2026 math exam. The claim has sparked debate over benchmark validity.

Level

Hype check

Tap to vote and see what everyone thinks.

#weibo #ai #benchmarks

Weibo's 3B model matches giant AI reasoning scores

More to chew on!

More to chew on!