AIMarkTechPostabout 5 hours ago

Harness-1: 20B retrieval subagent trained with reinforcement learning on gpt-oss-20b

9 min read

Harness-1, a 20B parameter retrieval subagent, is trained with reinforcement learning on gpt-oss-20b within a stateful search harness. The model operates inside a stateful search harness that manages evidence tracking and claim verification. Researchers from University of Illinois Urbana-Champaign, UC Berkeley, and Chroma developed it to separate search decisions from bookkeeping. The approach reduces optimization conflicts in search agent training.

Level

Hype check

Tap to vote and see what everyone thinks.

#harness-1 #reinforcement-learning #gpt-oss-20b

Read full story

More to chew on!

AIabout 24 hours ago

LLM Research Papers: The 2026 List (January to May)

Techabout 14 hours ago

Universal Memory Protocol, a shared format for agent memory