RL-based Reasoning

  • SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution [paper]

    • FAIR at Meta, UIUC, GenAI at Meta, CMU

Note:

See The State of Reinforcement Learning for LLM Reasoning for a nice overview

Last updated