RL-based Reasoning
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution [paper]
FAIR at Meta, UIUC, GenAI at Meta, CMU
Note:
See The State of Reinforcement Learning for LLM Reasoning for a nice overview
Last updated
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution [paper]
FAIR at Meta, UIUC, GenAI at Meta, CMU
Note:
See The State of Reinforcement Learning for LLM Reasoning for a nice overview
Last updated