s1: Simple test-time scaling [paper]
Stanford University, University of Washington, Seattle, Allen Institute for AI, Contextual AI
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters [paper]
UC Berkeley, Google DeepMind
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback [paper]
Shanghai AI Laboratory, The Chinese University of Hong Kong
S*: Test Time Scaling for Code Generation [paper]
University of California, Berkeley