Inference-time scaling
s1: Simple test-time scaling [paper]
Stanford University.,University of Washington, Seattle. Allen Institute for AI, Contextual AI.
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters [paper]
UC Berkeley, Google DeepMind
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback [paper]
Shanghai AI Laboratory, The Chinese University of Hong Kong
S*: Test Time Scaling for Code Generation [paper]
University of California, Berkeley
Last updated