Inference-time scaling

s1: Simple test-time scaling [paper]
- Stanford University.,University of Washington, Seattle. Allen Institute for AI, Contextual AI.
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters [paper]
- UC Berkeley, Google DeepMind
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback [paper]
- Shanghai AI Laboratory, The Chinese University of Hong Kong
S*: Test Time Scaling for Code Generation [paper]
- University of California, Berkeley

Last updated 1 month ago