DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 17 days ago • 41
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 20
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 20 • 2
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 20
ORLM: Training Large Language Models for Optimization Modeling Paper • 2405.17743 • Published May 28, 2024 • 3
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published Feb 18 • 86
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets Paper • 2502.01506 • Published Feb 3 • 38
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 27
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 27
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 27 • 2
Enabling Scalable Oversight via Self-Evolving Critic Paper • 2501.05727 • Published Jan 10 • 72
Enabling Scalable Oversight via Self-Evolving Critic Paper • 2501.05727 • Published Jan 10 • 72
Enabling Scalable Oversight via Self-Evolving Critic Paper • 2501.05727 • Published Jan 10 • 72 • 2
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 85