Schoenfeld's Anatomy of Mathematical Reasoning by Language Models Paper • 2512.19995 • Published 11 days ago • 14
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models Paper • 2512.19995 • Published 11 days ago • 14
Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction Paper • 2512.18880 • Published 12 days ago • 23
Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction Paper • 2512.18880 • Published 12 days ago • 23
V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions Paper • 2512.11995 • Published 21 days ago • 9
V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions Paper • 2512.11995 • Published 21 days ago • 9
Corpus-Steered Query Expansion with Large Language Models Paper • 2402.18031 • Published Feb 28, 2024 • 1
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers Paper • 2402.16914 • Published Feb 25, 2024
Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs Paper • 2511.07419 • Published Nov 10, 2025 • 26
BLIP3o-NEXT: Next Frontier of Native Image Generation Paper • 2510.15857 • Published Oct 17, 2025 • 24
ChartAB: A Benchmark for Chart Grounding & Dense Alignment Paper • 2510.26781 • Published Oct 30, 2025
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play Paper • 2509.25541 • Published Sep 29, 2025 • 140
Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era Paper • 2508.19570 • Published Aug 27, 2025
Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory Paper • 2509.14662 • Published Sep 18, 2025 • 13
CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning Paper • 2507.00045 • Published Jun 23, 2025 • 1
VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding Paper • 2508.07493 • Published Aug 10, 2025 • 8
Where to show Demos in Your Prompt: A Positional Bias of In-Context Learning Paper • 2507.22887 • Published Jul 30, 2025
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34
Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models Paper • 2505.21765 • Published May 27, 2025