Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale Paper • 2601.10338 • Published 4 days ago • 4
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published 4 days ago • 8
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published 3 days ago • 17
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published 3 days ago • 17
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 4 days ago • 17
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 3 days ago • 21
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published 4 days ago • 25
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering Paper • 2601.10402 • Published 4 days ago • 34
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning Paper • 2601.07641 • Published 6 days ago • 42
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 4 days ago • 74
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 4 days ago • 150
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs Paper • 2601.06786 • Published 8 days ago • 5
VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding Paper • 2601.07290 • Published 7 days ago • 6
JudgeRLVR: Judge First, Generate Second for Efficient Reasoning Paper • 2601.08468 • Published 6 days ago • 5
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published 8 days ago • 74