UTEXAS (University of Texas at Austin)

MarioBarbeque

authored a paper 11 days ago

Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training

Paper • 2512.13706 • Published 28 days ago • 1

SP2001

authored a paper 2 months ago

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms

Paper • 2510.13913 • Published Oct 15, 2025 • 3

SP2001

authored 2 papers 3 months ago

EgoVLM: Policy Optimization for Egocentric Video Understanding

Paper • 2506.03097 • Published Jun 3, 2025

Hard2Verify: A Step-Level Verification Benchmark for Open-Ended Frontier Math

Paper • 2510.13744 • Published Oct 15, 2025 • 5

SP2001

authored a paper 4 months ago

SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

Paper • 2509.06283 • Published Sep 8, 2025 • 17

abao

authored a paper 4 months ago

Concentration of Measure for Distributions Generated via Diffusion Models

Paper • 2501.07741 • Published Jan 13, 2025

fcyin

authored a paper 7 months ago

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Paper • 2501.05414 • Published Jan 9, 2025 • 2

gdhe17

authored 2 papers 7 months ago

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8, 2024 • 2

Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

Paper • 2405.04233 • Published May 7, 2024 • 3

ajd12342

authored a paper 7 months ago

Rhapsody: A Dataset for Highlight Detection in Podcasts

Paper • 2505.19429 • Published May 26, 2025 • 1

gdhe17

authored a paper 7 months ago

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published Jun 9, 2025 • 30

fcyin

authored a paper 8 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 17

ajd12342

authored a paper 9 months ago

Scaling Rich Style-Prompted Text-to-Speech Datasets

Paper • 2503.04713 • Published Mar 6, 2025 • 1

gdhe17

authored 2 papers 10 months ago

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Paper • 2503.01103 • Published Mar 3, 2025 • 5

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Paper • 2502.15894 • Published Feb 21, 2025 • 20

SP2001

authored 3 papers 10 months ago

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

Paper • 2407.06249 • Published Jul 8, 2024

SFR-RAG: Towards Contextually Faithful LLMs

Paper • 2409.09916 • Published Sep 16, 2024 • 1

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

Paper • 2410.03727 • Published Sep 30, 2024 • 2

XCLiu

authored a paper 11 months ago

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Paper • 2502.06608 • Published Feb 10, 2025 • 39

ajd12342

authored a paper 12 months ago

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 3

Team members 240
