22 25 6

Jintao Zhang

jt-zhang

https://jt-zhang.github.io/

jt-zhang

AI & ML interests

Efficient ML

Recent Activity

authored a paper 5 days ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

updated a collection 5 days ago

efficient ml

upvoted a paper 5 days ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

View all activity

Organizations

authored a paper 5 days ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 11 days ago • 42

updated a collection 5 days ago

efficient ml

Collection

11 items • Updated 5 days ago • 2

upvoted a paper 5 days ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 11 days ago • 42

submitted a paper to Daily Papers 5 days ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 11 days ago • 42

authored a paper 5 days ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published 12 days ago • 51

upvoted a paper 6 days ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published 12 days ago • 51

updated a collection 6 days ago

efficient ml

Collection

11 items • Updated 5 days ago • 2

commented a paper 6 days ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published 12 days ago • 51 •

submitted a paper to Daily Papers 6 days ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published 12 days ago • 51

upvoted a paper 7 days ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published 17 days ago • 9

submitted a paper to Daily Papers 7 days ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published 17 days ago • 9

authored 2 papers 19 days ago

Residual Context Diffusion Language Models

Paper • 2601.22954 • Published 25 days ago • 33

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published 22 days ago • 33

upvoted 2 papers 20 days ago

Residual Context Diffusion Language Models

Paper • 2601.22954 • Published 25 days ago • 33

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published 22 days ago • 33

upvoted a paper 29 days ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published Jan 22 • 188

commented a paper about 2 months ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 95 •

authored a paper 2 months ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 95

updated a collection 2 months ago

efficient ml

Collection

11 items • Updated 5 days ago • 2

upvoted a paper 2 months ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 95

Jintao Zhang

AI & ML interests

Recent Activity

Organizations

jt-zhang's activity