Siyuan Huang's picture

4 11 6

Siyuan Huang

chamber111

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

upvoted a paper about 2 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

upvoted a paper about 2 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

View all activity

Organizations

upvoted a paper about 1 month ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Paper • 2511.20549 • Published Nov 25, 2025 • 25

upvoted 3 papers about 2 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 42

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9, 2025 • 24

updated a collection about 2 months ago

VPPO Model

SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 4 items • Updated Nov 7, 2025 • 4

liked a model about 2 months ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated Nov 7, 2025 • 8 • 2

updated 2 models about 2 months ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated Nov 7, 2025 • 8 • 2

chamber111/VPPO-7B

Image-Text-to-Text • 8B • Updated Nov 7, 2025 • 120 • 5

published a model about 2 months ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated Nov 7, 2025 • 8 • 2

upvoted a paper 2 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 82

updated 3 datasets 3 months ago

chamber111/VPPO-Eval

Preview • Updated Oct 16, 2025 • 2.23k • 1

chamber111/VPPO_MMK12_validation

Viewer • Updated Oct 16, 2025 • 2k • 838 • 1

chamber111/VPPO_ViRL39K_train

Viewer • Updated Oct 16, 2025 • 38.9k • 854 • 1

updated a model 3 months ago

chamber111/VPPO-32B

33B • Updated Oct 16, 2025 • 17 • 2

New activity in chamber111/VPPO-7B 3 months ago

Add missing metadata tags

#1 opened 3 months ago by

New activity in chamber111/VPPO-Eval 3 months ago

Add task category, sample usage, and prominent links

#2 opened 3 months ago by

New activity in chamber111/VPPO_ViRL39K_train 3 months ago

Add task categories and update paper link

#1 opened 3 months ago by

New activity in chamber111/VPPO_MMK12_validation 3 months ago

Add task category to dataset card

#2 opened 3 months ago by

upvoted 2 collections 3 months ago

VPPO Data

Official training and evaluation datasets for the VPPO project. • 4 items • Updated Oct 13, 2025 • 3

VPPO Model

SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 4 items • Updated Nov 7, 2025 • 4