Runzhe Zhan

rzzhan

https://runzhe.me/

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published 6 days ago • 42

liked a model 1 day ago

MikaStars39/PeRL

Updated Dec 2, 2025 • 3

upvoted 5 papers about 2 months ago

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

Paper • 2410.22995 • Published Oct 30, 2024 • 3

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 42

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9, 2025 • 24

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

liked a model 2 months ago

ThinkMorph/ThinkMorph-7B

Any-to-Any • Updated Nov 3, 2025 • 82 • 12

upvoted a paper 2 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 82

New activity in rzzhan/ThinMQM-8B 2 months ago

Update model card: correct license, add pipeline_tag and library_name

#1 opened 2 months ago by

authored 4 papers 2 months ago

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

Paper • 2410.22995 • Published Oct 30, 2024 • 3

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios

Paper • 2410.23746 • Published Oct 31, 2024 • 1

Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore

Paper • 2405.04286 • Published May 7, 2024

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Paper • 2510.20780 • Published Oct 23, 2025 • 4

upvoted a paper 2 months ago

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Paper • 2510.20780 • Published Oct 23, 2025 • 4

updated a collection 2 months ago

ThinMQM

ThinMQM (automated translation evaluation, MQM) model and data collection. • 5 items • Updated Oct 27, 2025

commented on a paper 2 months ago

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Paper • 2510.20780 • Published Oct 23, 2025 • 4

New activity in rzzhan/ExGRPO-Llama3.1-8B-Instruct 2 months ago

Improve model card: Add pipeline tag, library, paper & code links, introduction, and installation

#1 opened 3 months ago by

New activity in rzzhan/ExGRPO-Llama3.1-8B-Zero 2 months ago

Improve model card: Add pipeline tag, library name, and detailed content

#1 opened 3 months ago by

New activity in rzzhan/ExGRPO-Qwen2.5-Math-1.5B-Zero 2 months ago

Improve model card: Add metadata, links, usage example, and evaluation results

#1 opened 3 months ago by