10 10 9

Wenyi Hong

wenyi

wenyihong

AI & ML interests

multi-modal, pretrain

Recent Activity

liked a model 21 days ago

zai-org/GLM-4.6V-Flash

upvoted a paper 23 days ago

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

liked a model 23 days ago

zai-org/GLM-4.6V

View all activity

Organizations

liked a model 21 days ago

zai-org/GLM-4.6V-Flash

Image-Text-to-Text • 10B • Updated 21 days ago • 240k • • 520

upvoted a paper 23 days ago

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Paper • 2512.05905 • Published 25 days ago • 19

liked a model 23 days ago

zai-org/GLM-4.6V

Image-Text-to-Text • 108B • Updated 22 days ago • 162k • • 351

liked a Space 24 days ago

MotionBench Leaderboard

🐨

Submit and view leaderboard data for model evaluations

updated a Space 25 days ago

MotionBench Leaderboard

🐨

Submit and view leaderboard data for model evaluations

upvoted 3 papers about 1 month ago

MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning

Paper • 2511.06805 • Published Nov 10 • 12

WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation

Paper • 2511.06251 • Published Nov 9 • 13

UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

Paper • 2511.08195 • Published Nov 11 • 31

commented a paper about 1 month ago

UI2Code$^\text{N}$: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

Paper • 2511.08195 • Published Nov 11 • 31 •

updated a model about 2 months ago

zai-org/UI2Code_N

Image-Text-to-Text • 10B • Updated Nov 12 • 174 • 16

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.75k

The secrets to building world-class LLMs

upvoted a paper 2 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20 • 67

liked 2 models 5 months ago

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25 • 44.5k • • 699

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11 • 22.9k • • 1.39k

liked a model 6 months ago

zai-org/GLM-4.1V-9B-Thinking

Image-Text-to-Text • 10B • Updated Oct 25 • 197k • • 760

authored 5 papers 6 months ago

CogView: Mastering Text-to-Image Generation via Transformers

Paper • 2105.13290 • Published May 26, 2021

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Paper • 2402.04236 • Published Feb 6, 2024 • 9

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

Paper • 2309.03350 • Published Sep 4, 2023

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers

Paper • 2204.14217 • Published Apr 28, 2022

CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

Paper • 2205.15868 • Published May 29, 2022 • 1

Wenyi Hong

AI & ML interests

Recent Activity

Organizations

wenyi's activity

MotionBench Leaderboard

MotionBench Leaderboard

The Smol Training Playbook