Tang Zhenyu

Tzy010822

Tzy010822

AI & ML interests

computer vision

Recent Activity

upvoted a paper 29 days ago

Soft Adaptive Policy Optimization

upvoted a paper 29 days ago

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

upvoted a paper 30 days ago

Back to Basics: Let Denoising Generative Models Denoise

View all activity

Organizations

upvoted 2 papers 29 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 30 days ago • 41

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Paper • 2511.20561 • Published 30 days ago • 31

upvoted a paper 30 days ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17 • 66

upvoted a paper 3 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 95

upvoted a paper 4 months ago

GenCompositor: Generative Video Compositing with Diffusion Transformer

Paper • 2509.02460 • Published Sep 2 • 25

updated a model 8 months ago

Tzy010822/unified_original_cfg

Updated May 8

published a model 8 months ago

Tzy010822/unified_original_cfg

Updated May 8

upvoted a paper 9 months ago

NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations

Paper • 2503.23162 • Published Mar 29 • 10

upvoted a paper about 1 year ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 69

liked a Space over 1 year ago

MeshFormer

🌟

Generate 3D mesh from an image

upvoted 2 papers over 1 year ago

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Paper • 2407.19548 • Published Jul 28, 2024 • 27

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 94

liked a dataset over 1 year ago

ShareGPT4Video/ShareGPT4Video

Viewer • Updated Mar 7 • 40.2k • 2.85k • 199

upvoted a paper over 1 year ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 74

liked a model over 1 year ago

internlm/internlm-xcomposer2-4khd-7b

Visual Question Answering • Updated Apr 18, 2024 • 7.64k • 73

updated a model over 1 year ago

Tzy010822/caption

1B • Updated Apr 20, 2024

upvoted a paper almost 2 years ago

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Paper • 2401.15947 • Published Jan 29, 2024 • 53

Tang Zhenyu

AI & ML interests

Recent Activity

Organizations

Tzy010822's activity

MeshFormer