Kyu Song

kyunocap

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 7 days ago • 170

upvoted a paper 9 days ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 19 days ago • 134

upvoted a paper 10 days ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 11 days ago • 25

liked a model 12 days ago

lovis93/next-scene-qwen-image-lora-2509

Image-to-Image • Updated Oct 21, 2025 • 51.9k • 567

liked a Space 12 days ago

LTX-2 Video Fast

🎥

167

Fast high quality video with audio generation

upvoted 2 papers 13 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 17 days ago • 205

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published 16 days ago • 161

upvoted a paper 18 days ago

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published 19 days ago • 99

liked a model about 1 month ago

facebook/pe-av-large

2B • Updated Dec 23, 2025 • 481 • 48

upvoted 6 papers about 1 month ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published Dec 9, 2025 • 119

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 145

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Paper • 2512.10881 • Published Dec 11, 2025 • 29

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

Composing Concepts from Images and Videos via Concept-prompt Binding

Paper • 2512.09824 • Published Dec 10, 2025 • 28

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 46

upvoted 5 papers about 2 months ago

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published Dec 8, 2025 • 17

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published Dec 8, 2025 • 50

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 132

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Paper • 2512.00473 • Published Nov 29, 2025 • 26

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 59
