Chinese Translation of this Article: https://huggingface.co/blog/vansin/one-year-since-the-deepseek-moment-cn
End-to-End Video Character Replacement without Structural Guidance Paper • 2601.08587 • Published 12 days ago • 8
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published about 1 month ago • 25 • 4
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management (Post)
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published Dec 17, 2025 • 62 • 9
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 253 • 6
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights Paper • 2512.01816 • Published Dec 1, 2025 • 92 • 5
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published Oct 27, 2025 • 97
SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation Paper • 2510.06303 • Published Oct 7, 2025 • 15