ky666 (k) – Likes

liked a Space 3 months ago

The Smol Training Playbook

📚

2.96k

The secrets to building world-class LLMs

liked a model 4 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 1.5k • 362

liked 3 models 7 months ago

liked a model 10 months ago

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18, 2025 • 67k • 508

liked 2 models 11 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 262k • • 3.09k

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 46.6k • • 2.89k

liked a dataset 12 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 146 • 216

liked 2 models 12 months ago

microsoft/OmniParser-v2.0

Updated Mar 28, 2025 • 918 • 1.31k

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Text Generation • 8B • Updated May 10, 2025 • 26.4k • 293

liked a dataset 12 months ago

Conard/fortune-telling

Viewer • Updated Feb 17, 2025 • 207 • 342 • 168

liked a Space 12 months ago

The Ultra-Scale Playbook

🌌

3.67k

The ultimate guide to training LLM on large GPU Clusters

liked 6 models 12 months ago

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Reinforcement Learning • 8B • Updated Apr 7, 2025 • 565 • 33

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

Reinforcement Learning • 33B • Updated Apr 7, 2025 • 102 • 33

unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit

Text Generation • 18B • Updated Feb 14, 2025 • 2.45k • 29

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

33B • Updated Jan 25, 2025 • 12.6k • 144

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

Reinforcement Learning • 8B • Updated Mar 26, 2025 • 1.06k • 227

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.31k • 820

liked a dataset 12 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 12.6k • 706

k

AI & ML interests

Organizations

The Smol Training Playbook

microsoft/UserLM-8b

unsloth/Qwen3-32B-unsloth-bnb-4bit

unsloth/Qwen3-14B-unsloth-bnb-4bit

unsloth/GLM-Z1-32B-0414

ByteDance-Seed/UI-TARS-1.5-7B

deepseek-ai/DeepSeek-V3-0324

Qwen/QwQ-32B

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

microsoft/OmniParser-v2.0

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Conard/fortune-telling

The Ultra-Scale Playbook

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

open-r1/OpenR1-Math-220k

k

AI & ML interests

Organizations

ky666's activity

The Smol Training Playbook

The Ultra-Scale Playbook