Community Blog & Articles

Community Articles

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Why You Should Care About Partial Differential Equations (PDEs)

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

EuroLLM-22B

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Gotchas in Tokenizer Behavior Every Developer Should Know

Make and publish your Reachy Mini App

Muon vs MuonClip vs Muon+AdamW for Fine-Tuning

Phare LLM benchmark V2: Reasoning models don't guarantee better security

I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago"

KV Caching Explained: Optimizing Transformer Inference Efficiency

What is the Hugging Face Community Building?

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Diffusion Language Models: The New Paradigm

Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications

Uncensor any LLM with abliteration

Mastering Tensor Dimensions in Transformers

Topic 23: What is LLM Inference, it's challenges and solutions for it

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

December 17, 2025

CUGA on Hugging Face: Democratizing Configurable AI Agents

December 15, 2025

New in llama.cpp: Model Management

December 11, 2025

llmfine-tuningopen-source

Codex is Open Sourcing AI models

December 11, 2025

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

December 9, 2025

swifthubopen-source

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

December 5, 2025

llmreasoningagents

DeepMath: A lightweight math reasoning Agent with smolagents

December 4, 2025

llmfine-tuningopen-source

We Got Claude to Fine-Tune an Open Source LLM

December 4, 2025

transformersv5community

Transformers v5: Simple model definitions powering the AI ecosystem

December 1, 2025

diffusersfluxquantization

Diffusers welcomes FLUX-2

+4

November 25, 2025

transformerspytorchoptimization

Continuous batching from first principles

November 25, 2025

Building Deep Research: How we Achieved State of the Art

November 24, 2025

OVHcloud on Hugging Face Inference Providers 🔥

November 24, 2025

llmexperimentationfine-tuning

20x Faster TRL Fine-tuning with RapidFire AI

November 21, 2025

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Why You Should Care About Partial Differential Equations (PDEs)

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

EuroLLM-22B

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Gotchas in Tokenizer Behavior Every Developer Should Know

Make and publish your Reachy Mini App

Muon vs MuonClip vs Muon+AdamW for Fine-Tuning

Phare LLM benchmark V2: Reasoning models don't guarantee better security

I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago"

KV Caching Explained: Optimizing Transformer Inference Efficiency

What is the Hugging Face Community Building?

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Diffusion Language Models: The New Paradigm

Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications

Uncensor any LLM with abliteration

Mastering Tensor Dimensions in Transformers

Topic 23: What is LLM Inference, it's challenges and solutions for it

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

View all articles