Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

33

Full-text search

Active filters: nebius

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 9.42M • • 5.14k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 7.58M • • 4.07k

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 938k • • 12k

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 4.24M • • 4.25k

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8 • 398k • • 1.54k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 748k • • 4.46k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 1.72M • • 1.74k

Qwen/Qwen3-30B-A3B-Instruct-2507

Text Generation • 31B • Updated Sep 17 • 574k • • 700

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated 15 days ago • 1.02M • • 815

meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 408k • • 2.6k

zai-org/GLM-4.5-Air

Text Generation • 110B • Updated Aug 11 • 555k • • 540

google/gemma-2-2b-it

Text Generation • 3B • Updated Aug 27, 2024 • 737k • • 1.25k

Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7 • 888k • • 486

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated Nov 7 • 61k • • 2.28k

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated Sep 17 • 113k • • 735

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • 12B • Updated 23 days ago • 37.8k • • 145

Qwen/Qwen3-32B

Text Generation • 33B • Updated Jul 26 • 4.26M • • 600

Qwen/Qwen3-Coder-480B-A35B-Instruct

Text Generation • 480B • Updated Aug 21 • 113k • • 1.26k

Qwen/Qwen3-30B-A3B-Thinking-2507

Text Generation • 31B • Updated Aug 17 • 565k • • 326

google/gemma-2-9b-it

Text Generation • 9B • Updated Aug 27, 2024 • 150k • • 751

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 107k • • 572

BAAI/bge-multilingual-gemma2

Feature Extraction • 9B • Updated Oct 13 • 312k • • 193

Qwen/Qwen2.5-Coder-7B

Text Generation • 8B • Updated Nov 18, 2024 • 28.2k • • 130

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 58.1k • • 385

NousResearch/Hermes-4-405B

Text Generation • 406B • Updated Sep 2 • 209 • • 78

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation • 253B • Updated Oct 15 • 170k • • 341

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29 • 475k • • 2.39k

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11 • 24.1k • • 1.39k

NousResearch/Hermes-4-70B

Text Generation • 71B • Updated Sep 2 • 2.02k • • 160

intfloat/e5-mistral-7b-instruct

Feature Extraction • 7B • Updated Apr 23, 2024 • 137k • • 552