Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nm-testing 's Collections
KV Cache Quantization
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models

LLM Compressor testing

updated Nov 17
Upvote
-

  • nm-testing/tinysmokellama-3.2

    354k • Updated Sep 17 • 34.5k

  • nm-testing/llama2.c-stories42M-pruned2.4

    Updated Oct 29 • 553

  • nm-testing/tinyllama-fp8-dynamic-compressed

    1B • Updated Oct 9, 2024 • 404

  • nm-testing/tinyllama-w4a16-compressed

    0.3B • Updated Oct 9, 2024 • 829

  • nm-testing/tinyllama-w8a8-compressed

    1B • Updated Oct 9, 2024 • 877

  • nm-testing/tinyllama-w8a16-dense

    1B • Updated Oct 9, 2024 • 249

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-compressed

    1B • Updated Jan 14 • 589

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-uncompressed

    1B • Updated Jan 14 • 166

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-compressed

    0.3B • Updated Jan 14 • 195

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-uncompressed

    1B • Updated Jan 14 • 63

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-compressed

    1B • Updated Jan 14 • 209

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-uncompressed

    1B • Updated Jan 14 • 73

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-compressed

    0.4B • Updated Jan 14 • 576

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-uncompressed

    1B • Updated Jan 14 • 159
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs