Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

4,396

Full-text search

Active filters: quantized

aphoticshaman/qwen-72b-math-nf4

Text Generation • 73B • Updated 1 day ago • 18 • 1

MaziyarPanahi/Hermes-4.3-36B-GGUF

Text Generation • 36B • Updated 1 day ago • 60 • 1

steampunque/Ministral-3-3B-Instruct-2512-Hybrid-GGUF

3B • Updated about 20 hours ago • 40 • 1

aphoticshaman/deepseek-coder-v2-lite-nf4

Text Generation • 16B • Updated 1 day ago • 9 • 1

Doradus/Hermes-4.3-36B-FP8

Text Generation • 36B • Updated 1 day ago • 1

thesby/Qwen2.5-VL-7B-NSFW-Caption-V4-W8A16

Text Generation • 4B • Updated Oct 5 • 135 • 3

ravenscroftj/CodeGen-350M-multi-ggml-quant

Text Generation • Updated Apr 24, 2023 • 2

ravenscroftj/CodeGen-2B-multi-ggml-quant

Text Generation • Updated Aug 5, 2023 • 2

ravenscroftj/CodeGen-6B-multi-ggml-quant

Text Generation • Updated Apr 24, 2023 • 9

ethzanalytics/dolly-v2-12b-sharded-8bit

Text Generation • Updated Apr 29, 2023 • 11 • 4

ethzanalytics/dolly-v2-7b-sharded-8bit

Text Generation • Updated Jun 28, 2023 • 15 • 1

pszemraj/long-t5-tglobal-xl-16384-book-summary-8bit

Summarization • 3B • Updated Jan 21 • 10

ethzanalytics/stablelm-tuned-alpha-7b-sharded-8bit

Text Generation • Updated May 4, 2023 • 13 • 2

rozek/OpenLLaMA_7B_300BT_q4

Text Generation • Updated May 5, 2023 • 1

ethzanalytics/stablelm-tuned-alpha-3b-gptq-4bit-128g

Text Generation • Updated May 7, 2023 • 7

kyo-takano/open-calm-7b-8bit

Text Generation • Updated May 28, 2023 • 20 • 10

CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA

Text Generation • Updated Jul 20, 2023 • 8

CONCISE/LLaMa_V2-13B-Chat-Uncensored-GGML

Text Generation • Updated Aug 7, 2023 • 8 • 7

CONCISE/LLaMa_V2-13B-Instruct-Uncensored-GGML

Text Generation • Updated Aug 17, 2023 • 8 • 5

rozek/LLaMA-2-7B-32K_GGUF

Text Generation • 7B • Updated Aug 31, 2023 • 782 • 9

rozek/LLaMA-2-7B-32K-Instruct_GGUF

Text Generation • 7B • Updated Aug 31, 2023 • 586 • 4

RedHatAI/bge-small-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 86 • 9

RedHatAI/bge-base-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 485 • 4

RedHatAI/bge-large-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 58 • 22

afrideva/TinyLlama-1.1B-intermediate-step-715k-1.5T-GGUF

1B • Updated Nov 4, 2023 • 182

afrideva/tinyllama-colorist-v2-GGUF

Text Generation • 1B • Updated Nov 4, 2023 • 172

afrideva/stablelm-3b-4e1t-GGUF

Text Generation • 3B • Updated Nov 5, 2023 • 1.33k • 1

afrideva/tiny-llama-miniguanaco-1.5T-GGUF

Text Generation • 1B • Updated Nov 6, 2023 • 169

afrideva/Hermes-Trismegistus-Mistral-7B-GGUF

Text Generation • 7B • Updated Nov 5, 2023 • 147

afrideva/TinyLlama-1.1B-alpaca-chat-v1.5-GGUF

Text Generation • 1B • Updated Nov 6, 2023 • 179 • 2