Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

641

Full-text search

Active filters: quantization

stabilityai/stable-diffusion-3.5-large-tensorrt

Text-to-Image • Updated Oct 20, 2025 • 673 • 53

Octen/Octen-Embedding-8B-INT8

Sentence Similarity • 8B • Updated 3 days ago • 43 • 3

169Pi/Alpie-Core

Text Generation • Updated 14 days ago • 50 • 5

ArtusDev/requests-exl

Updated Oct 13, 2025 • 6

EricRollei/HunyuanImage-3-NF4-ComfyUI

Text-to-Image • 83B • Updated Nov 24, 2025 • 36 • 2

coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-nvfp4

Updated Dec 13, 2025 • 150 • 1

drbaph/Qwen-Image-Edit-2511-FP8

Image-to-Image • Updated 29 days ago • 4.76k • 6

Shifusen/Negative_LLAMA_70B-NVFP4

Text Generation • 41B • Updated 28 days ago • 6 • 1

goniz/MiniMax-M2.1-REAP-30-GGUF

162B • Updated 7 days ago • 1.79k • 2

ryukin164/LFM2.5-1.2B-Q4-JP

Text Generation • 1B • Updated 4 days ago • 403 • 1

JEILDLWLRMA/Qwen3VL-8B-Instruct-FP8

Image-to-Text • 9B • Updated 1 day ago • 15 • 1

krisaujla/BitLinear

Other • Updated about 17 hours ago • 1

ethzanalytics/gpt-j-6B-8bit-sharded

Text Generation • 6B • Updated Jan 10, 2025 • 4 • 7

ethzanalytics/gpt-j-8bit-daily_dialogues

Text Generation • 6B • Updated Dec 25, 2024 • 10 • 4

ethzanalytics/gpt-j-8bit-KILT_WoW_10k_steps

Text Generation • Updated Nov 27, 2022 • 13

leumastai/t5-large-quantized

Updated Mar 16, 2023 • 3 • 1

pszemraj/stablelm-7b-sft-v7e3-autogptq-4bit-128g

Text Generation • Updated 24 days ago • 6 • 3

limcheekin/flan-t5-small-ct2

Updated May 24, 2023 • 3

limcheekin/flan-t5-xl-ct2

Updated Jun 3, 2023 • 7 • 1

limcheekin/flan-t5-xxl-ct2

Updated May 30, 2023 • 1 • 1

limcheekin/fastchat-t5-3b-ct2

Text Generation • Updated Jun 28, 2023 • 1 • 2

limcheekin/flan-alpaca-gpt4-xl-ct2

Updated Jun 4, 2023

limcheekin/mpt-7b-storywriter-ct2

Updated Jun 27, 2023

limcheekin/falcon-7b-instruct-ct2

Updated Jun 19, 2023 • 1 • 1

limcheekin/mpt-7b-instruct-ct2

Updated Jun 19, 2023 • 1

limcheekin/redpajama-chat-7b-ct2

Updated Jun 9, 2023 • 3

seonglae/wizardlm-7b-uncensored-gptq

Text Generation • Updated Jul 19, 2023 • 11

seonglae/llama-2-7b-chat-hf-gptq

Text Generation • Updated Jul 20, 2023 • 2

seonglae/llama-2-13b-chat-hf-gptq

Text Generation • Updated Jul 20, 2023 • 4

clibrain/Llama-2-7b-ft-instruct-es-gptq-4bit

Text Generation • Updated Sep 1, 2023 • 9 • 9