Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

27,427

Full-text search

Active filters: 8-bit

0xShonen/Affine-8888888

Text Generation • 22B • Updated 11 days ago • 12.4k • 1

mlx-community/gemma-3-270m-8bit

Text Generation • 0.1B • Updated 11 days ago • 93 • 1

mlx-community/gemma-3-270m-it-8bit

Text Generation • 0.1B • Updated 11 days ago • 510 • 1

huizimao/gpt-oss-120b-uncensored-mxfp4

117B • Updated 10 days ago • 63 • 2

mlx-community/Jan-v1-4B-8bit

Text Generation • 1B • Updated 9 days ago • 598 • 3

EpistemeAI/gpt-oss-20b-unsloth-Multilingual-Thinking

Text Generation • 12B • Updated 7 days ago • 15 • 1

mlx-community/Kimi-VL-A3B-Thinking-2506-8bit

Image-Text-to-Text • Updated 5 days ago • 142 • 1

aquigpt/open0-2

Text Generation • 21B • Updated 4 days ago • 5 • 1

echarlaix/distilbert-base-uncased-finetuned-sst-2-english-int8-dynamic

Text Classification • Updated Jun 13, 2023 • 1.74k • 1

Intel/distilbert-base-uncased-finetuned-sst-2-english-int8-dynamic-inc

Text Classification • Updated Mar 21, 2024 • 5 • 1

Intel/distilbert-base-uncased-distilled-squad-int8-static-inc

Question Answering • Updated Mar 29, 2024 • 1.73k • 5

ethzanalytics/gpt-j-6B-8bit-sharded

Text Generation • 6B • Updated Jan 10 • 33 • 7

pszemraj/tiny-gpt2-magicprompt

Text Generation • 0.0B • Updated Mar 26, 2023 • 37 • 1

ethzanalytics/gpt-j-8bit-daily_dialogues

Text Generation • 6B • Updated Dec 25, 2024 • 37 • 4

ethzanalytics/gpt-j-8bit-KILT_WoW_10k_steps

Text Generation • Updated Nov 27, 2022 • 18

Norod78/gpt-fluentui-flat-svg

Text Generation • 0.2B • Updated Mar 19, 2023 • 30 • 21

BreadAi/MuseCan-1-2

Text Generation • 0.2B • Updated Aug 5, 2023 • 27

ybelkada/bloom-560m-8bit

Text Generation • Updated Apr 12, 2023 • 21

ybelkada/bloom-1b7-8bit

Text Generation • 2B • Updated Apr 17, 2023 • 1.2k • 6

thes41d/fr-boris-8bit

Text Generation • Updated May 20, 2023 • 18

RajuKandasamy/dolly-v2-3b-8bit

Text Generation • Updated Apr 15, 2023 • 18

RajuKandasamy/dolly-v2-7b-8bit

Text Generation • Updated Apr 15, 2023 • 18

KumaTea/twitter-int8

Feature Extraction • 6B • Updated Feb 8 • 36

KumaTea/twitter-int4

Feature Extraction • 3B • Updated Feb 8 • 26 • 1

seongcho/GenerAd-AI

Text Generation • Updated Apr 21, 2023 • 48

Linus4Lyf/Llama-1epoch-Plato

Text Generation • Updated Apr 21, 2023 • 14

Linus4Lyf/Llama-5epoch-Plato

Text Generation • Updated Apr 22, 2023 • 14

Linus4Lyf/Llama-10epoch-Plato

Text Generation • Updated Apr 22, 2023 • 14

ethzanalytics/dolly-v2-12b-sharded-8bit

Text Generation • Updated Apr 29, 2023 • 14 • 4

ethzanalytics/dolly-v2-7b-sharded-8bit

Text Generation • Updated Jun 28, 2023 • 15 • 1