Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
Replicate
Fireworks
fal
SambaNova
Nebius AI Studio
Novita
Hyperbolic
HF Inference API
Misc
Reset Misc
arxiv:
2408.15237
AutoTrain Compatible
Inference Endpoints
text-generation-inference
Misc with no match
Eval Results
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
21
Full-text search
Edit filters
Sort: Trending
Active filters:
2408.15237
Clear all
JunxiongWang/mamba_0_5_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
11
JunxiongWang/mamba_0_5_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
9
JunxiongWang/mamba_0_875_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
12
•
1
JunxiongWang/mamba_0_875_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
6
JunxiongWang/mamba_0_75_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
6
JunxiongWang/mamba_0_75_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
6
JunxiongWang/MambaInLlama_0_50
Updated
Sep 2, 2024
•
70
JunxiongWang/Mamba2InLlama_0_50
Updated
Sep 2, 2024
•
142
JunxiongWang/MambaInLlama_0_75
Updated
Sep 2, 2024
•
44
JunxiongWang/Mamba2InLlama_0_75
Updated
Sep 2, 2024
•
93
JunxiongWang/Mamba2InLlama_0_875
Updated
Sep 2, 2024
•
103
JunxiongWang/MambaInLlama_0_875
Updated
Sep 2, 2024
•
72
JunxiongWang/Mamba2InLlama_1
Updated
Sep 2, 2024
•
199
•
1
JunxiongWang/Llama3.2-Mamba2-3B-distill
Updated
Nov 17, 2024
•
87
JunxiongWang/Llama3.2-Mamba2-3B-dpo
Updated
Nov 17, 2024
•
255
JunxiongWang/Llama3.1-Mamba2-8B-distill
Updated
Nov 17, 2024
•
38
JunxiongWang/Llama3.2-Mamba-3B-distill
Updated
Nov 17, 2024
•
152
JunxiongWang/Llama3.1-Mamba-8B-distill
Updated
Nov 17, 2024
•
7
JunxiongWang/Llama3.1-Mamba2-8B-dpo
Updated
Nov 17, 2024
•
8
JunxiongWang/Llama3.1-Mamba-8B-dpo
Updated
Nov 17, 2024
•
11
JunxiongWang/Llama3.2-Mamba-3B-dpo
Updated
Nov 17, 2024
•
26