Jamba GGUF
Collection
Current GGUF conversions of the Jamba models. Will be updated as support is merged into llama.cpp: https://github.com/ggerganov/llama.cpp/pull/7531
This is the first GGUF of the new Jamba architecture, recently hacked together with llama.cpp using this branch: https://github.com/ggerganov/llama.cpp/tree/compilade/refactor-kv-cache
Model: pszemraj/jamba-900M-v0.13-KIx2
16-bit
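
Since this conversion comes from an experimental branch, it can be useful to sanity-check a downloaded file before trying to load it. The following is a minimal sketch, not part of llama.cpp, that reads only the fixed GGUF header fields (magic, version, tensor count, metadata key/value count). It assumes GGUF version 2 or later, where both counts are 64-bit; the function name and the file name in the usage note are hypothetical.

```python
import struct


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF file.

    Assumes GGUF v2 or later, where both counts are little-endian uint64.
    """
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file: magic={magic!r}")
        # The version is a little-endian uint32 right after the magic bytes.
        (version,) = struct.unpack("<I", f.read(4))
        # Tensor count, then metadata key/value count.
        tensor_count, kv_count = struct.unpack("<QQ", f.read(16))
    return version, tensor_count, kv_count
```

For example, `read_gguf_header("jamba-900M.f16.gguf")` (a hypothetical local file name) returns the three header values, or raises `ValueError` if the file does not start with the GGUF magic. This only checks that the container is well formed; loading the Jamba architecture still requires a llama.cpp build from the branch linked above.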