---
base_model: HuggingFaceTB/SmolLM-135M
---

# QuantFactory/Biggie-SmoLlm-0.15B-Base-GGUF

This is a quantized version of [nisten/Biggie-SmoLlm-0.15B-Base](https://huggingface.co/nisten/Biggie-SmoLlm-0.15B-Base), created using llama.cpp.

# Original Model Card
### An EVEN SMALLER Frankenstein of SmolLM-0.13B, upped to 0.15B

Use this frankenbase for training. It was built via semi-automated continuous merging to figure out the recipe, and the resulting model is more coherent.
 | |
```bash
wget https://huggingface.co/nisten/Biggie-SmoLlm-0.15B-Base/resolve/main/Biggie_SmolLM_0.15B_Base_bf16.gguf
```
```bash
llama-cli -ngl 99 -co --temp 0 -p "How to build a city on Mars via calculating Aldrin-Cycler orbits?" -m Biggie_SmolLM_0.15B_Base_bf16.gguf
```
The temperature, min-p, and other sampling settings still need tuning, but even at the default temp 0 it stayed coherent for the first 100 tokens. An amazing option for further training. And this is a merge of the base model, not the instruct!
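As a sketch of what the suggested tuning might look like, the command below uses llama.cpp's standard sampling flags; the specific values are illustrative assumptions, not settings from the original author:

```bash
# Illustrative sampling settings (values are assumptions, tune to taste):
#   --temp 0.7  : mild randomness instead of greedy temp-0 decoding
#   --min-p 0.05: drop tokens below 5% of the top token's probability
#   -n 128      : cap generation at 128 tokens
llama-cli -m Biggie_SmolLM_0.15B_Base_bf16.gguf -ngl 99 -co \
  --temp 0.7 --min-p 0.05 -n 128 \
  -p "How to build a city on Mars via calculating Aldrin-Cycler orbits?"
```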
 | |
I don't understand how the f a 150MB file can talk, but it can.