Experimental Test Model of Llama-3-8B (base)

Finetuned using the ChatML formatting. Only 50% of epoch 1 was done out of two epochs.

Extended to 1 Million context using the PoSE technique.

Safetensors

Model size

8.03B params

Tensor type

BF16

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

Model tree for tavtav/Pyg-Llama-8B-1M-0.25

Quantizations