MathBite's picture
Upload fine-tuned and merged model.
1a6aa18 verified
metadata
license: llama3.1
language: en
base_model: meta-llama/Llama-3.1-8B-Instruct

MathBite/self_corrective_llama_3.1_8B

This is a fine-tuned version of meta-llama/Llama-3.1-8B-Instruct that has been trained to detect and mitigate hallucinations in generated text.

How to Use

Because this model uses a custom architecture, you must use trust_remote_code=True when loading it.

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "MathBite/self_corrective_llama_3.1_8B"
tokenizer = AutoTokenizer.from_pretrained(model_name)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    trust_remote_code=True
)