Uploaded model

  • Developed by: mervinpraison
  • Finetuned from model : unsloth/Phi-4-unsloth-bnb-4bit
dataset:
- name: mervinpraison/harup-fall-axis-alpaca
dataset_num_proc: 2
dataset_text_field: text
gradient_accumulation_steps: 2
hf_model_name: mervinpraison/Phi-4-harupfall-axis
huggingface_save: 'true'
learning_rate: 0.0001
load_in_4bit: true
loftq_config: null
logging_steps: 15
lora_alpha: 16
lora_bias: none
lora_dropout: 0
lora_r: 16
lora_target_modules:
- q_proj
- k_proj
- v_proj
- o_proj
- gate_proj
- up_proj
- down_proj
lr_scheduler_type: linear
max_seq_length: 2048
max_steps: 6000
model_name: unsloth/Phi-4-unsloth-bnb-4bit
model_parameters: 14b
num_train_epochs: 10
ollama_model: mervinpraison/Phi-4-harupfall-axis
ollama_save: 'true'
optim: lion_8bit
output_dir: outputs
packing: false
per_device_train_batch_size: 1
quantization_method:
- q4_k_m
random_state: 3407
seed: 3407
train: 'true'
use_gradient_checkpointing: unsloth
use_rslora: false
warmup_steps: 100
weight_decay: 0.05

Training Details: wandb

Downloads last month
47
Safetensors
Model size
14.7B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.