🧠 DeepSeek-Qwen-1.5B-Multitask-LoRA

Author: Gilbert Akham
License: Apache-2.0
Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Adapter type: LoRA (PEFT)
Capabilities: Multi-task generalization & reasoning

🚀 What It Can Do

This multitask fine-tuned model handles a broad set of natural language and reasoning-based tasks, such as:

✉️ Email & message writing — generate clear, friendly, or professional communications.

📖 Story & creative writing — craft imaginative narratives, poems, and dialogues.

💬 Conversational chat — maintain coherent, context-aware conversations.

💡 Explanations & tutoring — explain technical or abstract topics simply.

🧩 Reasoning & logic tasks — provide step-by-step answers for analytical questions.

💻 Code generation & explanation — write and explain Python or general programming code.

🌍 Translation & summarization — translate between multiple languages or condense information.

The model’s multi-domain training (based on datasets like SmolTalk, Everyday Conversations, and reasoning-rich samples) makes it suitable for assistants, chatbots, content generators, or educational tools.

🧩 Training Details

Parameter	Value
Base model	`deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B`
Adapter	LoRA (r=8, alpha=32, dropout=0.1)
Max sequence length	1024
Learning rate	3e-5 (cosine decay)
Optimizer	`adamw_8bit`
Grad Accumulation	4
Precision	4-bit quantized, FP16 compute
Steps	12k total (best @ ~8.2k)
Training time	~2.5h on A4000
Frameworks	🤗 Transformers, PEFT, TRL, BitsAndBytes

🧠 Reasoning Capability

Thanks to integration of SmolTalk and diverse multi-task prompts, the model learns:

Chain-of-thought style reasoning
Conversational grounding
Multi-step logical inferences
Instruction following across domains

Example:

### Task: Explain reasoning

### Input:
If a train leaves City A at 3 PM and arrives at City B at 6 PM, covering 180 km, what is its average speed?

### Output:
The train travels 180 km in 3 hours. 
Average speed = 180 ÷ 3 = 60 km/h.

Downloads last month: 125

Model tree for GilbertAkham/deepseek-R1-multitask-lora

Base model

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Adapter

(150)

this model

GilbertAkham
/

deepseek-R1-multitask-lora