---
library_name: transformers
tags:
- legal-qa
- lora
- mistral-7b
- huggingface
- langchain
- faiss
- legal-tech
- retrieval-augmented-generation
datasets:
- theatticusproject/cuad
- jhu-clsp/SARA
language:
- en
base_model:
- mistralai/Mistral-7B-Instruct-v0.2
---
# 🧠 LexiQ – Legal Document QA using LoRA-Finetuned Mistral-7B

> **Authors**: Nishad Kookana, Ishita, Milan Agarwal, and Sushant Bhatia
>
> **License**: Apache 2.0
>
> **Base model**: [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)

LexiQ is a domain-specific legal question-answering model built on top of Mistral-7B. It is fine-tuned with **LoRA** on two legal datasets: **CUAD** (Contract Understanding Atticus Dataset) and **SARA** (StAtutory Reasoning Assessment). The model supports question answering over legal documents via Retrieval-Augmented Generation (RAG).
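
In the RAG setup, document chunks are embedded, indexed, and retrieved at query time to supply the prompt context. Below is a minimal retrieval sketch using LangChain and FAISS (both tagged above); the embedding model, chunk sizes, file name, and query are illustrative assumptions, not the exact LexiQ pipeline.

```python
# Minimal RAG retrieval sketch; the embedding model and chunk sizes are
# assumptions for illustration, not LexiQ's published configuration.
# Requires: pip install langchain langchain-community faiss-cpu sentence-transformers
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS

# Split a legal document into overlapping chunks (sizes are illustrative).
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_text(open("contract.txt").read())  # hypothetical file

# Embed the chunks and build an in-memory FAISS index.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
index = FAISS.from_texts(chunks, embeddings)

# Retrieve the most relevant chunks and join them into the prompt context.
question = "What is the termination notice period?"
docs = index.similarity_search(question, k=3)
context = "\n\n".join(doc.page_content for doc in docs)
```

The retrieved `context` string can then be dropped into the prompt template shown in the usage example below.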

---

## 🔍 Use Cases

### ✅ Direct Use
- Legal document understanding
- Question answering over case files, legal summaries, and contracts
- Legal education, research, and policy drafting

### ❌ Out-of-Scope Use
- Not suitable for non-English documents
- Not trained on international law; Indian and US legal concepts dominate
- Not intended for legal advice in real-world litigation

---

## 🧪 Model Details

- **Model Type**: Causal Language Model (LLM) with LoRA fine-tuning
- **Languages**: English
- **Quantization**: 4-bit NF4 via `bitsandbytes` (see the loading sketch below)
- **Training Precision**: Mixed FP16
- **Training Datasets**: CUAD, SARA
- **Tokenizer**: AutoTokenizer (fast)
- **Training Framework**: Transformers + PEFT + Accelerate
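
As a sketch of how the 4-bit NF4 setup above maps to code, the base model can be loaded with a `BitsAndBytesConfig` before attaching a LoRA adapter. The LoRA hyperparameters shown (rank, alpha, target modules, dropout) are assumptions for illustration; this card does not publish the exact training configuration.

```python
# Hedged sketch: load the base model in 4-bit NF4 (as stated above) and attach
# an illustrative LoRA config. The LoRA hyperparameters are assumptions, not
# the published training setup.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",             # NF4, per the card
    bnb_4bit_compute_dtype=torch.float16,  # mixed FP16 compute
)

base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-Instruct-v0.2",
    quantization_config=bnb_config,
    device_map="auto",
)
base_model = prepare_model_for_kbit_training(base_model)

lora_config = LoraConfig(
    r=16,                                   # rank: assumed value
    lora_alpha=32,                          # scaling: assumed value
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # sanity check: only LoRA weights train
```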

---

## 📦 How to Use

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

# Load the tokenizer, the FP16 base model, and the LoRA adapter weights.
tokenizer = AutoTokenizer.from_pretrained("NishKook/legal-qa-lora")
base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-Instruct-v0.2",
    device_map="auto",
    torch_dtype=torch.float16,
)
model = PeftModel.from_pretrained(base_model, "NishKook/legal-qa-lora", device_map="auto")

question = "What are the four elements of negligence?"
context = "Negligence requires a duty, a breach of that duty, causation, and damages."

# Assemble the question/context prompt and generate an answer.
prompt = f"### Question:\n{question}\n\n### Context:\n{context}\n\n### Answer:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
```
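
For full-length documents, pair this with the FAISS retrieval sketch above: retrieve the most relevant chunks and pass them in as `context` before generating.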