Fine-Tuning - a wumingshi Collection

Fine-Tuning

updated about 4 hours ago

PockEngine: Sparse and Efficient Fine-tuning in a Pocket

Paper • 2310.17752 • Published Oct 26, 2023 • 14

Instruction-tuning Aligns LLMs to the Human Brain

Paper • 2312.00575 • Published Dec 1, 2023 • 14

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Paper • 2401.01325 • Published Jan 2, 2024 • 28

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11, 2024 • 29

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published 2 days ago • 52