-
FreeU: Free Lunch in Diffusion U-Net
Paper • 2309.11497 • Published • 65 -
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 53 -
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 88 -
Mistral 7B
Paper • 2310.06825 • Published • 46
Collections
Discover the best community collections!
Collections including paper arxiv:2310.06825
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 33 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 13 -
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper • 2104.09864 • Published • 11 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 13