Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 134 β’ 10
MoritzLaurer/deberta-v3-large-zeroshot-v2.0 Zero-Shot Classification β’ Updated Apr 11, 2024 β’ 68.6k β’ β’ 92
Running 920 920 Can You Run It? LLM version π Determine GPU requirements for large language models
view article Article π¦βοΈ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero β’ Jun 4, 2024 β’ 76