inference Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
llm_model PygmalionAI/mythalion-13b Text Generation • 13B • Updated Sep 15, 2023 • 1.23k • • 160 Nitral-AI/Poppy_Porpoise-0.72-L3-8B Text Generation • 8B • Updated Jul 4, 2024 • 32 • • 35 Lewdiculous/KukulStanta-7B-GGUF-IQ-Imatrix 7B • Updated Apr 7, 2024 • 188 • 9 Lewdiculous/L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix 8B • Updated Feb 2, 2025 • 16.8k • 210
inference Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
llm_model PygmalionAI/mythalion-13b Text Generation • 13B • Updated Sep 15, 2023 • 1.23k • • 160 Nitral-AI/Poppy_Porpoise-0.72-L3-8B Text Generation • 8B • Updated Jul 4, 2024 • 32 • • 35 Lewdiculous/KukulStanta-7B-GGUF-IQ-Imatrix 7B • Updated Apr 7, 2024 • 188 • 9 Lewdiculous/L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix 8B • Updated Feb 2, 2025 • 16.8k • 210