π Commit Message Generation Evaluation π Collection All the resources for our "Towards Realistic Evaluation of Commit Message Generation by Matching Online and Offline Settings" study on CMG metrics! β’ 7 items β’ Updated Oct 18, 2024 β’ 2
Running 112 112 Open-LLM performances are plateauing, letβs make the leaderboard steep again π Update leaderboard for fair model evaluation
Running 921 921 Can You Run It? LLM version π Determine GPU requirements for large language models
Wuerstchen: Efficient Pretraining of Text-to-Image Models Paper β’ 2306.00637 β’ Published Jun 1, 2023 β’ 12
stacked-summaries/flan-t5-large-stacked-samsum-1024 Summarization β’ Updated Sep 23, 2023 β’ 53 β’ 10
Running on CPU Upgrade 4.87k 4.87k MTEB Leaderboard π₯ Select benchmarks and languages for text embeddings evaluation