-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 146 -
Orion-14B: Open-source Multilingual Large Language Models
Paper • 2401.12246 • Published • 13 -
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 54 -
MM-LLMs: Recent Advances in MultiModal Large Language Models
Paper • 2401.13601 • Published • 47
Collections
Discover the best community collections!
Collections including paper arxiv:2412.04003
-
The Role of Language Imbalance in Cross-lingual Generalisation: Insights from Cloned Language Experiments
Paper • 2404.07982 • Published -
Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages
Paper • 2404.11553 • Published -
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
Paper • 2410.14815 • Published • 1 -
Benchmarking Linguistic Diversity of Large Language Models
Paper • 2412.10271 • Published
-
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages
Paper • 2411.12240 • Published • 7 -
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch
Paper • 2411.11171 • Published • 8 -
Xmodel-1.5: An 1B-scale Multilingual LLM
Paper • 2411.10083 • Published • 14 -
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement
Paper • 2412.04003 • Published • 10
-
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
Paper • 2409.10516 • Published • 41 -
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Paper • 2409.11242 • Published • 7 -
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Paper • 2409.11136 • Published • 23 -
On the Diagram of Thought
Paper • 2409.10038 • Published • 14