Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published Nov 21, 2024 • 43
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published Aug 28, 2024 • 86
Text2SQL is Not Enough: Unifying AI and Databases with TAG Paper • 2408.14717 • Published Aug 27, 2024 • 26
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation Paper • 2407.13481 • Published Jul 18, 2024 • 10
Fast Matrix Multiplications for Lookup Table-Quantized LLMs Paper • 2407.10960 • Published Jul 15, 2024 • 12
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper • 2407.14482 • Published Jul 19, 2024 • 26
Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study Paper • 2406.07057 • Published Jun 11, 2024 • 16
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS Paper • 2406.18009 • Published Jun 26, 2024 • 21
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Paper • 2406.07394 • Published Jun 11, 2024 • 27
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 28