Post
Here is my selection of papers for today (9 Jan)
https://huggingface.co/papers
AGG: Amortized Generative 3D Gaussians for Single Image to 3D
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
TeleChat Technical Report
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Mixtral of Experts
https://huggingface.co/papers
AGG: Amortized Generative 3D Gaussians for Single Image to 3D
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
TeleChat Technical Report
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Mixtral of Experts