Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 16 days ago • 114
DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training Paper • 2310.02025 • Published Oct 3, 2023 • 1
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities Paper • 2210.06640 • Published Oct 13, 2022
Generative Counterfactual Introspection for Explainable Deep Learning Paper • 1907.03077 • Published Jul 6, 2019
NEFTune: Noisy Embeddings Improve Instruction Finetuning Paper • 2310.05914 • Published Oct 9, 2023 • 14
Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models Paper • 2307.01379 • Published Jul 3, 2023 • 1
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations Paper • 2402.12348 • Published Feb 19, 2024 • 1
Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond Paper • 2002.12920 • Published Feb 28, 2020
Transformers Can Do Arithmetic with the Right Embeddings Paper • 2405.17399 • Published May 27, 2024 • 52
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 11
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression Paper • 2403.15447 • Published Mar 18, 2024 • 16