Li-Wei Chen's picture

Li-Wei Chen

txya900619

·

txya900619

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Craw4LLM: Efficient Web Crawling for LLM Pretraining

upvoted a paper 3 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

upvoted a paper 3 days ago

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

View all activity

Organizations

txya900619's activity

upvoted 3 papers 3 days ago

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published 5 days ago • 24

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 5 days ago • 72

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published 6 days ago • 41

upvoted a paper 7 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 11 days ago • 43

upvoted 4 papers 10 days ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 20 days ago • 62

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 13 days ago • 122

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 13 days ago • 134

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 16 days ago • 114

upvoted 2 papers 16 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 19 days ago • 190

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published 19 days ago • 56

upvoted 2 papers 21 days ago

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published 27 days ago • 15

Humanity's Last Exam

Paper • 2501.14249 • Published about 1 month ago • 62

upvoted 3 papers 27 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 330

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 91

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

upvoted 5 papers about 1 month ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 43

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 69

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 273

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53