Zhengyang Tang

tangzhy

AI & ML interests

None yet

tangzhy's activity

upvoted a paper 5 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 5 days ago • 72

upvoted a paper 18 days ago

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published 20 days ago • 32

authored a paper 27 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published about 1 month ago • 30

upvoted a paper 28 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published about 1 month ago • 30

commented on a paper 28 days ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published about 1 month ago • 30

authored a paper about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

upvoted a paper about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

commented on a paper about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

upvoted a paper 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

New activity in tangzhy/ORLM 4 months ago

Apply for community grant: Academic project (gpu)

#1 opened 7 months ago by tangzhy

upvoted 2 papers 4 months ago

Roadmap towards Superhuman Speech Understanding using Large Language Models

Paper • 2410.13268 • Published Oct 17, 2024 • 34

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 32

authored a paper 4 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 32

upvoted a paper 6 months ago

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4, 2024 • 55

liked a dataset 6 months ago

fdqerq22ds/MathScaleQA-2M

Viewer • Updated Jun 14, 2024 • 2M • 91 • 8

liked 2 datasets 7 months ago

CardinalOperations/MAMO

Viewer • Updated May 29, 2024 • 863 • 864 • 4

CardinalOperations/IndustryOR

Viewer • Updated May 29, 2024 • 100 • 82 • 7

liked a model 7 months ago

CardinalOperations/ORLM-LLaMA-3-8B

Text Generation • Updated May 29, 2024 • 246 • 4

liked 2 datasets 7 months ago

CardinalOperations/OR-Instruct-Data-3K

Viewer • Updated May 29, 2024 • 3k • 64 • 4

CardinalOperations/NL4OPT

Viewer • Updated May 29, 2024 • 245 • 117 • 2