15 4

Chenc

USTC-Chen

AI & ML interests

None yet

Organizations

None yet

USTC-Chen's activity

upvoted a paper 9 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 11 days ago • 181

liked a dataset 19 days ago

axxkaya/UVT-Explanatory-based-Vision-Tasks

Viewer • Updated 12 days ago • 284k • 179 • 9

upvoted 4 papers 19 days ago

Humanity's Last Exam

Paper • 2501.14249 • Published about 1 month ago • 62

ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Paper • 2501.15570 • Published 29 days ago • 23

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 26 days ago • 35

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published 26 days ago • 27

upvoted 4 papers 27 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 97

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published about 1 month ago • 30

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published 29 days ago • 12

upvoted 3 papers about 1 month ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 26

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 75

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 50

liked 3 models about 1 month ago

vikhyatk/moondream2

Image-Text-to-Text • Updated Jan 9 • 124k • 1.05k

hexgrad/Kokoro-82M

Text-to-Speech • Updated 22 days ago • 1.09M • 3.38k

microsoft/phi-4

Text Generation • Updated 20 days ago • 608k • 1.77k

upvoted 3 papers about 1 month ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Paper • 2501.07730 • Published Jan 13 • 16

PokerBench: Training Large Language Models to become Professional Poker Players

Paper • 2501.08328 • Published Jan 14 • 17