Tony Congqian Wang
TonyCWang
AI & ML interests
None yet
Recent Activity
upvoted an article about 11 hours ago
The Optimal Architecture for Small Language Models
upvoted a paper 19 days ago
TiDAR: Think in Diffusion, Talk in Autoregression
upvoted an article about 1 month ago
Why Did MiniMax M2 End Up as a Full Attention Model?
Organizations
None yet