Chuanming Liu

Chuanming

AI & ML interests

Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed

Recent Activity

updated a collection about 6 hours ago

liked a model about 6 hours ago

ByteDance-Seed/Seed-OSS-36B-Base

upvoted a paper about 6 hours ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published 15 days ago • 80

upvoted a collection about 6 hours ago

Seed-OSS

Seed-OSS Open-Source Models • 3 items • Updated about 16 hours ago • 28

upvoted an article 20 days ago

Introducing ColQwen-Omni: Retrieve in every modality

Article • 5 authors • Jul 17 • 66

upvoted a collection 2 months ago

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Jul 3 • 110

upvoted an article 2 months ago

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Article • 6 authors • Jun 3 • 83

upvoted a collection 2 months ago

Model Optimizer

A collection of generative models quantized and optimized with TensorRT Model Optimizer. • 30 items • Updated 3 days ago • 28

upvoted a paper 2 months ago

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Paper • 2506.05301 • Published Jun 5 • 55

upvoted 2 articles 2 months ago

Training and Finetuning Embedding Models with Sentence Transformers v3

Article • 1 author • May 28, 2024 • 243

What is Qwen-Agent framework? Inside the Qwen family

Article • 2 authors • Mar 20 • 12

upvoted an article 3 months ago

KV Cache from scratch in nanoVLM

Article • 5 authors • Jun 4 • 89

upvoted 2 collections 3 months ago

Qwen3-Reranker

3 items • Updated about 1 month ago • 62

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 7 days ago • 257

upvoted 3 articles 3 months ago

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

Article • 4 authors • May 23 • 157

Tiny Agents: a MCP-powered agent in 50 lines of code

Article • 1 author • Apr 25 • 295

How to Build an MCP Server with Gradio

Article • 2 authors • Apr 30 • 189

upvoted a paper 3 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 149

upvoted a collection 3 months ago

Qwen3

84 items • Updated 15 days ago • 1.11k

upvoted an article 4 months ago

Vision Language Models Explained

Article • 2 authors • Apr 11, 2024 • 437

upvoted a paper 4 months ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22 • 64

upvoted an article 4 months ago

The Large Language Model Course

Article • 1 author • Jan 16 • 199