Kaiqiang Song

kqsong

http://i2u.world

KaiQiangSong

AI & ML interests

Summarization and Text Generation

Organizations

None yet

kqsong's activity

authored a paper 9 days ago

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published 11 days ago • 21

upvoted 2 papers 9 days ago

Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service

Paper • 2407.15441 • Published Jul 22, 2024 • 2

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published 11 days ago • 21

upvoted a paper 12 days ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published 13 days ago • 34

upvoted a paper 16 days ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 17 days ago • 44

authored a paper 16 days ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 17 days ago • 44

authored a paper 24 days ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published 26 days ago • 38

upvoted a paper 25 days ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published 26 days ago • 38

liked 4 datasets about 1 month ago

upvoted an article about 2 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • and 22 others • Jul 8 • 646

upvoted a paper 2 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 53

upvoted 2 collections 4 months ago

Preference Datasets for DPO

Collection

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 43

Qwen3

Collection

84 items • Updated Aug 6 • 1.2k

liked a Space 4 months ago

Qwen3 Demo

Space • 📊 • Generate responses to text prompts in a conversational format • 769

liked 2 models 5 months ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

71B • Updated Apr 13 • 7.82k • 88

Nexusflow/Athene-RM-70B

Text Classification • 70B • Updated Nov 15, 2024 • 10 • 9

upvoted a paper 6 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 374
