57 21 127

chansung park PRO

chansung

AI & ML interests

None yet

Recent Activity

updated a model about 14 hours ago

chansung/Gemma2-9B-CCRL-CUR-VAR-1E

published a model 1 day ago

chansung/Gemma2-9B-CCRL-CUR-VAR-ASCE-REV-1E

published a model 1 day ago

chansung/Gemma2-9B-CCRL-CUR-VAR-ASCE-NORMAL-1E

View all activity

Organizations

upvoted a paper about 1 month ago

EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes

Paper • 2507.11407 • Published Jul 15 • 54

upvoted an article about 2 months ago

Article

Use hallucination as feature for vibe coding

•

Jun 30

• 4

upvoted a paper 2 months ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 108

upvoted an article 3 months ago

Article

Introducing SynthID Text

and 5 others •

Oct 23, 2024

• 46

upvoted an article 4 months ago

Article

How to Build an MCP Server with Gradio

and 1 other •

Apr 30

• 189

upvoted an article 6 months ago

Article

Distilling from Dialogues: Finding Meaning in LLM Interactions

•

Feb 25

• 4

upvoted 2 articles 7 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 877

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

and 1 other •

Jan 16

• 75

upvoted a paper 8 months ago

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Paper • 2412.06071 • Published Dec 8, 2024 • 9

upvoted 2 articles 10 months ago

Article

Llama 3.2 in Keras

•

Oct 21, 2024

• 13

Article

dstack to manage clusters of on-prem servers for AI workloads with ease

•

Oct 10, 2024

• 7

upvoted a collection 11 months ago

DataGemma Release

Collection

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Jul 10 • 87

upvoted an article 12 months ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

•

Aug 26, 2024

• 71

upvoted a paper 12 months ago

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

Paper • 2408.13467 • Published Aug 24, 2024 • 26

upvoted an article 12 months ago

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

•

Aug 22, 2024

• 13

upvoted an article about 1 year ago

Article

The Workflow of PEFT

•

Aug 14, 2024

• 19

upvoted 4 articles over 1 year ago

Article

Deploying 🤗 ViT on Vertex AI

and 1 other •

Aug 19, 2022

• 2

Article

Expanding Model Context and Creating Chat Models with a Single Click

•

Apr 28, 2024

• 38

Article

Faster fine-tuning using TRL & Unsloth

•

Jan 10, 2024

• 68

Article

CodeGemma - an official Google release for code LLMs

and 5 others •

Apr 9, 2024

• 102

chansung park PRO

AI & ML interests

Recent Activity

Organizations

chansung's activity

Use hallucination as feature for vibe coding

Introducing SynthID Text

How to Build an MCP Server with Gradio

Distilling from Dialogues: Finding Meaning in LLM Interactions

Open-R1: a fully open reproduction of DeepSeek-R1

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Llama 3.2 in Keras

dstack to manage clusters of on-prem servers for AI workloads with ease

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

The Workflow of PEFT

Deploying 🤗 ViT on Vertex AI

Expanding Model Context and Creating Chat Models with a Single Click

Faster fine-tuning using TRL & Unsloth

CodeGemma - an official Google release for code LLMs