💻 Smoothing the Transition from Service LLM to Local LLM
Imagine your go-to LLM service is down, or you need to use it offline – yikes! This project is all about having that "Plan B" ready to go. Here's LLaMA Duo, the project I've been building with @sayakpaul:
✨ Fine-tune a smaller LLM: We used Hugging Face's alignment-handbook to teach a smaller LLM to mimic my favorite large language model. Think of it as giving that super-smart AI assistant a capable understudy.
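To give a feel for what that fine-tuning step boils down to, here's a minimal sketch using TRL's SFTTrainer (the library alignment-handbook builds on); the base model id and dataset name are placeholders, not the project's actual recipe:

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Hypothetical dataset of prompts paired with the service LLM's responses,
# stored in a "messages" column that SFTTrainer understands.
dataset = load_dataset("my-org/service-llm-chats", split="train")

trainer = SFTTrainer(
    model="google/gemma-2b",  # a small base model; swap in your favorite
    train_dataset=dataset,
    args=SFTConfig(output_dir="understudy-llm", num_train_epochs=3),
)
trainer.train()
```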
🤖 Batch Inference: Let's get that fine-tuned LLM working! My scripts batch prompts together so text generation keeps pace, and we've made sure things run smoothly even with bigger workloads.
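Batch generation with the fine-tuned checkpoint can be as simple as this sketch using the transformers pipeline (the model path is the hypothetical output of the training step above):

```python
from transformers import pipeline

# Load the fine-tuned checkpoint; device_map="auto" picks the available GPU(s).
generator = pipeline("text-generation", model="understudy-llm", device_map="auto")

# Batching needs a pad token, so reuse EOS if the tokenizer doesn't define one.
generator.tokenizer.pad_token_id = generator.tokenizer.eos_token_id

prompts = [
    "Summarize the idea behind LoRA fine-tuning in two sentences.",
    "List three trade-offs of running an LLM locally.",
]

# Passing a list of prompts plus batch_size makes the pipeline process them in batches.
for result in generator(prompts, max_new_tokens=128, batch_size=2, do_sample=False):
    print(result[0]["generated_text"])
```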
🧐 Evaluation: How well is my small LLM doing? We integrated the Gemini API to act as an expert judge – it compares my model's outputs against the original service LLM's. Talk about a tough critic!
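The judging step could look roughly like this (a sketch with the google-generativeai SDK; the judge model and rubric wording are my assumptions, not the project's exact prompts):

```python
import os
import google.generativeai as genai

genai.configure(api_key=os.environ["GEMINI_API_KEY"])
judge = genai.GenerativeModel("gemini-1.5-flash")  # assumed judge model

instruction = "Explain what a vector database is."
reference = "A vector database stores embeddings ..."  # from the service LLM
candidate = "A vector database keeps vectors ..."      # from the local LLM

rubric = f"""You are an impartial judge. Rate how closely the CANDIDATE answer
matches the REFERENCE in quality and content, on a 1-10 scale, then explain briefly.

INSTRUCTION: {instruction}
REFERENCE: {reference}
CANDIDATE: {candidate}"""

print(judge.generate_content(rubric).text)
```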
🪄 Synthetic Data Generation: Need to boost that model's performance? Using Gemini's feedback, we can generate even more training data, tailored to exactly where the smaller LLM falls short.
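And the data-boosting loop, sketched with the same SDK (the prompt format is an assumption; the real pipeline can shape it however your fine-tuning data is structured):

```python
# Reusing the `judge` client from the evaluation sketch above.
seed_instruction = "Explain what a vector database is."

request = (
    "The local model answered the instruction below poorly. Generate five new "
    "instruction/response pairs on the same topic and difficulty, as a JSON list "
    f"of objects with 'instruction' and 'response' keys.\n\nINSTRUCTION: {seed_instruction}"
)

synthetic = judge.generate_content(request)
print(synthetic.text)  # parse the JSON and append it to the fine-tuning dataset
```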
🧱 Building Blocks: This isn't just a one-time thing – it's a toolkit for all kinds of LLMOps work. Want to change your evaluation metrics? Bring in models trained differently? Absolutely, let's make it happen.
Why this project is awesome:
💪 Reliability: Keep things running no matter what happens to your main LLM source.
🔒 Privacy: Process sensitive information on your own terms.
🗺️ Offline capable: No internet connection? No problem!
🕰️ Version Control: Lock in your favorite LLM's behavior, even if the service model changes.