Ashish Tanwer
ashishtanwer
AI & ML interests
None yet
Recent Activity
liked
a Space
1 day ago
Qwen/Qwen-Image-Edit
liked
a model
1 day ago
deepseek-ai/DeepSeek-V3.1-Base
liked
a model
3 days ago
Qwen/Qwen3-Coder-30B-A3B-Instruct
Organizations
RAG
DataLabelling
LLM
-
Running2.59k2.59k
Anycoder
🏢Generate HTML/CSS code from images
-
Runtime error274274
Qwen2.5 Coder Artifacts
🐢Generate application code with Qwen2.5-Coder-32B
-
Running923923
QwQ-32B-Preview
🔍QwQ-32B-Preview
-
Running on CPU Upgrade13.4k13.4k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
Evals
ClassicalML
Paper and resources for Classical ML
InfraML
Agents
Transformer
-
sentence-transformers/all-mpnet-base-v2
Sentence Similarity • 0.1B • Updated • 19.2M • • 1.13k -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 14 -
google-t5/t5-base
Translation • 0.2B • Updated • 1.66M • • 739 -
Attention Is All You Need
Paper • 1706.03762 • Published • 79
DataCleaning
Dataset
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 38 -
HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 305k • 2.32k -
tiiuae/falcon-refinedweb
Viewer • Updated • 968M • 11.5k • 866 -
cerebras/SlimPajama-627B
Preview • Updated • 42.4k • 489
Training
-
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 14 -
AutoTrain: No-code training for state-of-the-art models
Paper • 2410.15735 • Published • 60 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper • 2405.00732 • Published • 122 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 46
Diffusion
DataCrawling
Agents
RAG
Transformer
-
sentence-transformers/all-mpnet-base-v2
Sentence Similarity • 0.1B • Updated • 19.2M • • 1.13k -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 14 -
google-t5/t5-base
Translation • 0.2B • Updated • 1.66M • • 739 -
Attention Is All You Need
Paper • 1706.03762 • Published • 79
DataLabelling
DataCleaning
LLM
-
Running2.59k2.59k
Anycoder
🏢Generate HTML/CSS code from images
-
Runtime error274274
Qwen2.5 Coder Artifacts
🐢Generate application code with Qwen2.5-Coder-32B
-
Running923923
QwQ-32B-Preview
🔍QwQ-32B-Preview
-
Running on CPU Upgrade13.4k13.4k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
Dataset
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 38 -
HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 305k • 2.32k -
tiiuae/falcon-refinedweb
Viewer • Updated • 968M • 11.5k • 866 -
cerebras/SlimPajama-627B
Preview • Updated • 42.4k • 489
Evals
Training
-
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 14 -
AutoTrain: No-code training for state-of-the-art models
Paper • 2410.15735 • Published • 60 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper • 2405.00732 • Published • 122 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 46
ClassicalML
Paper and resources for Classical ML
Diffusion
InfraML