Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10
130
Ömer Kaya
andthattoo
Follow
John6666's profile picture
erhant-fb's profile picture
AtakanTekparmak's profile picture
8 followers
·
9 following
https://twitter.com/andthatto
andthatto
andthattoo
AI & ML interests
Synthetic data, verifiable information retrieval
Recent Activity
updated
a model
6 days ago
driaforall/Tiny-Agent-a-1.5B
liked
a model
6 days ago
microsoft/OmniParser-v2.0
reacted
to
Kseniase
's
post
with 🔥
7 days ago
8 New Applications of Test-Time Scaling We've noticed a huge interest in test-time scaling (TTS), so we decided to explore this concept further. Test-time compute (TTC) refers to the amount of computational power used by an AI model when generating a response. Many researchers are now focused on scaling TTC, as it enables slow, deep "thinking" and step-by-step reasoning, which improves overall models' performance. Here are 8 fresh studies on test-time scaling: 1. https://huggingface.co/papers/2502.05171 Introduces an LM that scales TTC by reasoning in latent space instead of generating more tokens with no special training. Here, a recurrent block to processes information iteratively. 2. https://huggingface.co/papers/2502.04728 Shows how TTS is applied to enhance model's Planning Domain Definition Language (PDDL) reasoning capabilities, which can be used to generate a symbolic world model. 3. https://huggingface.co/papers/2502.06703 Analyzes optimal TTS strategies and shows how small models can outperform much larger ones. 4. https://huggingface.co/papers/2502.04128 Shows how TTS improves expressiveness, timbre consistency and accuracy in speech synthesis with Llasa framework. It also dives into benefits of scaling train-time compute. 5. https://huggingface.co/papers/2502.07154 Suggests a modified training loss for better reasoning of LLMs when scaling TTC. 6. https://huggingface.co/papers/2502.05078 Unifies the strengths of chain, tree, and graph paradigms into one framework that expands reasoning only on necessary subproblems. 7. https://huggingface.co/papers/2502.01839 Explores scaling trends of self-verification and how to improve its capabilities with TTC. 8. https://huggingface.co/papers/2501.14723 Explores how scaling serial compute (iterations) and parallel compute (trajectories), can improve accuracy in real-world software engineering issues. Also, explore our article about TTS for more -> https://huggingface.co/blog/Kseniase/testtimecompute
View all activity
Organizations
andthattoo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
6 days ago
microsoft/OmniParser-v2.0
Image-Text-to-Text
•
Updated
5 days ago
•
4.54k
•
880
liked
a model
7 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
•
Updated
14 days ago
•
1.09M
•
•
917
liked
a dataset
9 days ago
AI-MO/NuminaMath-1.5
Viewer
•
Updated
13 days ago
•
896k
•
2.09k
•
108
liked
a model
12 days ago
intfloat/multilingual-e5-large-instruct
Feature Extraction
•
Updated
7 days ago
•
512k
•
•
336
liked
a dataset
13 days ago
driaforall/verifiable-pythonic-function-calling-lite
Viewer
•
Updated
16 days ago
•
16.4k
•
203
•
6
liked
a Space
17 days ago
Running
on
A10G
1.24k
1.24k
GGUF My Repo
🦙
liked
a dataset
18 days ago
simplescaling/s1K
Viewer
•
Updated
13 days ago
•
1k
•
4.8k
•
178
liked
2 datasets
19 days ago
adyen/DABstep
Viewer
•
Updated
2 days ago
•
10.4k
•
2.54k
•
8
TIGER-Lab/AceCode-87K
Viewer
•
Updated
15 days ago
•
87.1k
•
893
•
32
liked
2 datasets
25 days ago
cognitivecomputations/dolphin-r1
Viewer
•
Updated
24 days ago
•
814k
•
5.75k
•
263
nisten/all-human-diseases
Viewer
•
Updated
Aug 19, 2024
•
2.2k
•
145
•
106
liked
a dataset
27 days ago
driaforall/pythonic-function-calling
Viewer
•
Updated
17 days ago
•
81.8k
•
512
•
19
liked
a model
27 days ago
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
8 days ago
•
1.58M
•
534
liked
a model
28 days ago
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
•
Updated
25 days ago
•
293k
•
238
liked
a model
29 days ago
dnhkng/RYS-XLarge
Text Generation
•
Updated
Oct 11, 2024
•
1.87k
•
85
liked
a dataset
29 days ago
THUDM/ComplexFuncBench
Updated
Jan 22
•
195
•
3
liked
4 models
about 1 month ago
nvidia/Llama-3.1-Nemotron-70B-Reward
Updated
Oct 15, 2024
•
40
•
71
qresearch/Llama-3.2-1B-Instruct-SAE-l9
Updated
Jan 22
•
13
Qwen/Qwen2.5-Coder-1.5B
Text Generation
•
Updated
Nov 18, 2024
•
10.1k
•
47
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
14 days ago
•
4.43M
•
•
9.99k
Load more