Mishig Davaadorj's picture

Mishig Davaadorj

mishig

·

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

updated a Space 1 day ago

huggingface/inference-playground

new activity 3 days ago

hf-doc-build/doc-build:Create trackio/_versions.yml

upvoted a paper 8 days ago

Hierarchical Reasoning Model

View all activity

Organizations

upvoted 2 papers 8 days ago

Hierarchical Reasoning Model

Paper • 2506.21734 • Published Jun 26 • 28

Mean Flows for One-step Generative Modeling

Paper • 2505.13447 • Published May 19 • 4

upvoted a paper 23 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 28 days ago • 289

upvoted an article about 1 month ago

Article

Arc Virtual Cell Challenge: A Primer

By

and 1 other •

Jul 18

• 51

upvoted a paper about 1 month ago

MindJourney: Test-Time Scaling with World Models for Spatial Reasoning

Paper • 2507.12508 • Published Jul 16 • 26

upvoted an article about 1 month ago

Article

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

By

•

Jul 16

• 133

upvoted 4 papers about 1 month ago

JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes

Paper • 2505.06771 • Published May 10 • 1

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 11

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Paper • 2506.14702 • Published Jun 17 • 4

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 60

upvoted 2 articles about 2 months ago

Article

Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them)

By

and 2 others •

Jul 1

• 21

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

By

•

Jul 2

• 73

upvoted 2 papers about 2 months ago

Pretrained Transformers as Universal Computation Engines

Paper • 2103.05247 • Published Mar 9, 2021 • 1

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Paper • 2506.21862 • Published Jun 27 • 36

upvoted 3 papers 2 months ago

Approximating Language Model Training Data from Weights

Paper • 2506.15553 • Published Jun 18 • 1

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Paper • 2410.21845 • Published Oct 29, 2024 • 16

Chain-of-Thought Reasoning is a Policy Improvement Operator

Paper • 2309.08589 • Published Sep 15, 2023 • 2

upvoted an article 2 months ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

By

and 5 others •

Jun 11

• 80

upvoted a changelog 2 months ago

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6

• 105

upvoted an article 2 months ago

Article

The Common Pile v0.1

By

and 2 others •

Jun 6

• 46