Zhimeng Guo's picture

21 116

Zhimeng Guo

zhimeng

·

https://zhimeng.page

AI & ML interests

Machine Learning

Recent Activity

published a model 2 days ago

zhimeng/Qwen2.5-1.5B-Open-R1-Code-GRPO

liked a model 2 days ago

deepseek-ai/DeepSeek-R1

published a model 2 days ago

zhimeng/Qwen2.5-1.5B-Open-R1-Distill

View all activity

Organizations

zhimeng's activity

upvoted an article 12 days ago

Article

Open R1: Update #2

By

and 6 others •

13 days ago

• 184

upvoted a paper 4 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

upvoted 3 papers 11 months ago

PointInfinity: Resolution-Invariant Point Diffusion Models

Paper • 2404.03566 • Published Apr 4, 2024 • 14

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 69

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13, 2024 • 48

upvoted 5 papers 12 months ago

Sequence Parallelism: Long Sequence Training from System Perspective

Paper • 2105.13120 • Published May 26, 2021 • 5

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7, 2024 • 62

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 63

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Paper • 2403.05438 • Published Mar 8, 2024 • 20

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Paper • 2401.11708 • Published Jan 22, 2024 • 30

upvoted 5 papers about 1 year ago

Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20, 2024 • 31

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 80

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Paper • 2402.08017 • Published Feb 12, 2024 • 26

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Paper • 2402.07456 • Published Feb 12, 2024 • 44

Model Editing with Canonical Examples

Paper • 2402.06155 • Published Feb 9, 2024 • 13

upvoted a collection about 1 year ago

OLMo Suite

Artifacts for the first set of OLMo models. • 18 items • Updated 13 days ago • 71

upvoted 4 papers about 1 year ago

Scavenging Hyena: Distilling Transformers into Long Convolution Models

Paper • 2401.17574 • Published Jan 31, 2024 • 16

Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach

Paper • 2401.02987 • Published Jan 2, 2024 • 10

Instruct-Imagen: Image Generation with Multi-modal Instruction

Paper • 2401.01952 • Published Jan 3, 2024 • 31

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 35