Yu Zhao's picture

Yu Zhao

yuzhaouoe

·

https://yuzhaouoe.github.io/

AI & ML interests

NLP/ML

Recent Activity

upvoted a paper about 2 months ago

Inverse Scaling in Test-Time Compute

upvoted a paper about 2 months ago

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

upvoted a paper 3 months ago

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

View all activity

Organizations

upvoted 2 papers about 2 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Paper • 2507.08800 • Published Jul 11 • 79

upvoted a paper 3 months ago

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published Jun 3 • 17

liked a dataset 4 months ago

ZhaoweiWang/MMLongBench

Preview • Updated May 15 • 408 • 3

authored a paper 4 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 54

upvoted a paper 4 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 54

upvoted an article 6 months ago

Article

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

By

•

Dec 25, 2024

• 16

updated a model 6 months ago

yuzhaouoe/Llama2-7b-SAE

Updated Mar 7 • 4 • 3

authored a paper 6 months ago

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4 • 10

upvoted a paper 6 months ago

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4 • 10

upvoted a collection 6 months ago

Q-Filters

Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7

updated 2 collections 6 months ago

Pre-Trianing Data Packing

[ACL'24] Analysing the Impact of Sequence Composition on Language Model Pre-Training. https://github.com/yuzhaouoe/pretraining-data-packing • 10 items • Updated Mar 3

SAE-Based Representation Engineering

[NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering • 5 items • Updated Mar 3

liked a Space 7 months ago

KaLM Embedding

Retrieve documents based on a query

upvoted a paper 7 months ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 69

liked a model 7 months ago

yuzhaouoe/Llama2-7b-SAE

Updated Mar 7 • 4 • 3

updated a collection 10 months ago

SAE-Based Representation Engineering

[NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering • 5 items • Updated Mar 3