Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Zhenhua Han
hzhua
Follow
hzhua
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
authored
a paper
5 months ago
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
authored
a paper
8 months ago
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
View all activity
Organizations
None yet
Papers
3
arxiv:
2409.10516
arxiv:
2407.02490
arxiv:
2405.19888
models
None public yet
datasets
None public yet