Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Zeming Wei
ZemingWei
Follow
0 followers
·
2 following
https://weizeming.github.io
weizeming25
weizeming
AI & ML interests
Trustworthy AI
Recent Activity
authored
a paper
about 12 hours ago
False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
commented
on
a paper
2 days ago
False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
authored
a paper
over 1 year ago
Jailbreak and Guard Aligned Language Models with Only Few In-Context Demonstrations
View all activity
Organizations
None yet
ZemingWei
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
over 1 year ago
lmsys/toxic-chat
Viewer
•
Updated
May 14, 2024
•
20.3k
•
7.49k
•
165
liked
a model
almost 3 years ago
CompVis/stable-diffusion-v1-4
Text-to-Image
•
Updated
Aug 23, 2023
•
660k
•
6.91k