pyvene

university

https://github.com/stanfordnlp/pyvene

AI & ML interests

interpretability

Recent Activity

zhengxuanzenwu updated a model 10 days ago

pyvene/gemma-reft-r1-9b-it-res

zhengxuanzenwu updated a model 10 days ago

pyvene/gemma-reft-r1-2b-it-res

zhengxuanzenwu updated a model 10 days ago

pyvene/gemma-diffmean-2b-it-res

Organization Card

Who are we?

We are a group of hackers from Stanford's NLP group, and we are interested in LLM interpretability.

We started with pyvene, whose name stands for PyTorch model intervention.

Resources

Supervised dictionary learning (SDL) models and dataset releases for Gemma 2 2B and 9B: AxBench Collection.

Library for benchmarking interpretability methods at scale: AxBench.

Representation finetuning (ReFT) library: pyreft.

PyTorch model intervention library: pyvene.

Collections 1

spaces 6

Running on Zero

SDL-ReFT-cr1

🫠

Guide chatbot with specific topics

Running on Zero

SDL-ReFT-r1

🤔

Guide conversations with specific topics

Sleeping

ReFT-Golden-Gate-Bridge

🫠

Converse with an AI assistant that mimics the Golden Gate Bridge

Sleeping

ReFT-Chat7B

🫡

Generate responses to chat messages using ReFT-Chat

Running on Zero

ReFT-Emoji

🫡

Chat with an emoji-enhanced assistant

Sleeping

ReFT-Ethos

🚀

Converse with a helpful assistant in text form

models 12

datasets 5

pyvene/axbench-concept16k_v2

Viewer • Updated 12 days ago • 3.46M • 23

pyvene/axbench-conceptFD

Viewer • Updated 25 days ago • 5.33k • 100 • 2

pyvene/axbench-concept16k

Viewer • Updated about 1 month ago • 2.27M • 179 • 3

pyvene/axbench-concept500

Viewer • Updated about 1 month ago • 297k • 173 • 1

pyvene/axbench-concept10

Viewer • Updated about 1 month ago • 6.8k • 253 • 1

Team members 2
