PyTorch now natively supports Flash Attention. I created a PR to add Flash Attention support for GPT-OSS:
https://github.com/huggingface/transformers/pull/42345
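For context, "native" Flash Attention in recent PyTorch means `torch.nn.functional.scaled_dot_product_attention` can dispatch to a fused Flash kernel, and `torch.nn.attention.sdpa_kernel` lets you pin that backend explicitly. Here's a minimal sketch (shapes, dtype, and device are just illustrative):

```python
import torch
from torch.nn.attention import SDPBackend, sdpa_kernel

# Toy query/key/value tensors: (batch, heads, seq_len, head_dim).
# The Flash kernel needs CUDA and fp16/bf16 inputs.
q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.bfloat16)
k = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.bfloat16)
v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.bfloat16)

# Restrict SDPA to the Flash Attention backend; this errors if the
# Flash kernel can't run in the current configuration.
with sdpa_kernel(SDPBackend.FLASH_ATTENTION):
    out = torch.nn.functional.scaled_dot_product_attention(q, k, v, is_causal=True)
```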
If you can't wait for the PR to be merged and released on PyPI, here's a patch:
https://gist.github.com/markrogersjr/ebada9ad3a31381d8d4e0d956c852569
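Once the PR is merged (or the patch is applied), loading the model should be a standard `transformers` call. Which `attn_implementation` value selects the new path depends on the final PR, so treat the one below as an assumption, along with the checkpoint name:

```python
import torch
from transformers import AutoModelForCausalLM

# Assumptions for illustration: the checkpoint name and the
# attn_implementation value that the PR/patch routes to PyTorch's
# Flash Attention backend. Check the merged PR for the exact string.
model = AutoModelForCausalLM.from_pretrained(
    "openai/gpt-oss-20b",
    torch_dtype=torch.bfloat16,
    attn_implementation="sdpa",
    device_map="auto",
)
```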