Flex attention + refactor #34809

@ArthurZucker

Description

Opening this issue to track adding flex attention support across all models, following #34282.

Let's bring flex attention support to more models! 🤗

  • Gemma2

It would be great to add support for more architectures, such as:

  • Qwen2
  • Llama
  • Gemma
  • QwenVl
  • Mistral
  • Clip

... and many more
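
For context, the core idea behind flex attention (the real API lives in `torch.nn.attention.flex_attention`) is that instead of materializing a dense mask tensor, every raw attention score is routed through a user-supplied `score_mod` callback that can alter it based on its query/key positions. Below is a minimal, dependency-free sketch of that pattern; `naive_flex_attention` and `causal_score_mod` are hypothetical illustration names, not the actual PyTorch or transformers implementation.

```python
import math

NEG_INF = float("-inf")

def causal_score_mod(score, q_idx, kv_idx):
    """Keep the score only when the key position is not in the future."""
    return score if kv_idx <= q_idx else NEG_INF

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def naive_flex_attention(q, k, v, score_mod):
    """Single-head attention over lists of vectors; each raw dot-product
    score passes through score_mod before the softmax."""
    d = len(q[0])
    out = []
    for qi, qvec in enumerate(q):
        scores = [
            score_mod(sum(a * b for a, b in zip(qvec, kvec)) / math.sqrt(d), qi, ki)
            for ki, kvec in enumerate(k)
        ]
        weights = softmax(scores)
        out.append([sum(w * vec[j] for w, vec in zip(weights, v)) for j in range(d)])
    return out

# With a causal mod, position 0 can only attend to itself,
# so the first output row equals the first value vector.
q = k = v = [[1.0, 0.0], [0.0, 1.0]]
out = naive_flex_attention(q, k, v, causal_score_mod)
```

Supporting a model then mostly means wiring its attention layer so masks (causal, sliding-window, etc.) are expressed as `score_mod`/`block_mask` callbacks rather than dense tensors.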

For anyone who wants to contribute: just open a PR, link it to this issue, and ping me for a review! 🤗
