- Major refactor
- Add support for FSDP2 and TP
- Add support for various activation functions and introduce LRA function
- Add support for FlexAttention, xFormers, FlashAttention3. Improve custom mask and sliding window handling
Full Changelog: v5.0.0...v6.0.0