Skip to content

Conversation

pommedeterresautee
Copy link
Member

@pommedeterresautee pommedeterresautee commented Dec 21, 2021

QAT is now done through monkey patching to limit the amount of LOC to manage

Added QAT support to:

  • Electra
  • Albert
  • Distilbert
  • Bert
  • Roberta
  • Deberta V1 and V2 (but not working as they can't be exported to ONNX)

@pommedeterresautee pommedeterresautee self-assigned this Dec 21, 2021
@pommedeterresautee pommedeterresautee added performance improve performance quantization GPU/CPU quantization support labels Dec 21, 2021
@pommedeterresautee pommedeterresautee marked this pull request as ready for review December 28, 2021 22:25
@pommedeterresautee pommedeterresautee merged commit 404c5ee into main Dec 28, 2021
@pommedeterresautee pommedeterresautee deleted the monkey_patch branch December 28, 2021 22:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance improve performance quantization GPU/CPU quantization support
Development

Successfully merging this pull request may close these issues.

1 participant