Skip to content

Conversation

@tosterberg
Copy link
Contributor

Description

Fixes the quantization path when doing AOT partitioning for weight only quantization strategies, since these do not require any AOT model changes.

  • If this change is a backward incompatible change, why must this change be made?
  • Interesting edge cases to note here

@tosterberg tosterberg requested review from a team, frankfliu and zachgk as code owners June 17, 2024 23:29
@sindhuvahinis sindhuvahinis merged commit 00f7412 into deepjavalibrary:master Jun 17, 2024
@tosterberg tosterberg deleted the fix-neo-quant-neuron branch June 17, 2024 23:42
sindhuvahinis pushed a commit to sindhuvahinis/djl-serving that referenced this pull request Jun 17, 2024
tosterberg added a commit that referenced this pull request Jun 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants