[Neo] Neo compilation/quantization script bugfixes #2115

a-ys · 2024-06-27T21:39:44Z

Description

This PR includes various fixes for the Neo Neuron compilation & vLLM quantization scripts.

9253e42 Hard-codes engine=Python in to the Neo Neuron compilation script so that errors in customer serving.properties do not cause compilation to fail.

405ea55 Removes a hanging reference to TARGET_INSTANCE_TYPE in the Neo Quantization script.

36810cc Adds logic to pass through engine and option.entryPoint to the outputted serving.properties. This is done so that when we compile with hardcoded values engine=Python and option.entryPoint=djl_python.transformers_neuronx, customer values for these are passed through to support custom entrypoints.

c1556ec Changes the output file format to this following:

Files in the input directory are directly copied to the output.
The outputs of compilation are saved in a subdirectory of the output: optimized_model
The outputted serving.properties sets model_id=./optimized_model so that the compiled model is used during deployment.
This is done to allow for custom entrypoint files & requirements files for serving. Introduces an issue of doubling the model size. This will be refined in future changes.

9ddd2c3 Adds a check to make tp_degree required in the Neo neuron compilation script.

Changes the Neo output file format to better support requirements.txt and custom entry point files. The compiler output will be saved to a subdirectory and the input files are copied over to the output.

) (cherry picked from commit 88f84ba)

Co-authored-by: Andrew Song <[email protected]>

a-ys added 5 commits June 18, 2024 18:45

[Neo] Hard-code engine in Neo TNX entrypoint

9253e42

Merge branch 'master' into v10_neo_patches

747bdbd

[Neo] Remove reference to target instance

405ea55

[Neo, Neuron] Pass-through engine, entryPoint

36810cc

[Neo] Change Neo output file format

c1556ec

Changes the Neo output file format to better support requirements.txt and custom entry point files. The compiler output will be saved to a subdirectory and the input files are copied over to the output.

a-ys force-pushed the v10_neo_patches branch from 2639652 to c1556ec Compare June 27, 2024 21:48

a-ys added 3 commits June 27, 2024 22:36

Merge branch 'master' into v10_neo_patches

3ef1bb7

[Neo] Require tp_degree in Neo Neuron script

9ddd2c3

Merge branch 'master' into v10_neo_patches

f508b3d

a-ys changed the title ~~[Neo] Fixing various Neo compilation/quantization script bugs~~ [Neo] Neo compilation/quantization script bugfixes Jul 9, 2024

a-ys marked this pull request as ready for review July 9, 2024 23:32

a-ys requested review from a team, frankfliu and zachgk as code owners July 9, 2024 23:32

lanking520 approved these changes Jul 10, 2024

View reviewed changes

lanking520 merged commit 88f84ba into deepjavalibrary:master Jul 11, 2024

tosterberg pushed a commit to tosterberg/djl-serving that referenced this pull request Jul 18, 2024

[Neo] Neo compilation/quantization script bugfixes (deepjavalibrary#2115

e22b1d3

) (cherry picked from commit 88f84ba)

tosterberg mentioned this pull request Jul 18, 2024

[cherrypick][0.28.0 dlc] Neo fixes (#2189) (#2115) (#2095) #2193

Merged

tosterberg added a commit that referenced this pull request Jul 18, 2024

[cherrypick][0.28.0 dlc] Neo fixes (#2189) (#2115) (#2095) (#2193)

9f664f9

Co-authored-by: Andrew Song <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Neo] Neo compilation/quantization script bugfixes #2115

[Neo] Neo compilation/quantization script bugfixes #2115

Uh oh!

a-ys commented Jun 27, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[Neo] Neo compilation/quantization script bugfixes #2115

[Neo] Neo compilation/quantization script bugfixes #2115

Uh oh!

Conversation

a-ys commented Jun 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

a-ys commented Jun 27, 2024 •

edited

Loading