Use docker entrypoint args instead of custom entry point #521

achraf-mer · 2023-07-21T14:28:29Z

No description provided.

EshamAaqib · 2023-07-24T07:28:49Z

I don't think this will work because we need to pass the env vars from -e this reflects to env: or envFrom:on K8s deployments and Helm (Ex: https://github.com/h2oai/helium/blob/a639155f3939a5cf0f7a33f27da84c08c5feac26/helm/helium-chart/templates/deployment.yaml#L28 ). Without this it will be harder to pass the vars via Helm.

EshamAaqib · 2023-07-24T07:36:32Z

I would suggest changing the gen.py main function to something like this might work,

def main(
    load_8bit: bool = os.environ.get('LOAD_8BIT', 'False')
    load_4bit: bool = os.environ.get('LOAD_4BIT', 'False')
    load_half: bool = os.environ.get('LOAD_HALF', 'True')
    load_gptq: str = os.environ.get('LOAD_GPTQ', '')
    load_exllama: bool = os.environ.get('LOAD_EXLLAMA', 'False')
    use_safetensors: bool = os.environ.get('USE_SAFETENSORS', 'False')
.....
.....
.....
)

As per my knowledge this should directly fetch the env var set by Docker if not it will use the default value, Also we will not need to do any changes to Docker as well

ChathurindaRanasinghe · 2023-07-24T10:39:32Z

I would suggest changing the gen.py main function to something like this might work,
def main(
    load_8bit: bool = os.environ.get('LOAD_8BIT', 'False')
    load_4bit: bool = os.environ.get('LOAD_4BIT', 'False')
    load_half: bool = os.environ.get('LOAD_HALF', 'True')
    load_gptq: str = os.environ.get('LOAD_GPTQ', '')
    load_exllama: bool = os.environ.get('LOAD_EXLLAMA', 'False')
    use_safetensors: bool = os.environ.get('USE_SAFETENSORS', 'False')
.....
.....
.....
)
As per my knowledge this should directly fetch the env var set by Docker if not it will use the default value, Also we will not need to do any changes to Docker as well

CC: @achraf-mer

achraf-mer · 2023-07-24T13:50:42Z

I would suggest changing the gen.py main function to something like this might work,
def main(
    load_8bit: bool = os.environ.get('LOAD_8BIT', 'False')
    load_4bit: bool = os.environ.get('LOAD_4BIT', 'False')
    load_half: bool = os.environ.get('LOAD_HALF', 'True')
    load_gptq: str = os.environ.get('LOAD_GPTQ', '')
    load_exllama: bool = os.environ.get('LOAD_EXLLAMA', 'False')
    use_safetensors: bool = os.environ.get('USE_SAFETENSORS', 'False')
.....
.....
.....
)
As per my knowledge this should directly fetch the env var set by Docker if not it will use the default value, Also we will not need to do any changes to Docker as well

let's use command with a set of args in helm chart, IMO best than to sprinkle env vars, that way it is sure only one way to set flags and not two (env vars and query args).

pseudotensor

Thanks! @ChathurindaRanasinghe Please review post-merge.

Use docker entrypoint args instead of custom entry point

e18f36f

achraf-mer requested review from ChathurindaRanasinghe and EshamAaqib July 21, 2023 14:28

EshamAaqib requested a review from ozahavi July 24, 2023 07:37

pseudotensor self-requested a review August 1, 2023 17:39

pseudotensor approved these changes Aug 1, 2023

View reviewed changes

achraf-mer merged commit 0d588e5 into main Aug 1, 2023

achraf-mer deleted the am/refactor branch August 1, 2023 17:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use docker entrypoint args instead of custom entry point #521

Use docker entrypoint args instead of custom entry point #521

Uh oh!

achraf-mer commented Jul 21, 2023

Uh oh!

EshamAaqib commented Jul 24, 2023

Uh oh!

EshamAaqib commented Jul 24, 2023 •

edited

Loading

Uh oh!

ChathurindaRanasinghe commented Jul 24, 2023

Uh oh!

achraf-mer commented Jul 24, 2023

Uh oh!

pseudotensor left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use docker entrypoint args instead of custom entry point #521

Use docker entrypoint args instead of custom entry point #521

Uh oh!

Conversation

achraf-mer commented Jul 21, 2023

Uh oh!

EshamAaqib commented Jul 24, 2023

Uh oh!

EshamAaqib commented Jul 24, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ChathurindaRanasinghe commented Jul 24, 2023

Uh oh!

achraf-mer commented Jul 24, 2023

Uh oh!

pseudotensor left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

EshamAaqib commented Jul 24, 2023 •

edited

Loading