
[Bugfix] Pass trust_remote_code_model=True for deepseek examples #1012


Merged · 1 commit into main · Dec 23, 2024

Conversation

@dsikka (Collaborator) commented Dec 23, 2024

SUMMARY:

  • Pass trust_remote_code_model=True for deepseek models
  • Needed due to slight differences in how the tokenizer and the processor pull down their relevant configs
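The fix amounts to forwarding the flag in each deepseek example's oneshot call. A rough sketch of the change (the surrounding argument names are illustrative, not the exact diff; only `trust_remote_code_model` is the parameter this PR adds):

```python
oneshot(
    model=model,
    dataset=dataset,
    recipe=recipe,
    trust_remote_code_model=True,  # the flag added by this PR
)
```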


👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

@horheynm (Contributor) commented Dec 23, 2024

why do we need this now? what change caused this?

> Needed due to slight differences in how the tokenizer and the processor pull down their relevant configs

Sorry, missed this.

@kylesayrs (Collaborator) commented

@horheynm AutoConfig and the tokenizer share the same config cache, meaning that if you pass trust_remote_code when loading the tokenizer, you do not have to pass it into oneshot (specifically, into the AutoConfig instantiation inside oneshot).

However, that cache is not shared between AutoConfig and processors. Previously we got away with not passing trust_remote_code_model because oneshot's AutoConfig call hit the config the tokenizer had already cached, but now that the examples load a processor, AutoConfig no longer shares a cache with it, so the flag must be passed explicitly.
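The caching pitfall can be shown with a toy sketch. This is NOT the real transformers internals; every name below is hypothetical. It only illustrates why a trust flag must be passed again once two loaders stop sharing a cache:

```python
# Toy stand-in for a shared config cache keyed by model name.
_config_cache = {}

def load_config(name, trust_remote_code=False):
    """Hypothetical loader resembling AutoConfig.from_pretrained."""
    if name not in _config_cache:
        if not trust_remote_code:
            raise ValueError("remote code not trusted for " + name)
        _config_cache[name] = {"model": name}
    return _config_cache[name]

# Old behavior: the tokenizer path warmed the cache with the flag set,
# so a later config lookup without the flag still succeeded.
load_config("deepseek-example", trust_remote_code=True)  # tokenizer path
cfg = load_config("deepseek-example")                    # oneshot path: cache hit

# New behavior: the processor keeps its own cache, so the config lookup
# misses and fails unless the flag is forwarded explicitly.
_config_cache.clear()  # simulate the no-longer-shared cache
try:
    load_config("deepseek-example")
except ValueError:
    print("oneshot needs trust_remote_code_model=True")
```

The second lookup only works because the first call populated the shared cache; clearing it models the processor switch, where that warm-up never reaches AutoConfig's cache.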

@dsikka dsikka merged commit 384059b into main Dec 23, 2024
6 of 8 checks passed
@dsikka dsikka deleted the fix_examples branch December 23, 2024 19:21

4 participants