Skip to content

Pulling from mosaic main #14

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 11 commits into from
Nov 14, 2023
Merged

Conversation

ShashankMosaicML
Copy link
Owner

No description provided.

dakinggg and others added 11 commits November 8, 2023 17:22
* dont use lambdas
* tokenizer building distributed safety
…#727)

* Make logs appear and disable InsecureRequestWarning for ignore_cert

* Clean up

* Repair symlinks after cache download

* Clean up logging

---------

Co-authored-by: Daniel King <[email protected]>
…t tied embeddings (#728)

* enable disabling embed weight tying

* fix bug

* updt with descriptive var names

* fix hf config

* move comment with code

* bug fix

* add _tie_weights method

* undo mcli yaml change

* refactor

* add tests

* Update llmfoundry/models/mpt/modeling_mpt.py

Co-authored-by: Sasha Doubov <[email protected]>

* pr comments

* updt tests to guard against numerical issues

---------

Co-authored-by: Sasha Doubov <[email protected]>
* add act checkpoint at sub layer level

* Update llmfoundry/models/mpt/modeling_mpt.py

Co-authored-by: Mihir Patel <[email protected]>

* address comments

* addess coments

* add log info

* fix pyright

* refactor

* better log info and error msg

* add test

* Update llmfoundry/models/mpt/modeling_mpt.py

Co-authored-by: Mihir Patel <[email protected]>

* remove unneeded comments

---------

Co-authored-by: Mihir Patel <[email protected]>
Co-authored-by: Daniel King <[email protected]>
@ShashankMosaicML ShashankMosaicML merged commit f209b58 into ShashankMosaicML:main Nov 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants