mini-llm-brawl

Implementing tiny versions of current state-of-art llms and comparing them on simple task

Training

With train.py you can train different models or different hyperparameter settings

You can change the hyperparamers for different models from models/model_name.py

train all available models

python3 train.py --model_params=[n_params]

train single model

python3 train.py [model_name] --model_params=[n_params]

train single model with different hyperparameters

python3 train.py [model_name] --tune --model_params=[n_params]

use --save if you want to save the model and losses

at the moment only 50M, 75M and 100M param models are supported

Comparision

See the training graph of training run

latest run

python3 compare.py

specific run

python3 compare.py [run_name]

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
models		models
.gitignore		.gitignore
README.md		README.md
compare.py		compare.py
generate.py		generate.py
graph-75M.png		graph-75M.png
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

mini-llm-brawl

Training

train all available models

train single model

train single model with different hyperparameters

Comparision

latest run

specific run

shakespare dataset - 75M models

About

Uh oh!

Releases

Packages

Languages

jonirajala/mini-llm-brawl

Folders and files

Latest commit

History

Repository files navigation

mini-llm-brawl

Training

train all available models

train single model

train single model with different hyperparameters

Comparision

latest run

specific run

shakespare dataset - 75M models

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages