Skip to content

jonirajala/mini-llm-brawl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

mini-llm-brawl

Implementing tiny versions of current state-of-art llms and comparing them on simple task

Training

With train.py you can train different models or different hyperparameter settings

You can change the hyperparamers for different models from models/model_name.py

train all available models

python3 train.py --model_params=[n_params]

train single model

python3 train.py [model_name] --model_params=[n_params]

train single model with different hyperparameters

python3 train.py [model_name] --tune --model_params=[n_params]

use --save if you want to save the model and losses

at the moment only 50M, 75M and 100M param models are supported

Comparision

See the training graph of training run

latest run

python3 compare.py

specific run

python3 compare.py [run_name]

shakespare dataset - 75M models

graph

About

Implementing tiny versions of current state-of-art llms and comparing them on simple task

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages