This is the repository for the paper "Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks," by Angelica Chen*, Samuel D. Stanton*, Frances Ding, Robert G. Alberstein, Andrew M. Watkins, Richard Bonneau, Vladimir Gligorijević, Kyunghyun Cho, Nathan C. Frey
- Clone this repo to your home directory! (Some of the scripts in this repo assume that your code is stored in `~/llome`. If the path to your repo is different, change the `path_to_repo` option in the hydra config; see the override sketch after the sanity-check command below.)
- Install the requirements in a virtual environment:
```bash
uv venv --python 3.10
source .venv/bin/activate
uv pip install -r requirements.txt
```
- Run `aws configure` to ensure you are configured for access to S3 (if using S3 storage).
- Run the sanity-check pipeline:

```bash
python -m run_pipeline --config-name=pipeline_sanity_check_marge local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR>
```
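If your clone is not at `~/llome`, you can also pass the path on the command line instead of editing the YAML. The following is a minimal sketch using Hydra's override syntax; it assumes `path_to_repo` is a top-level key in the config (adjust the dotted path if it is nested), and `$HOME/code/llome` is just an illustrative location:

```bash
# Sketch: run the sanity check with an explicit repo path.
# path_to_repo is assumed to be a top-level config key; <PATH_TO_LOCAL_OUTPUT_DIR>
# and <S3_OUTPUT_DIR> are placeholders, as above.
python -m run_pipeline --config-name=pipeline_sanity_check_marge \
    path_to_repo=$HOME/code/llome \
    local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> \
    parent_output_dir=<S3_OUTPUT_DIR>
```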
- Storage: This pipeline makes use of both local storage (`local_output_dir` in the YAML config) and S3 storage (`parent_output_dir` in the YAML config). All model checkpoints are automatically copied to S3 and deleted from local storage after the end of a training job. By default, all generated seeds, generations, and training data are stored in `<config.parent_output_dir>/<config.run_name>`, where `config` is the Hydra config.
  - To disable S3 storage and use only local storage, simply set `parent_output_dir: "null"` in the config.
- Slurm Configurations: All training, dataset generation/formatting, and generation jobs are launched as separate SLURM jobs. These SLURM jobs are easily configured via the respective `slurm_args` subsections in the config.
- Resuming jobs: This pipeline is designed to automatically resume previous runs, assuming that the pipeline is run with the same arguments as a prior run. To disable this behavior, use `--overwrite=True` (see the sketch after this list).
- Training large models: We currently use DDP (without accompanying model parallelism) to train our models (FSDP is a TODO!), which means that you likely need GPU RAM that is ~4x the size of the model to train in full precision. Our code is currently only tested on single-node, multi-GPU setups.
- Analyzing results: To assess the performance of the LLM at iteration $i$, analyze the training data generated for iteration $i+1$ -- this accounts for both the likelihood-based ranking and filtering of the LLM at iteration $i$.
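As a concrete illustration of the resume/overwrite behavior described above (a sketch; the exact placement of `--overwrite=True` relative to the Hydra overrides may differ in practice):

```bash
# Re-running with the same arguments resumes the prior run where it left off.
python -m run_pipeline --config-name=pipeline_marge_f2 \
    local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR>

# Pass --overwrite=True to disable resumption and start the run from scratch.
python -m run_pipeline --config-name=pipeline_marge_f2 \
    local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR> \
    --overwrite=True
```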
Note: Update the `slurm_args` section in each config before running your experiment!
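You can also override SLURM resources at launch time rather than editing the YAML. This is a hypothetical sketch: the key names (`train.slurm_args.partition`, `train.slurm_args.gpus_per_node`) are illustrative only, not the actual keys in this repo's configs; consult the relevant `slurm_args` subsection for the real names.

```bash
# Hypothetical override sketch -- the slurm_args key names below are illustrative;
# check the config's slurm_args subsections for the actual keys.
python -m run_pipeline --config-name=pipeline_sft_f2 \
    local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR> \
    train.slurm_args.partition=<YOUR_PARTITION> \
    train.slurm_args.gpus_per_node=8
```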
LLOME-SFT:

```bash
python -m run_pipeline --config-name=pipeline_sft_f2 local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR>
```
LLOME-MargE:

```bash
python -m run_pipeline --config-name=pipeline_marge_f2 local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR>
```
LLOME-DPO:

```bash
python -m run_pipeline --config-name=pipeline_dpo_f2 local_output_dir=<PATH_TO_LOCAL_OUTPUT_DIR> parent_output_dir=<S3_OUTPUT_DIR>
```
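Once a run has started producing data, you can browse its artifacts on S3, e.g. to locate the iteration-$i+1$ training data used for the analysis described above. A sketch, assuming `parent_output_dir` is an `s3://` URI and `<RUN_NAME>` matches `run_name` in your config; the exact subdirectory names depend on the pipeline config:

```bash
# Sketch: list a run's artifacts to locate generated training data.
# <S3_OUTPUT_DIR> and <RUN_NAME> are placeholders; the directory layout depends on the config.
aws s3 ls <S3_OUTPUT_DIR>/<RUN_NAME>/ --recursive | grep -i train
```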