Skip to content

A Snakemake workflow for analyzing codon usage, extracting ORFs, translating sequences to protein, and computing + plotting hydropathy profiles.

License

Notifications You must be signed in to change notification settings

bibymaths/codonanalyzer

Repository files navigation

CodonAnalyzer

Snakemake

DOI

Originally developed using Perl in 2015, this pipeline has been modernized in 2025 with reproducibility and modularity in mind.


Features

  • Codon usage statistics and GC content
  • Longest ORF extraction
  • Protein sequence translation
  • Hydropathy calculation (Kyte–Doolittle)
  • Plotting via Python + Matplotlib
  • Configurable via config.yaml
  • Reproducible workflow using Snakemake

Quickstart

git clone https://github.com/yourusername/codonanalyzer.git
cd codonanalyzer

Setup Environment

conda install -c conda-forge -c bioconda snakemake matplotlib numpy=1.26.0

or using environment.yml:

conda env create -f environment.yml 

Then, run:

snakemake --cores 1

Documentation

See the full documentation here.


Author

Abhinav Mishra
Email: [email protected]

For full historical context, see About.


License

BSD 3-Clause. See LICENSE.

About

A Snakemake workflow for analyzing codon usage, extracting ORFs, translating sequences to protein, and computing + plotting hydropathy profiles.

Topics

Resources

License

Stars

Watchers

Forks