OpenMS — open-source C++ framework with Python bindings (pyOpenMS) for liquid chromatography-mass spectrometry (LC-MS/MS) data analysis. Provides 200+ TOPP command-line tools for proteomics, metabolom

8

pbccs (CCS)

Generate highly accurate HiFi long reads from PacBio SMRT sequencing data using Circular Consensus Sequencing (CCS). Use this skill when working with PacBio subreads BAM files to produce consensus rea

8

PBSIM3

PBSIM3 simulates PacBio CLR, PacBio CCS/HiFi, and Oxford Nanopore Technology long reads from a reference FASTA using HMM-based error models. Use when users need to generate synthetic long reads for be

8

pbsv

pbsv — PacBio structural variant (SV) caller for HiFi and CLR long-read sequencing data. Discovers and genotypes deletions, insertions, duplications, inversions, translocations, and breakends from ali

8

PGGB

Verified

PGGB (PanGenome Graph Builder) — constructs unbiased pangenome variation graphs from multiple whole-genome assemblies using all-vs-all pairwise alignment (wfmash), graph induction (seqwish), and graph

8

PGS Catalog

PGS Catalog utilities — download, validate, match, and calculate polygenic scores (PGS) from the PGS Catalog database. Provides CLI tools for fetching scoring files by PGS ID or trait, combining multi

8

PHATE

PHATE (Potential of Heat-diffusion for Affinity-based Trajectory Embedding) — dimensionality reduction for visualizing high-dimensional biological data while preserving both local and global structure

8

phyloseq -- Microbiome Data Analysis in R

phyloseq -- R/Bioconductor package for importing, storing, analyzing, and visualizing microbiome census data. Handles OTU/ASV tables, taxonomy tables, sample metadata, phylogenetic trees, and referenc

8

PICRUSt2

PICRUSt2 (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States 2) predicts functional composition of microbial metagenomes from 16S rRNA, ITS1, or ITS2 amplicon marker gene

8

PLINK 2.0

PLINK 2.0 — whole-genome association analysis toolset for large-scale genotype data management, quality control (missingness, HWE, MAF filtering), population stratification correction (PCA), linkage d

8

PPanGGOLiN

Use when building pangenome graphs or analyzing pangenome structure across prokaryotic genomes. PPanGGOLiN partitions gene families into persistent, shell, and cloud categories using a statistical mod

8

Preseq

Preseq — C++ toolkit for predicting library complexity and future sequencing yield from genomic sequencing experiments. Estimates distinct molecule counts, extrapolates coverage curves, and identifies

8

pyBigWig

pyBigWig is a Python library for reading and writing bigWig and bigBed files, the standard binary genome-browser signal formats. Use when users need to query per-base coverage or signal values from bi

8

RDP

RDP (Ribosomal Database Project) — curated 16S rRNA, 28S rRNA, and ITS fungal sequence database with integrated classification, alignment, and probe-matching tools. The RDP Classifier assigns taxonomy

8

RiboCode

Use when working with ribosome profiling (Ribo-seq) data to identify actively translated open reading frames (ORFs) using RiboCode — a Python tool for de novo ORF annotation from ribosome footprinting

8

Ribo-TISH

Use when working with ribosome profiling (Ribo-seq) data and ORF discovery using Ribo-TISH — a Python toolkit for genome-wide identification and differential analysis of translational initiation. Cove

8

rMATS

rMATS (Replicate Multivariate Analysis of Transcript Splicing) — statistical tool for detecting differential alternative splicing from RNA-seq data. Identifies five splicing event types: skipped exon

8

SAIGE-GENE+

SAIGE-GENE+ — gene-based and region-based rare variant association testing using mixed models. Extends SAIGE to aggregate rare variants (SKAT, Burden, SKAT-O) while controlling for population structur

8

Sambamba mkdup

Sambamba mkdup routes duplicate-marking workflows for coordinate-sorted BAM files using `sambamba markdup`. Use this skill when a user needs to mark or remove duplicate reads, tune `--hash-table-size`

8

Segway

Use when working with segway — segway — probabilistic genome segmentation

8

SIRIUS

SIRIUS routing skill for metabolomics and lipidomics LC-MS/MS compound identification. Use this skill when users mention SIRIUS, CSI:FingerID, CANOPUS, ZODIAC, molecular formula prediction, structure

8

SNPRelate

Use when working with SNPRelate — a Bioconductor R package for high-performance SNP-based population genetics analysis. Supports principal component analysis (PCA), identity by descent (IBD), Fst esti

8

Subread

Subread — high-performance read alignment and quantification package using a seed-and-vote mapping strategy. Includes subread-align for genomic DNA and RNA-seq read alignment, subjunc for exon-junctio

8

tabix

tabix — fast random-access indexer for bgzip-compressed, tab-delimited genomic position files (VCF, BED, GFF3, GTF, SAM, and custom formats). Creates .tbi or .csi index files enabling sub-second regio

8

TADbit

TADbit — Python toolkit for end-to-end Hi-C data analysis including read mapping, contact matrix construction, ICE normalization, TAD (Topologically Associating Domain) detection, and 3D chromatin str

8

TaxonKit

Use when working with TaxonKit — a cross-platform NCBI taxonomy toolkit for querying lineages, converting names to TaxIds, reformatting taxonomy strings, filtering by rank, computing lowest common anc

8

UMI-tools

UMI-tools — Python toolkit for processing Unique Molecular Identifiers (UMIs) in sequencing data. Provides end-to-end UMI workflows including extraction from FASTQ reads, cell barcode demultiplexing f

8

UnMICST

Verified

UnMICST (Universal Multiplexed Image Segmentation and Classification Tool) — deep-learning nuclear segmentation for multiplexed tissue imaging. Generates probability maps (background, nucleus interior

8

vcflib

vcflib is a C++ library and suite of command-line tools for parsing, filtering, merging, annotating, and transforming VCF (Variant Call Format) files. Use when users need to filter variants by INFO/FO

8

Velvet

Velvet — de Bruijn graph short-read genome assembler for Illumina, Solexa, and 454 sequencing data. Runs as a two-stage pipeline: velveth builds the hash table and de Bruijn graph, velvetg traverses t

8

WhatsHap

WhatsHap -- read-based phasing (haplotype assembly) of genomic variants from sequencing reads. Phases diploid and polyploid genomes using long reads (PacBio, ONT) or short reads (Illumina). Key subcom

8

AICSImageIO

AICSImageIO — Python library for reading and writing multi-dimensional microscopy image data. Supports CZI (Zeiss), LIF (Leica), ND2 (Nikon), OME-TIFF, TIFF, and DV formats via a unified AICSImage int

9

AMBER

AMBER (Assisted Model Building with Energy Refinement) — suite of molecular dynamics simulation programs for biomolecular systems. Provides force fields (ff14SB, ff19SB, GAFF2), system preparation (tl

10

antiSMASH

antiSMASH (antibiotic & Secondary Metabolite Analysis SHell) — genome mining tool for identifying and annotating biosynthetic gene clusters (BGCs) in bacterial and fungal genomes. Detects NRPS, PKS, t

10

AUCell

Use when working with AUCell — an R/Python tool for scoring gene set activity in single cells using the Area Under the Curve (AUC) metric. Computes per-cell activity scores for gene sets (regulons, pa

10

AutoDock Vina

AutoDock Vina — fast open-source molecular docking engine for predicting protein-ligand binding poses and affinities. Uses gradient-optimization conformational search with Vina, Vinardo, or AutoDock4

11

BAGEL2

BAGEL2 — Bayesian Analysis of Gene Essentiality v2 for identifying essential genes from CRISPR/Cas9 knockout screens. Calculates Bayes Factors from fold-change distributions using reference sets of co

10

bigsnpr

Use when working with bigsnpr — an R package for analysis of massive

9

BioPython.PDB

BioPython.PDB -- Python module for structural biology and macromolecular structure analysis. Parses PDB and mmCIF files, navigates the Structure-Model-Chain-Residue-Atom (SMCRA) hierarchy, computes at

9

BioRender

BioRender — web-based platform for creating professional scientific figures, illustrations, and diagrams. Use for designing publication-quality figures for research papers, grants, posters, and presen

9

Boltz-1

Boltz-1 — open-source deep learning model for predicting biomolecular 3D structures and interactions, approaching AlphaFold3-level accuracy. Supports protein, DNA, RNA, and small-molecule ligand struc

11

Bonito

Bonito — open-source PyTorch research basecaller for Oxford Nanopore Technologies (ONT) long-read sequencing. Designed for basecaller training and model development, not production basecalling (use Do

9

BrainGlobe Atlas API

Use when working with BrainGlobe Atlas API (bg-atlasapi / brainglobe-atlasapi) — a Python library for programmatic access to standardized brain atlases including the Allen Mouse Brain Atlas, Waxholm r

10

brainrender

Use when working with brainrender — a Python library for 3D visualization of neuroscience data including brain atlases, neuron morphologies, cell distributions, and volumetric data. Part of the BrainG

10

BUGS/WinBUGS

BUGS/WinBUGS — Bayesian inference Using Gibbs Sampling for complex hierarchical models via MCMC. Model specification in the BUGS language with stochastic (~) and deterministic (<-) nodes, 30+ distribu

11

buttery-eel

buttery-eel — streaming Guppy/Dorado basecaller wrapper for Oxford Nanopore SLOW5/BLOW5 format long-read sequencing data. Streams reads from SLOW5 or BLOW5 files directly through a running Guppy serve

9

Caduceus

Caduceus — bi-directional DNA foundation model built on the Mamba state-space architecture for genomic sequence modeling. Provides bi-directional long-range sequence understanding with reverse complem

9

Canopy

Canopy — Bayesian phylogenetic reconstruction of tumor subclonal architecture from somatic SNVs and copy number alterations (CNAs). Uses MCMC sampling to infer cancer evolutionary trees, estimate subc

9

Cas-OFFinder

Cas-OFFinder — OpenCL-accelerated off-target site finder for CRISPR/Cas RNA-guided endonucleases. Searches whole genomes for potential off-target sites with configurable mismatch thresholds, DNA/RNA b

10

CCP4

CCP4 — Collaborative Computational Project Number 4 software suite for macromolecular X-ray crystallography. Provides programs for data processing (AIMLESS, POINTLESS), molecular replacement (Phaser),

10

Tool	Registry	Domain	Docs
OpenMS OpenMS — open-source C++ framework with Python bindings (pyOpenMS) for liquid chromatography-mass spectrometry (LC-MS/MS) data analysis. Provides 200+ TOPP command-line tools for proteomics, metabolom	OpenMS/OpenMS	Proteomics	8
pbccs (CCS) Generate highly accurate HiFi long reads from PacBio SMRT sequencing data using Circular Consensus Sequencing (CCS). Use this skill when working with PacBio subreads BAM files to produce consensus rea	PacificBiosciences/ccs	Genomics	8
PBSIM3 PBSIM3 simulates PacBio CLR, PacBio CCS/HiFi, and Oxford Nanopore Technology long reads from a reference FASTA using HMM-based error models. Use when users need to generate synthetic long reads for be	yukiteruono/pbsim3	QC & Preprocessing	8
pbsv pbsv — PacBio structural variant (SV) caller for HiFi and CLR long-read sequencing data. Discovers and genotypes deletions, insertions, duplications, inversions, translocations, and breakends from ali	PacificBiosciences/pbsv	Genomics	8
PGGB Verified PGGB (PanGenome Graph Builder) — constructs unbiased pangenome variation graphs from multiple whole-genome assemblies using all-vs-all pairwise alignment (wfmash), graph induction (seqwish), and graph	pangenome/pggb	Genomics	8
PGS Catalog PGS Catalog utilities — download, validate, match, and calculate polygenic scores (PGS) from the PGS Catalog database. Provides CLI tools for fetching scoring files by PGS ID or trait, combining multi	PGScatalog/pgscatalog_utils	Utilities & Infrastructure	8
PHATE PHATE (Potential of Heat-diffusion for Affinity-based Trajectory Embedding) — dimensionality reduction for visualizing high-dimensional biological data while preserving both local and global structure	KrishnaswamyLab/PHATE	Statistics	8
phyloseq -- Microbiome Data Analysis in R phyloseq -- R/Bioconductor package for importing, storing, analyzing, and visualizing microbiome census data. Handles OTU/ASV tables, taxonomy tables, sample metadata, phylogenetic trees, and referenc	joey711/phyloseq	Metagenomics	8
PICRUSt2 PICRUSt2 (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States 2) predicts functional composition of microbial metagenomes from 16S rRNA, ITS1, or ITS2 amplicon marker gene	picrust/picrust2	Metagenomics	8
PLINK 2.0 PLINK 2.0 — whole-genome association analysis toolset for large-scale genotype data management, quality control (missingness, HWE, MAF filtering), population stratification correction (PCA), linkage d	chrchang/plink-ng	Genomics	8
PPanGGOLiN Use when building pangenome graphs or analyzing pangenome structure across prokaryotic genomes. PPanGGOLiN partitions gene families into persistent, shell, and cloud categories using a statistical mod	labgem/PPanGGOLiN	Metagenomics	8
Preseq Preseq — C++ toolkit for predicting library complexity and future sequencing yield from genomic sequencing experiments. Estimates distinct molecule counts, extrapolates coverage curves, and identifies	smithlabcode/preseq	QC & Preprocessing	8
pyBigWig pyBigWig is a Python library for reading and writing bigWig and bigBed files, the standard binary genome-browser signal formats. Use when users need to query per-base coverage or signal values from bi	deeptools/pyBigWig	Utilities & Infrastructure	8
RDP RDP (Ribosomal Database Project) — curated 16S rRNA, 28S rRNA, and ITS fungal sequence database with integrated classification, alignment, and probe-matching tools. The RDP Classifier assigns taxonomy	rdpstaff/classifier	Utilities & Infrastructure	8
RiboCode Use when working with ribosome profiling (Ribo-seq) data to identify actively translated open reading frames (ORFs) using RiboCode — a Python tool for de novo ORF annotation from ribosome footprinting	xryanglab/RiboCode	Transcriptomics	8
Ribo-TISH Use when working with ribosome profiling (Ribo-seq) data and ORF discovery using Ribo-TISH — a Python toolkit for genome-wide identification and differential analysis of translational initiation. Cove	zhpn1024/ribotish	Genomics	8
rMATS rMATS (Replicate Multivariate Analysis of Transcript Splicing) — statistical tool for detecting differential alternative splicing from RNA-seq data. Identifies five splicing event types: skipped exon	Xinglab/rmats-turbo	Transcriptomics	8
SAIGE-GENE+ SAIGE-GENE+ — gene-based and region-based rare variant association testing using mixed models. Extends SAIGE to aggregate rare variants (SKAT, Burden, SKAT-O) while controlling for population structur	saigegit/SAIGE	Population Genetics	8
Sambamba mkdup Sambamba mkdup routes duplicate-marking workflows for coordinate-sorted BAM files using `sambamba markdup`. Use this skill when a user needs to mark or remove duplicate reads, tune `--hash-table-size`	biod/sambamba	Utilities & Infrastructure	8
Segway Use when working with segway — segway — probabilistic genome segmentation	hoffmangroup/segway	Epigenomics	8
SIRIUS SIRIUS routing skill for metabolomics and lipidomics LC-MS/MS compound identification. Use this skill when users mention SIRIUS, CSI:FingerID, CANOPUS, ZODIAC, molecular formula prediction, structure	boecker-lab/sirius	Systems Biology	8
SNPRelate Use when working with SNPRelate — a Bioconductor R package for high-performance SNP-based population genetics analysis. Supports principal component analysis (PCA), identity by descent (IBD), Fst esti	zhengxwen/SNPRelate	Other	8
Subread Subread — high-performance read alignment and quantification package using a seed-and-vote mapping strategy. Includes subread-align for genomic DNA and RNA-seq read alignment, subjunc for exon-junctio	ShiLab-Bioinformatics/subread	Transcriptomics	8
tabix tabix — fast random-access indexer for bgzip-compressed, tab-delimited genomic position files (VCF, BED, GFF3, GTF, SAM, and custom formats). Creates .tbi or .csi index files enabling sub-second regio	samtools/htslib	Utilities & Infrastructure	8
TADbit TADbit — Python toolkit for end-to-end Hi-C data analysis including read mapping, contact matrix construction, ICE normalization, TAD (Topologically Associating Domain) detection, and 3D chromatin str	3DGenomes/TADbit	Genomics	8
TaxonKit Use when working with TaxonKit — a cross-platform NCBI taxonomy toolkit for querying lineages, converting names to TaxIds, reformatting taxonomy strings, filtering by rank, computing lowest common anc	shenwei356/taxonkit	Other	8
UMI-tools UMI-tools — Python toolkit for processing Unique Molecular Identifiers (UMIs) in sequencing data. Provides end-to-end UMI workflows including extraction from FASTQ reads, cell barcode demultiplexing f	CGATOxford/UMI-tools	QC & Preprocessing	8
UnMICST Verified UnMICST (Universal Multiplexed Image Segmentation and Classification Tool) — deep-learning nuclear segmentation for multiplexed tissue imaging. Generates probability maps (background, nucleus interior	HMS-IDAC/UnMICST	Imaging	8
vcflib vcflib is a C++ library and suite of command-line tools for parsing, filtering, merging, annotating, and transforming VCF (Variant Call Format) files. Use when users need to filter variants by INFO/FO	vcflib/vcflib	Utilities & Infrastructure	8
Velvet Velvet — de Bruijn graph short-read genome assembler for Illumina, Solexa, and 454 sequencing data. Runs as a two-stage pipeline: velveth builds the hash table and de Bruijn graph, velvetg traverses t	dzerbino/velvet	Genomics	8
WhatsHap WhatsHap -- read-based phasing (haplotype assembly) of genomic variants from sequencing reads. Phases diploid and polyploid genomes using long reads (PacBio, ONT) or short reads (Illumina). Key subcom	whatshap/whatshap	Genomics	8
AICSImageIO AICSImageIO — Python library for reading and writing multi-dimensional microscopy image data. Supports CZI (Zeiss), LIF (Leica), ND2 (Nikon), OME-TIFF, TIFF, and DV formats via a unified AICSImage int	AllenCellModeling/aicsimageio	Imaging	9
AMBER AMBER (Assisted Model Building with Energy Refinement) — suite of molecular dynamics simulation programs for biomolecular systems. Provides force fields (ff14SB, ff19SB, GAFF2), system preparation (tl	conda-forge/ambertools-feedstock	Structure Prediction	10
antiSMASH antiSMASH (antibiotic & Secondary Metabolite Analysis SHell) — genome mining tool for identifying and annotating biosynthetic gene clusters (BGCs) in bacterial and fungal genomes. Detects NRPS, PKS, t	antismash/antismash	Metagenomics	10
AUCell Use when working with AUCell — an R/Python tool for scoring gene set activity in single cells using the Area Under the Curve (AUC) metric. Computes per-cell activity scores for gene sets (regulons, pa	aertslab/AUCell	Transcriptomics	10
AutoDock Vina AutoDock Vina — fast open-source molecular docking engine for predicting protein-ligand binding poses and affinities. Uses gradient-optimization conformational search with Vina, Vinardo, or AutoDock4	ccsb-scripps/AutoDock-Vina	Structure Prediction	11
BAGEL2 BAGEL2 — Bayesian Analysis of Gene Essentiality v2 for identifying essential genes from CRISPR/Cas9 knockout screens. Calculates Bayes Factors from fold-change distributions using reference sets of co	hart-lab/bagel	Systems Biology	10
bigsnpr Use when working with bigsnpr — an R package for analysis of massive	privefl/bigsnpr	Population Genetics	9
BioPython.PDB BioPython.PDB -- Python module for structural biology and macromolecular structure analysis. Parses PDB and mmCIF files, navigates the Structure-Model-Chain-Residue-Atom (SMCRA) hierarchy, computes at	biopython/biopython	Structure Prediction	9
BioRender BioRender — web-based platform for creating professional scientific figures, illustrations, and diagrams. Use for designing publication-quality figures for research papers, grants, posters, and presen	manual	Imaging	9
Boltz-1 Boltz-1 — open-source deep learning model for predicting biomolecular 3D structures and interactions, approaching AlphaFold3-level accuracy. Supports protein, DNA, RNA, and small-molecule ligand struc	jwohlwend/boltz	Structure Prediction	11
Bonito Bonito — open-source PyTorch research basecaller for Oxford Nanopore Technologies (ONT) long-read sequencing. Designed for basecaller training and model development, not production basecalling (use Do	nanoporetech/bonito	Genomics	9
BrainGlobe Atlas API Use when working with BrainGlobe Atlas API (bg-atlasapi / brainglobe-atlasapi) — a Python library for programmatic access to standardized brain atlases including the Allen Mouse Brain Atlas, Waxholm r	brainglobe/bg-atlasapi	Imaging	10
brainrender Use when working with brainrender — a Python library for 3D visualization of neuroscience data including brain atlases, neuron morphologies, cell distributions, and volumetric data. Part of the BrainG	brainglobe/brainrender	Imaging	10
BUGS/WinBUGS BUGS/WinBUGS — Bayesian inference Using Gibbs Sampling for complex hierarchical models via MCMC. Model specification in the BUGS language with stochastic (~) and deterministic (<-) nodes, 30+ distribu	manual	Statistics	11
buttery-eel buttery-eel — streaming Guppy/Dorado basecaller wrapper for Oxford Nanopore SLOW5/BLOW5 format long-read sequencing data. Streams reads from SLOW5 or BLOW5 files directly through a running Guppy serve	Psy-Fer/buttery-eel	Genomics	9
Caduceus Caduceus — bi-directional DNA foundation model built on the Mamba state-space architecture for genomic sequence modeling. Provides bi-directional long-range sequence understanding with reverse complem	kuleshov-group/caduceus	Machine Learning	9
Canopy Canopy — Bayesian phylogenetic reconstruction of tumor subclonal architecture from somatic SNVs and copy number alterations (CNAs). Uses MCMC sampling to infer cancer evolutionary trees, estimate subc	yuchaojiang/Canopy	Genomics	9
Cas-OFFinder Cas-OFFinder — OpenCL-accelerated off-target site finder for CRISPR/Cas RNA-guided endonucleases. Searches whole genomes for potential off-target sites with configurable mismatch thresholds, DNA/RNA b	snugel/cas-offinder	Systems Biology	10
CCP4 CCP4 — Collaborative Computational Project Number 4 software suite for macromolecular X-ray crystallography. Provides programs for data processing (AIMLESS, POINTLESS), molecular replacement (Phaser),	manual	Structure Prediction	10

Tool	Registry	Domain	Docs
OpenMS OpenMS — open-source C++ framework with Python bindings (pyOpenMS) for liquid chromatography-mass spectrometry (LC-MS/MS) data analysis. Provides 200+ TOPP command-line tools for proteomics, metabolom	OpenMS/OpenMS	Proteomics	8
pbccs (CCS) Generate highly accurate HiFi long reads from PacBio SMRT sequencing data using Circular Consensus Sequencing (CCS). Use this skill when working with PacBio subreads BAM files to produce consensus rea	PacificBiosciences/ccs	Genomics	8
PBSIM3 PBSIM3 simulates PacBio CLR, PacBio CCS/HiFi, and Oxford Nanopore Technology long reads from a reference FASTA using HMM-based error models. Use when users need to generate synthetic long reads for be	yukiteruono/pbsim3	QC & Preprocessing	8
pbsv pbsv — PacBio structural variant (SV) caller for HiFi and CLR long-read sequencing data. Discovers and genotypes deletions, insertions, duplications, inversions, translocations, and breakends from ali	PacificBiosciences/pbsv	Genomics	8
PGGB Verified PGGB (PanGenome Graph Builder) — constructs unbiased pangenome variation graphs from multiple whole-genome assemblies using all-vs-all pairwise alignment (wfmash), graph induction (seqwish), and graph	pangenome/pggb	Genomics	8
PGS Catalog PGS Catalog utilities — download, validate, match, and calculate polygenic scores (PGS) from the PGS Catalog database. Provides CLI tools for fetching scoring files by PGS ID or trait, combining multi	PGScatalog/pgscatalog_utils	Utilities & Infrastructure	8
PHATE PHATE (Potential of Heat-diffusion for Affinity-based Trajectory Embedding) — dimensionality reduction for visualizing high-dimensional biological data while preserving both local and global structure	KrishnaswamyLab/PHATE	Statistics	8
phyloseq -- Microbiome Data Analysis in R phyloseq -- R/Bioconductor package for importing, storing, analyzing, and visualizing microbiome census data. Handles OTU/ASV tables, taxonomy tables, sample metadata, phylogenetic trees, and referenc	joey711/phyloseq	Metagenomics	8
PICRUSt2 PICRUSt2 (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States 2) predicts functional composition of microbial metagenomes from 16S rRNA, ITS1, or ITS2 amplicon marker gene	picrust/picrust2	Metagenomics	8
PLINK 2.0 PLINK 2.0 — whole-genome association analysis toolset for large-scale genotype data management, quality control (missingness, HWE, MAF filtering), population stratification correction (PCA), linkage d	chrchang/plink-ng	Genomics	8
PPanGGOLiN Use when building pangenome graphs or analyzing pangenome structure across prokaryotic genomes. PPanGGOLiN partitions gene families into persistent, shell, and cloud categories using a statistical mod	labgem/PPanGGOLiN	Metagenomics	8
Preseq Preseq — C++ toolkit for predicting library complexity and future sequencing yield from genomic sequencing experiments. Estimates distinct molecule counts, extrapolates coverage curves, and identifies	smithlabcode/preseq	QC & Preprocessing	8
pyBigWig pyBigWig is a Python library for reading and writing bigWig and bigBed files, the standard binary genome-browser signal formats. Use when users need to query per-base coverage or signal values from bi	deeptools/pyBigWig	Utilities & Infrastructure	8
RDP RDP (Ribosomal Database Project) — curated 16S rRNA, 28S rRNA, and ITS fungal sequence database with integrated classification, alignment, and probe-matching tools. The RDP Classifier assigns taxonomy	rdpstaff/classifier	Utilities & Infrastructure	8
RiboCode Use when working with ribosome profiling (Ribo-seq) data to identify actively translated open reading frames (ORFs) using RiboCode — a Python tool for de novo ORF annotation from ribosome footprinting	xryanglab/RiboCode	Transcriptomics	8
Ribo-TISH Use when working with ribosome profiling (Ribo-seq) data and ORF discovery using Ribo-TISH — a Python toolkit for genome-wide identification and differential analysis of translational initiation. Cove	zhpn1024/ribotish	Genomics	8
rMATS rMATS (Replicate Multivariate Analysis of Transcript Splicing) — statistical tool for detecting differential alternative splicing from RNA-seq data. Identifies five splicing event types: skipped exon	Xinglab/rmats-turbo	Transcriptomics	8
SAIGE-GENE+ SAIGE-GENE+ — gene-based and region-based rare variant association testing using mixed models. Extends SAIGE to aggregate rare variants (SKAT, Burden, SKAT-O) while controlling for population structur	saigegit/SAIGE	Population Genetics	8
Sambamba mkdup Sambamba mkdup routes duplicate-marking workflows for coordinate-sorted BAM files using `sambamba markdup`. Use this skill when a user needs to mark or remove duplicate reads, tune `--hash-table-size`	biod/sambamba	Utilities & Infrastructure	8
Segway Use when working with segway — segway — probabilistic genome segmentation	hoffmangroup/segway	Epigenomics	8
SIRIUS SIRIUS routing skill for metabolomics and lipidomics LC-MS/MS compound identification. Use this skill when users mention SIRIUS, CSI:FingerID, CANOPUS, ZODIAC, molecular formula prediction, structure	boecker-lab/sirius	Systems Biology	8
SNPRelate Use when working with SNPRelate — a Bioconductor R package for high-performance SNP-based population genetics analysis. Supports principal component analysis (PCA), identity by descent (IBD), Fst esti	zhengxwen/SNPRelate	Other	8
Subread Subread — high-performance read alignment and quantification package using a seed-and-vote mapping strategy. Includes subread-align for genomic DNA and RNA-seq read alignment, subjunc for exon-junctio	ShiLab-Bioinformatics/subread	Transcriptomics	8
tabix tabix — fast random-access indexer for bgzip-compressed, tab-delimited genomic position files (VCF, BED, GFF3, GTF, SAM, and custom formats). Creates .tbi or .csi index files enabling sub-second regio	samtools/htslib	Utilities & Infrastructure	8
TADbit TADbit — Python toolkit for end-to-end Hi-C data analysis including read mapping, contact matrix construction, ICE normalization, TAD (Topologically Associating Domain) detection, and 3D chromatin str	3DGenomes/TADbit	Genomics	8
TaxonKit Use when working with TaxonKit — a cross-platform NCBI taxonomy toolkit for querying lineages, converting names to TaxIds, reformatting taxonomy strings, filtering by rank, computing lowest common anc	shenwei356/taxonkit	Other	8
UMI-tools UMI-tools — Python toolkit for processing Unique Molecular Identifiers (UMIs) in sequencing data. Provides end-to-end UMI workflows including extraction from FASTQ reads, cell barcode demultiplexing f	CGATOxford/UMI-tools	QC & Preprocessing	8
UnMICST Verified UnMICST (Universal Multiplexed Image Segmentation and Classification Tool) — deep-learning nuclear segmentation for multiplexed tissue imaging. Generates probability maps (background, nucleus interior	HMS-IDAC/UnMICST	Imaging	8
vcflib vcflib is a C++ library and suite of command-line tools for parsing, filtering, merging, annotating, and transforming VCF (Variant Call Format) files. Use when users need to filter variants by INFO/FO	vcflib/vcflib	Utilities & Infrastructure	8
Velvet Velvet — de Bruijn graph short-read genome assembler for Illumina, Solexa, and 454 sequencing data. Runs as a two-stage pipeline: velveth builds the hash table and de Bruijn graph, velvetg traverses t	dzerbino/velvet	Genomics	8
WhatsHap WhatsHap -- read-based phasing (haplotype assembly) of genomic variants from sequencing reads. Phases diploid and polyploid genomes using long reads (PacBio, ONT) or short reads (Illumina). Key subcom	whatshap/whatshap	Genomics	8
AICSImageIO AICSImageIO — Python library for reading and writing multi-dimensional microscopy image data. Supports CZI (Zeiss), LIF (Leica), ND2 (Nikon), OME-TIFF, TIFF, and DV formats via a unified AICSImage int	AllenCellModeling/aicsimageio	Imaging	9
AMBER AMBER (Assisted Model Building with Energy Refinement) — suite of molecular dynamics simulation programs for biomolecular systems. Provides force fields (ff14SB, ff19SB, GAFF2), system preparation (tl	conda-forge/ambertools-feedstock	Structure Prediction	10
antiSMASH antiSMASH (antibiotic & Secondary Metabolite Analysis SHell) — genome mining tool for identifying and annotating biosynthetic gene clusters (BGCs) in bacterial and fungal genomes. Detects NRPS, PKS, t	antismash/antismash	Metagenomics	10
AUCell Use when working with AUCell — an R/Python tool for scoring gene set activity in single cells using the Area Under the Curve (AUC) metric. Computes per-cell activity scores for gene sets (regulons, pa	aertslab/AUCell	Transcriptomics	10
AutoDock Vina AutoDock Vina — fast open-source molecular docking engine for predicting protein-ligand binding poses and affinities. Uses gradient-optimization conformational search with Vina, Vinardo, or AutoDock4	ccsb-scripps/AutoDock-Vina	Structure Prediction	11
BAGEL2 BAGEL2 — Bayesian Analysis of Gene Essentiality v2 for identifying essential genes from CRISPR/Cas9 knockout screens. Calculates Bayes Factors from fold-change distributions using reference sets of co	hart-lab/bagel	Systems Biology	10
bigsnpr Use when working with bigsnpr — an R package for analysis of massive	privefl/bigsnpr	Population Genetics	9
BioPython.PDB BioPython.PDB -- Python module for structural biology and macromolecular structure analysis. Parses PDB and mmCIF files, navigates the Structure-Model-Chain-Residue-Atom (SMCRA) hierarchy, computes at	biopython/biopython	Structure Prediction	9
BioRender BioRender — web-based platform for creating professional scientific figures, illustrations, and diagrams. Use for designing publication-quality figures for research papers, grants, posters, and presen	manual	Imaging	9
Boltz-1 Boltz-1 — open-source deep learning model for predicting biomolecular 3D structures and interactions, approaching AlphaFold3-level accuracy. Supports protein, DNA, RNA, and small-molecule ligand struc	jwohlwend/boltz	Structure Prediction	11
Bonito Bonito — open-source PyTorch research basecaller for Oxford Nanopore Technologies (ONT) long-read sequencing. Designed for basecaller training and model development, not production basecalling (use Do	nanoporetech/bonito	Genomics	9
BrainGlobe Atlas API Use when working with BrainGlobe Atlas API (bg-atlasapi / brainglobe-atlasapi) — a Python library for programmatic access to standardized brain atlases including the Allen Mouse Brain Atlas, Waxholm r	brainglobe/bg-atlasapi	Imaging	10
brainrender Use when working with brainrender — a Python library for 3D visualization of neuroscience data including brain atlases, neuron morphologies, cell distributions, and volumetric data. Part of the BrainG	brainglobe/brainrender	Imaging	10
BUGS/WinBUGS BUGS/WinBUGS — Bayesian inference Using Gibbs Sampling for complex hierarchical models via MCMC. Model specification in the BUGS language with stochastic (~) and deterministic (<-) nodes, 30+ distribu	manual	Statistics	11
buttery-eel buttery-eel — streaming Guppy/Dorado basecaller wrapper for Oxford Nanopore SLOW5/BLOW5 format long-read sequencing data. Streams reads from SLOW5 or BLOW5 files directly through a running Guppy serve	Psy-Fer/buttery-eel	Genomics	9
Caduceus Caduceus — bi-directional DNA foundation model built on the Mamba state-space architecture for genomic sequence modeling. Provides bi-directional long-range sequence understanding with reverse complem	kuleshov-group/caduceus	Machine Learning	9
Canopy Canopy — Bayesian phylogenetic reconstruction of tumor subclonal architecture from somatic SNVs and copy number alterations (CNAs). Uses MCMC sampling to infer cancer evolutionary trees, estimate subc	yuchaojiang/Canopy	Genomics	9
Cas-OFFinder Cas-OFFinder — OpenCL-accelerated off-target site finder for CRISPR/Cas RNA-guided endonucleases. Searches whole genomes for potential off-target sites with configurable mismatch thresholds, DNA/RNA b	snugel/cas-offinder	Systems Biology	10
CCP4 CCP4 — Collaborative Computational Project Number 4 software suite for macromolecular X-ray crystallography. Provides programs for data processing (AIMLESS, POINTLESS), molecular replacement (Phaser),	manual	Structure Prediction	10

Browse Tools

Browse Tools