SuSiE (Sum of Single Effects) — R package for Bayesian variable selection and fine-mapping of GWAS loci. Fits a sparse regression model to identify credible sets of likely causal variants with posteri

10

sva

sva — R/Bioconductor package for surrogate variable analysis, batch effect correction, and unwanted variation removal in high-throughput experiments. Provides sva() for estimating surrogate variables,

11

TALON

TALON — technology-agnostic long-read transcriptome analysis for identifying and quantifying known and novel genes/isoforms. Works with PacBio and Oxford Nanopore SAM files via a SQLite database backe

10

totalVI

totalVI — deep generative model for joint analysis of CITE-seq RNA and surface protein data built on scvi-tools and PyTorch. Provides integrated dimensionality reduction across transcriptome and prote

9

totalVI / MultiVI

totalVI and MultiVI — deep generative models from scvi-tools for multi-modal single-cell integration. totalVI jointly models scRNA-seq and protein (CITE-seq) data via a variational autoencoder for den

11

TransDecoder

TransDecoder — identifies candidate coding regions within transcript sequences from de novo RNA-Seq assemblies. Provides TransDecoder.LongOrfs for extracting long open reading frames and TransDecoder.

10

Trinity

Trinity — de novo transcriptome assembler for RNA-seq data. Reconstructs full-length transcript isoforms without a reference genome using three sequential modules: Inchworm (greedy k-mer assembly), Ch

10

Trinotate

Trinotate — comprehensive functional annotation suite for transcriptome assemblies. Integrates BLAST/DIAMOND homology searches against SwissProt, HMMER/Pfam protein domain identification, SignalP sign

10

TxDb.Hsapiens.UCSC.hg38.knownGene

Use when working with TxDb.Hsapiens.UCSC.hg38.knownGene, the Bioconductor annotation package providing UCSC hg38 transcript/gene coordinates for Homo sapiens. Handles transcript extraction, exon group

9

tximeta

tximeta — R/Bioconductor package for importing transcript-level quantification data from Salmon, alevin, piscem, and other quantifiers with automatic metadata attachment. Automatically identifies the

10

tximport

tximport -- R/Bioconductor package for importing transcript-level abundance, estimated counts, and transcript lengths from quantification tools (Salmon, Kallisto, RSEM, StringTie, Sailfish) and summar

9

UCE

UCE (Universal Cell Embeddings) — species-agnostic cell embedding model from Stanford SNAP Lab that uses protein language models (ESM-2) to create universal cell representations without species-specif

9

UCSC Bioinformatics Resources

UCSC (University of California Santa Cruz) Bioinformatics — comprehensive routing for all UCSC genomic tools and resources. Covers the UCSC Genome Browser, Table Browser, UCSC MySQL public database, G

9

VAMB

Use when working with VAMB — Variational Autoencoders for Metagenomic Binning — a deep learning tool that groups assembled metagenomic contigs into metagenome-assembled genomes (MAGs). Takes concatena

9

VarSome

Use when working with varsome — varSome — comprehensive human genomics

10

VIBRANT

VIBRANT — metagenomics tool for identifying and annotating viruses from microbial assemblies (metagenomes, isolates, and virome datasets). Uses neural networks and KEGG/Pfam/VOG HMM profiles to classi

10

Vireo

Vireo (vireoSNP) is a Bayesian variational inference tool for donor demultiplexing in pooled single-cell RNA-seq and single-cell ATAC-seq experiments. It assigns cells to donors using SNP genotype inf

10

WASP

WASP (Weight And SNP-based Pipeline) corrects allele-specific mapping bias in RNA-seq and ChIP-seq by SNP-masking reads that overlap heterozygous variants, remapping them, and filtering those that shi

9

wateRmelon

wateRmelon v2.16.0 — Bioconductor R package for Illumina 450K and EPIC DNA methylation array normalization and performance metrics. Provides 15 normalization methods (dasen, nasen, naten, danes, swan,

14

WDL/Cromwell

WDL/Cromwell — Workflow Description Language (WDL) authoring and Cromwell execution engine for reproducible bioinformatics pipelines. WDL defines portable, composable workflows using task/workflow blo

10

wf-basecalling

wf-basecalling — Oxford Nanopore Technologies (ONT) EPI2ME Nextflow workflow for production-grade basecalling using Dorado. Converts POD5 or FAST5 raw electrical signal files to BAM/FASTQ with optiona

9

wgbs_tools

wgbs_tools — command-line toolkit for whole-genome bisulfite sequencing (WGBS) analysis. Converts, indexes, and queries methylation data in efficient PAT/BETA binary formats. Provides per-CpG and per-

10

WGCNA

WGCNA (Weighted Gene Co-expression Network Analysis) — R package for constructing weighted correlation networks from high-dimensional gene expression data. Identifies co-expression modules via hierarc

10

Winnowmap

Winnowmap — long-read mapping tool optimized for repetitive reference sequences. Uses weighted minimizer sampling and minimal confidently alignable substrings for accurate ONT and PacBio read alignmen

9

Wm82 Tools

Wm82-tools covers bioinformatics workflows for the Glycine max (soybean) Williams 82 (Wm82) reference genome — the primary reference for soybean genomics. Use cases: downloading and indexing Wm82 geno

9

WNN (Weighted Nearest Neighbor)

WNN (Weighted Nearest Neighbor) — Seurat's method for multimodal single-cell data integration that computes cell-specific modality weights and combines per-modality KNN graphs. Supports any combinatio

9

ZeroCostDL4Mic

ZeroCostDL4Mic is a Google Colab-based platform providing zero-cost deep learning for fluorescence microscopy image analysis. Supports segmentation (U-Net, StarDist, Cellpose), denoising (Noise2Void,

11

Bgen

Use when working with bgen — BGEN binary genotype format for large-scale

8

Gwasvcfieugwasr

Combined gwasvcf + ieugwasr routing skill for the IEU OpenGWAS ecosystem. Use when working with OpenGWAS study IDs, GWAS-VCF downloads, local summary statistics querying, LD clumping, proxy lookup, re

8

MBASED

Use when detecting allele-specific expression (ASE) from RNA-seq data with MBASED, a Bioconductor R package using a meta-analysis-based beta-binomial model. Covers SNP-level and gene-level ASE, phased

8

Pipeline Composer

Use when working with pipeline-composer — pipeline-composer — compose

8

Protégé

Protégé is an open-source ontology editor and knowledge graph framework for creating, editing, and reasoning over OWL and RDF ontologies. Use for biomedical ontology development (Gene Ontology, SNOMED

8

Vcontact2

vConTACT2 (Viral Contig Automatic Cluster Taxonomy 2) clusters prokaryotic viral genomes and contigs by shared protein content to assign taxonomy context from reference viral genomes. Use when working

7

Virsorter2

Verified

VirSorter2 is a multi-classifier pipeline for identifying DNA and RNA viral sequences from metagenomic and genomic assemblies. It supports dsDNA phage, ssDNA, RNA, NCLDV, and lavidaviridae groups, out

13

ClinVar Database

ClinVar Database — NCBI's public archive of human genetic variant clinical significance. Query via E-utilities REST API (esearch, esummary, efetch, elink) or download bulk data from FTP in XML, VCF, a

15

GREML (GCTA)

GREML (GCTA) — genome-based restricted maximum likelihood analysis for SNP heritability estimation, genetic correlation, mixed linear model association (MLMA), and COJO conditional analysis. Use when

10

Find Alternative Tools

Find drop-in replacements or compatible alternatives for bioinformatics tools using EDAM ontology I/O compatibility matching. Searches 47,000+ tools with curated migration knowledge for 20+ common alt

8

BEAST2

BEAST2 — Bayesian Evolutionary Analysis by Sampling Trees. Cross-platform Java application for Bayesian phylogenetic inference using MCMC sampling. Use when estimating time-calibrated phylogenies, rel

10

bgzip

Use when working with bgzip — bgzip — BGZF block-compression utility

9

Docker

Verified

Docker — container platform for building, running, and distributing reproducible bioinformatics environments. Provides Dockerfile authoring, image management, container execution with volume mounts fo

10

ENA Browser Tools

ENA Browser Tools — EMBL-EBI command-line utilities for programmatic access to the European Nucleotide Archive (ENA). Download FASTQ, SRA, FASTA, EMBL, and submitted files by study, sample, experiment

8

nf-core lint

Use when validating Nextflow pipelines with nf-core/tools lint checks. Routes users to practical guidance for running `nf-core pipelines lint`, selecting targeted tests with `--key`, producing JSON/Ma

8

NiBabel

NiBabel — Python library for reading and writing neuroimaging file formats. Supports NIfTI-1, NIfTI-2, ANALYZE, MINC1, MINC2, GIFTI, FreeSurfer MGH/MGZ, CIFTI-2, PARREC, and BrainVoyager formats. Core

9

scipyoptimize

scipy.optimize — Unified interface for numerical optimization in Python. Provides local and global minimization (minimize, differential_evolution, dual_annealing), root finding (root, root_scalar), cu

13

TOBIAS

TOBIAS — Transcription factor Occupancy prediction By Investigation of ATAC-seq Signal. Python toolkit for differential transcription factor (TF) footprinting from ATAC-seq data. Corrects Tn5 insertio

9

TreeTime

TreeTime — maximum likelihood molecular clock inference, ancestral sequence reconstruction, and phylodynamic analysis from time-stamped phylogenies. Infers timetrees by iteratively optimizing ancestra

10

VCF2PCA

Verified

Use when working with vcf2pca — VCF2PCA — lightweight Python tool for

10

AlleleSeq

AlleleSeq — pipeline for detecting allele-specific expression (ASE) and allele-specific binding (ASB) from RNA-seq and ChIP-seq data at heterozygous SNP positions. Constructs a personal diploid genome

8

AMRFinderPlus

AMRFinderPlus (NCBI Antimicrobial Resistance Gene Finder Plus) — identifies antimicrobial resistance genes, virulence factors, and stress genes in bacterial genome assemblies and protein sequences. Ac

8

ANGSD

Use when working with angsd — ANGSD — Analysis of Next Generation Sequencing

8

Tool	Registry	Domain	Docs
SuSiE SuSiE (Sum of Single Effects) — R package for Bayesian variable selection and fine-mapping of GWAS loci. Fits a sparse regression model to identify credible sets of likely causal variants with posteri	stephenslab/susieR	Population Genetics	10
sva sva — R/Bioconductor package for surrogate variable analysis, batch effect correction, and unwanted variation removal in high-throughput experiments. Provides sva() for estimating surrogate variables,	jtleek/sva-devel	Transcriptomics	11
TALON TALON — technology-agnostic long-read transcriptome analysis for identifying and quantifying known and novel genes/isoforms. Works with PacBio and Oxford Nanopore SAM files via a SQLite database backe	mortazavilab/TALON	Transcriptomics	10
totalVI totalVI — deep generative model for joint analysis of CITE-seq RNA and surface protein data built on scvi-tools and PyTorch. Provides integrated dimensionality reduction across transcriptome and prote	scverse/scvi-tools	Systems Biology	9
totalVI / MultiVI totalVI and MultiVI — deep generative models from scvi-tools for multi-modal single-cell integration. totalVI jointly models scRNA-seq and protein (CITE-seq) data via a variational autoencoder for den	scverse/scvi-tools	Single-Cell	11
TransDecoder TransDecoder — identifies candidate coding regions within transcript sequences from de novo RNA-Seq assemblies. Provides TransDecoder.LongOrfs for extracting long open reading frames and TransDecoder.	TransDecoder/TransDecoder	Transcriptomics	10
Trinity Trinity — de novo transcriptome assembler for RNA-seq data. Reconstructs full-length transcript isoforms without a reference genome using three sequential modules: Inchworm (greedy k-mer assembly), Ch	trinityrnaseq/trinityrnaseq	Transcriptomics	10
Trinotate Trinotate — comprehensive functional annotation suite for transcriptome assemblies. Integrates BLAST/DIAMOND homology searches against SwissProt, HMMER/Pfam protein domain identification, SignalP sign	Trinotate/Trinotate	Transcriptomics	10
TxDb.Hsapiens.UCSC.hg38.knownGene Use when working with TxDb.Hsapiens.UCSC.hg38.knownGene, the Bioconductor annotation package providing UCSC hg38 transcript/gene coordinates for Homo sapiens. Handles transcript extraction, exon group	manual	Utilities & Infrastructure	9
tximeta tximeta — R/Bioconductor package for importing transcript-level quantification data from Salmon, alevin, piscem, and other quantifiers with automatic metadata attachment. Automatically identifies the	thelovelab/tximeta	Transcriptomics	10
tximport tximport -- R/Bioconductor package for importing transcript-level abundance, estimated counts, and transcript lengths from quantification tools (Salmon, Kallisto, RSEM, StringTie, Sailfish) and summar	thelovelab/tximport	Transcriptomics	9
UCE UCE (Universal Cell Embeddings) — species-agnostic cell embedding model from Stanford SNAP Lab that uses protein language models (ESM-2) to create universal cell representations without species-specif	snap-stanford/UCE	Transcriptomics	9
UCSC Bioinformatics Resources UCSC (University of California Santa Cruz) Bioinformatics — comprehensive routing for all UCSC genomic tools and resources. Covers the UCSC Genome Browser, Table Browser, UCSC MySQL public database, G	manual	Genomics	9
VAMB Use when working with VAMB — Variational Autoencoders for Metagenomic Binning — a deep learning tool that groups assembled metagenomic contigs into metagenome-assembled genomes (MAGs). Takes concatena	RasmussenLab/vamb	Metagenomics	9
VarSome Use when working with varsome — varSome — comprehensive human genomics	saphetor/varsome-api-client-python	Clinical Genomics	10
VIBRANT VIBRANT — metagenomics tool for identifying and annotating viruses from microbial assemblies (metagenomes, isolates, and virome datasets). Uses neural networks and KEGG/Pfam/VOG HMM profiles to classi	AnantharamanLab/VIBRANT	Metagenomics	10
Vireo Vireo (vireoSNP) is a Bayesian variational inference tool for donor demultiplexing in pooled single-cell RNA-seq and single-cell ATAC-seq experiments. It assigns cells to donors using SNP genotype inf	single-cell-genetics/vireo	Single-Cell	10
WASP WASP (Weight And SNP-based Pipeline) corrects allele-specific mapping bias in RNA-seq and ChIP-seq by SNP-masking reads that overlap heterozygous variants, remapping them, and filtering those that shi	bmvdgeijn/WASP	Population Genetics	9
wateRmelon wateRmelon v2.16.0 — Bioconductor R package for Illumina 450K and EPIC DNA methylation array normalization and performance metrics. Provides 15 normalization methods (dasen, nasen, naten, danes, swan,	schalkwyk/wateRmelon	Epigenomics	14
WDL/Cromwell WDL/Cromwell — Workflow Description Language (WDL) authoring and Cromwell execution engine for reproducible bioinformatics pipelines. WDL defines portable, composable workflows using task/workflow blo	broadinstitute/cromwell	Workflows	10
wf-basecalling wf-basecalling — Oxford Nanopore Technologies (ONT) EPI2ME Nextflow workflow for production-grade basecalling using Dorado. Converts POD5 or FAST5 raw electrical signal files to BAM/FASTQ with optiona	epi2me-labs/wf-basecalling	Genomics	9
wgbs_tools wgbs_tools — command-line toolkit for whole-genome bisulfite sequencing (WGBS) analysis. Converts, indexes, and queries methylation data in efficient PAT/BETA binary formats. Provides per-CpG and per-	nloyfer/wgbs_tools	Epigenomics	10
WGCNA WGCNA (Weighted Gene Co-expression Network Analysis) — R package for constructing weighted correlation networks from high-dimensional gene expression data. Identifies co-expression modules via hierarc	cran/WGCNA	Transcriptomics	10
Winnowmap Winnowmap — long-read mapping tool optimized for repetitive reference sequences. Uses weighted minimizer sampling and minimal confidently alignable substrings for accurate ONT and PacBio read alignmen	marbl/Winnowmap	Genomics	9
Wm82 Tools Wm82-tools covers bioinformatics workflows for the Glycine max (soybean) Williams 82 (Wm82) reference genome — the primary reference for soybean genomics. Use cases: downloading and indexing Wm82 geno	manual	Other	9
WNN (Weighted Nearest Neighbor) WNN (Weighted Nearest Neighbor) — Seurat's method for multimodal single-cell data integration that computes cell-specific modality weights and combines per-modality KNN graphs. Supports any combinatio	satijalab/seurat	Proteomics	9
ZeroCostDL4Mic ZeroCostDL4Mic is a Google Colab-based platform providing zero-cost deep learning for fluorescence microscopy image analysis. Supports segmentation (U-Net, StarDist, Cellpose), denoising (Noise2Void,	HenriquesLab/ZeroCostDL4Mic	Imaging	11
Bgen Use when working with bgen — BGEN binary genotype format for large-scale	limix/bgen	Clinical Genomics	8
Gwasvcfieugwasr Combined gwasvcf + ieugwasr routing skill for the IEU OpenGWAS ecosystem. Use when working with OpenGWAS study IDs, GWAS-VCF downloads, local summary statistics querying, LD clumping, proxy lookup, re	MRCIEU/ieugwasr	Population Genetics	8
MBASED Use when detecting allele-specific expression (ASE) from RNA-seq data with MBASED, a Bioconductor R package using a meta-analysis-based beta-binomial model. Covers SNP-level and gene-level ASE, phased	manual	Transcriptomics	8
Pipeline Composer Use when working with pipeline-composer — pipeline-composer — compose	Hordago-Labs/biocontext7	Transcriptomics	8
Protégé Protégé is an open-source ontology editor and knowledge graph framework for creating, editing, and reasoning over OWL and RDF ontologies. Use for biomedical ontology development (Gene Ontology, SNOMED	protegeproject/protege	Systems Biology	8
Vcontact2 vConTACT2 (Viral Contig Automatic Cluster Taxonomy 2) clusters prokaryotic viral genomes and contigs by shared protein content to assign taxonomy context from reference viral genomes. Use when working	Hocnonsense/vcontact2	Metagenomics	7
Virsorter2 Verified VirSorter2 is a multi-classifier pipeline for identifying DNA and RNA viral sequences from metagenomic and genomic assemblies. It supports dsDNA phage, ssDNA, RNA, NCLDV, and lavidaviridae groups, out	jiarong/VirSorter2	Metagenomics	13
ClinVar Database ClinVar Database — NCBI's public archive of human genetic variant clinical significance. Query via E-utilities REST API (esearch, esummary, efetch, elink) or download bulk data from FTP in XML, VCF, a	manual	Genomics	15
GREML (GCTA) GREML (GCTA) — genome-based restricted maximum likelihood analysis for SNP heritability estimation, genetic correlation, mixed linear model association (MLMA), and COJO conditional analysis. Use when	jianyangqt/gcta	Population Genetics	10
Find Alternative Tools Find drop-in replacements or compatible alternatives for bioinformatics tools using EDAM ontology I/O compatibility matching. Searches 47,000+ tools with curated migration knowledge for 20+ common alt	manual	Utilities & Infrastructure	8
BEAST2 BEAST2 — Bayesian Evolutionary Analysis by Sampling Trees. Cross-platform Java application for Bayesian phylogenetic inference using MCMC sampling. Use when estimating time-calibrated phylogenies, rel	CompEvol/beast2	Phylogenetics	10
bgzip Use when working with bgzip — bgzip — BGZF block-compression utility	samtools/htslib	Utilities & Infrastructure	9
Docker Verified Docker — container platform for building, running, and distributing reproducible bioinformatics environments. Provides Dockerfile authoring, image management, container execution with volume mounts fo	docker/cli	Workflows	10
ENA Browser Tools ENA Browser Tools — EMBL-EBI command-line utilities for programmatic access to the European Nucleotide Archive (ENA). Download FASTQ, SRA, FASTA, EMBL, and submitted files by study, sample, experiment	enasequence/enaBrowserTools	Utilities & Infrastructure	8
nf-core lint Use when validating Nextflow pipelines with nf-core/tools lint checks. Routes users to practical guidance for running `nf-core pipelines lint`, selecting targeted tests with `--key`, producing JSON/Ma	nf-core/tools	Workflows	8
NiBabel NiBabel — Python library for reading and writing neuroimaging file formats. Supports NIfTI-1, NIfTI-2, ANALYZE, MINC1, MINC2, GIFTI, FreeSurfer MGH/MGZ, CIFTI-2, PARREC, and BrainVoyager formats. Core	nipy/nibabel	Imaging	9
scipyoptimize scipy.optimize — Unified interface for numerical optimization in Python. Provides local and global minimization (minimize, differential_evolution, dual_annealing), root finding (root, root_scalar), cu	scipy/scipy	Statistics	13
TOBIAS TOBIAS — Transcription factor Occupancy prediction By Investigation of ATAC-seq Signal. Python toolkit for differential transcription factor (TF) footprinting from ATAC-seq data. Corrects Tn5 insertio	loosolab/TOBIAS	Epigenomics	9
TreeTime TreeTime — maximum likelihood molecular clock inference, ancestral sequence reconstruction, and phylodynamic analysis from time-stamped phylogenies. Infers timetrees by iteratively optimizing ancestra	neherlab/treetime	Phylogenetics	10
VCF2PCA Verified Use when working with vcf2pca — VCF2PCA — lightweight Python tool for	sinanshi/VCF2PCA	Population Genetics	10
AlleleSeq AlleleSeq — pipeline for detecting allele-specific expression (ASE) and allele-specific binding (ASB) from RNA-seq and ChIP-seq data at heterozygous SNP positions. Constructs a personal diploid genome	nygenome/alleleseq	Transcriptomics	8
AMRFinderPlus AMRFinderPlus (NCBI Antimicrobial Resistance Gene Finder Plus) — identifies antimicrobial resistance genes, virulence factors, and stress genes in bacterial genome assemblies and protein sequences. Ac	ncbi/amr	Metagenomics	8
ANGSD Use when working with angsd — ANGSD — Analysis of Next Generation Sequencing	ANGSD/angsd	Population Genetics	8

Tool	Registry	Domain	Docs
SuSiE SuSiE (Sum of Single Effects) — R package for Bayesian variable selection and fine-mapping of GWAS loci. Fits a sparse regression model to identify credible sets of likely causal variants with posteri	stephenslab/susieR	Population Genetics	10
sva sva — R/Bioconductor package for surrogate variable analysis, batch effect correction, and unwanted variation removal in high-throughput experiments. Provides sva() for estimating surrogate variables,	jtleek/sva-devel	Transcriptomics	11
TALON TALON — technology-agnostic long-read transcriptome analysis for identifying and quantifying known and novel genes/isoforms. Works with PacBio and Oxford Nanopore SAM files via a SQLite database backe	mortazavilab/TALON	Transcriptomics	10
totalVI totalVI — deep generative model for joint analysis of CITE-seq RNA and surface protein data built on scvi-tools and PyTorch. Provides integrated dimensionality reduction across transcriptome and prote	scverse/scvi-tools	Systems Biology	9
totalVI / MultiVI totalVI and MultiVI — deep generative models from scvi-tools for multi-modal single-cell integration. totalVI jointly models scRNA-seq and protein (CITE-seq) data via a variational autoencoder for den	scverse/scvi-tools	Single-Cell	11
TransDecoder TransDecoder — identifies candidate coding regions within transcript sequences from de novo RNA-Seq assemblies. Provides TransDecoder.LongOrfs for extracting long open reading frames and TransDecoder.	TransDecoder/TransDecoder	Transcriptomics	10
Trinity Trinity — de novo transcriptome assembler for RNA-seq data. Reconstructs full-length transcript isoforms without a reference genome using three sequential modules: Inchworm (greedy k-mer assembly), Ch	trinityrnaseq/trinityrnaseq	Transcriptomics	10
Trinotate Trinotate — comprehensive functional annotation suite for transcriptome assemblies. Integrates BLAST/DIAMOND homology searches against SwissProt, HMMER/Pfam protein domain identification, SignalP sign	Trinotate/Trinotate	Transcriptomics	10
TxDb.Hsapiens.UCSC.hg38.knownGene Use when working with TxDb.Hsapiens.UCSC.hg38.knownGene, the Bioconductor annotation package providing UCSC hg38 transcript/gene coordinates for Homo sapiens. Handles transcript extraction, exon group	manual	Utilities & Infrastructure	9
tximeta tximeta — R/Bioconductor package for importing transcript-level quantification data from Salmon, alevin, piscem, and other quantifiers with automatic metadata attachment. Automatically identifies the	thelovelab/tximeta	Transcriptomics	10
tximport tximport -- R/Bioconductor package for importing transcript-level abundance, estimated counts, and transcript lengths from quantification tools (Salmon, Kallisto, RSEM, StringTie, Sailfish) and summar	thelovelab/tximport	Transcriptomics	9
UCE UCE (Universal Cell Embeddings) — species-agnostic cell embedding model from Stanford SNAP Lab that uses protein language models (ESM-2) to create universal cell representations without species-specif	snap-stanford/UCE	Transcriptomics	9
UCSC Bioinformatics Resources UCSC (University of California Santa Cruz) Bioinformatics — comprehensive routing for all UCSC genomic tools and resources. Covers the UCSC Genome Browser, Table Browser, UCSC MySQL public database, G	manual	Genomics	9
VAMB Use when working with VAMB — Variational Autoencoders for Metagenomic Binning — a deep learning tool that groups assembled metagenomic contigs into metagenome-assembled genomes (MAGs). Takes concatena	RasmussenLab/vamb	Metagenomics	9
VarSome Use when working with varsome — varSome — comprehensive human genomics	saphetor/varsome-api-client-python	Clinical Genomics	10
VIBRANT VIBRANT — metagenomics tool for identifying and annotating viruses from microbial assemblies (metagenomes, isolates, and virome datasets). Uses neural networks and KEGG/Pfam/VOG HMM profiles to classi	AnantharamanLab/VIBRANT	Metagenomics	10
Vireo Vireo (vireoSNP) is a Bayesian variational inference tool for donor demultiplexing in pooled single-cell RNA-seq and single-cell ATAC-seq experiments. It assigns cells to donors using SNP genotype inf	single-cell-genetics/vireo	Single-Cell	10
WASP WASP (Weight And SNP-based Pipeline) corrects allele-specific mapping bias in RNA-seq and ChIP-seq by SNP-masking reads that overlap heterozygous variants, remapping them, and filtering those that shi	bmvdgeijn/WASP	Population Genetics	9
wateRmelon wateRmelon v2.16.0 — Bioconductor R package for Illumina 450K and EPIC DNA methylation array normalization and performance metrics. Provides 15 normalization methods (dasen, nasen, naten, danes, swan,	schalkwyk/wateRmelon	Epigenomics	14
WDL/Cromwell WDL/Cromwell — Workflow Description Language (WDL) authoring and Cromwell execution engine for reproducible bioinformatics pipelines. WDL defines portable, composable workflows using task/workflow blo	broadinstitute/cromwell	Workflows	10
wf-basecalling wf-basecalling — Oxford Nanopore Technologies (ONT) EPI2ME Nextflow workflow for production-grade basecalling using Dorado. Converts POD5 or FAST5 raw electrical signal files to BAM/FASTQ with optiona	epi2me-labs/wf-basecalling	Genomics	9
wgbs_tools wgbs_tools — command-line toolkit for whole-genome bisulfite sequencing (WGBS) analysis. Converts, indexes, and queries methylation data in efficient PAT/BETA binary formats. Provides per-CpG and per-	nloyfer/wgbs_tools	Epigenomics	10
WGCNA WGCNA (Weighted Gene Co-expression Network Analysis) — R package for constructing weighted correlation networks from high-dimensional gene expression data. Identifies co-expression modules via hierarc	cran/WGCNA	Transcriptomics	10
Winnowmap Winnowmap — long-read mapping tool optimized for repetitive reference sequences. Uses weighted minimizer sampling and minimal confidently alignable substrings for accurate ONT and PacBio read alignmen	marbl/Winnowmap	Genomics	9
Wm82 Tools Wm82-tools covers bioinformatics workflows for the Glycine max (soybean) Williams 82 (Wm82) reference genome — the primary reference for soybean genomics. Use cases: downloading and indexing Wm82 geno	manual	Other	9
WNN (Weighted Nearest Neighbor) WNN (Weighted Nearest Neighbor) — Seurat's method for multimodal single-cell data integration that computes cell-specific modality weights and combines per-modality KNN graphs. Supports any combinatio	satijalab/seurat	Proteomics	9
ZeroCostDL4Mic ZeroCostDL4Mic is a Google Colab-based platform providing zero-cost deep learning for fluorescence microscopy image analysis. Supports segmentation (U-Net, StarDist, Cellpose), denoising (Noise2Void,	HenriquesLab/ZeroCostDL4Mic	Imaging	11
Bgen Use when working with bgen — BGEN binary genotype format for large-scale	limix/bgen	Clinical Genomics	8
Gwasvcfieugwasr Combined gwasvcf + ieugwasr routing skill for the IEU OpenGWAS ecosystem. Use when working with OpenGWAS study IDs, GWAS-VCF downloads, local summary statistics querying, LD clumping, proxy lookup, re	MRCIEU/ieugwasr	Population Genetics	8
MBASED Use when detecting allele-specific expression (ASE) from RNA-seq data with MBASED, a Bioconductor R package using a meta-analysis-based beta-binomial model. Covers SNP-level and gene-level ASE, phased	manual	Transcriptomics	8
Pipeline Composer Use when working with pipeline-composer — pipeline-composer — compose	Hordago-Labs/biocontext7	Transcriptomics	8
Protégé Protégé is an open-source ontology editor and knowledge graph framework for creating, editing, and reasoning over OWL and RDF ontologies. Use for biomedical ontology development (Gene Ontology, SNOMED	protegeproject/protege	Systems Biology	8
Vcontact2 vConTACT2 (Viral Contig Automatic Cluster Taxonomy 2) clusters prokaryotic viral genomes and contigs by shared protein content to assign taxonomy context from reference viral genomes. Use when working	Hocnonsense/vcontact2	Metagenomics	7
Virsorter2 Verified VirSorter2 is a multi-classifier pipeline for identifying DNA and RNA viral sequences from metagenomic and genomic assemblies. It supports dsDNA phage, ssDNA, RNA, NCLDV, and lavidaviridae groups, out	jiarong/VirSorter2	Metagenomics	13
ClinVar Database ClinVar Database — NCBI's public archive of human genetic variant clinical significance. Query via E-utilities REST API (esearch, esummary, efetch, elink) or download bulk data from FTP in XML, VCF, a	manual	Genomics	15
GREML (GCTA) GREML (GCTA) — genome-based restricted maximum likelihood analysis for SNP heritability estimation, genetic correlation, mixed linear model association (MLMA), and COJO conditional analysis. Use when	jianyangqt/gcta	Population Genetics	10
Find Alternative Tools Find drop-in replacements or compatible alternatives for bioinformatics tools using EDAM ontology I/O compatibility matching. Searches 47,000+ tools with curated migration knowledge for 20+ common alt	manual	Utilities & Infrastructure	8
BEAST2 BEAST2 — Bayesian Evolutionary Analysis by Sampling Trees. Cross-platform Java application for Bayesian phylogenetic inference using MCMC sampling. Use when estimating time-calibrated phylogenies, rel	CompEvol/beast2	Phylogenetics	10
bgzip Use when working with bgzip — bgzip — BGZF block-compression utility	samtools/htslib	Utilities & Infrastructure	9
Docker Verified Docker — container platform for building, running, and distributing reproducible bioinformatics environments. Provides Dockerfile authoring, image management, container execution with volume mounts fo	docker/cli	Workflows	10
ENA Browser Tools ENA Browser Tools — EMBL-EBI command-line utilities for programmatic access to the European Nucleotide Archive (ENA). Download FASTQ, SRA, FASTA, EMBL, and submitted files by study, sample, experiment	enasequence/enaBrowserTools	Utilities & Infrastructure	8
nf-core lint Use when validating Nextflow pipelines with nf-core/tools lint checks. Routes users to practical guidance for running `nf-core pipelines lint`, selecting targeted tests with `--key`, producing JSON/Ma	nf-core/tools	Workflows	8
NiBabel NiBabel — Python library for reading and writing neuroimaging file formats. Supports NIfTI-1, NIfTI-2, ANALYZE, MINC1, MINC2, GIFTI, FreeSurfer MGH/MGZ, CIFTI-2, PARREC, and BrainVoyager formats. Core	nipy/nibabel	Imaging	9
scipyoptimize scipy.optimize — Unified interface for numerical optimization in Python. Provides local and global minimization (minimize, differential_evolution, dual_annealing), root finding (root, root_scalar), cu	scipy/scipy	Statistics	13
TOBIAS TOBIAS — Transcription factor Occupancy prediction By Investigation of ATAC-seq Signal. Python toolkit for differential transcription factor (TF) footprinting from ATAC-seq data. Corrects Tn5 insertio	loosolab/TOBIAS	Epigenomics	9
TreeTime TreeTime — maximum likelihood molecular clock inference, ancestral sequence reconstruction, and phylodynamic analysis from time-stamped phylogenies. Infers timetrees by iteratively optimizing ancestra	neherlab/treetime	Phylogenetics	10
VCF2PCA Verified Use when working with vcf2pca — VCF2PCA — lightweight Python tool for	sinanshi/VCF2PCA	Population Genetics	10
AlleleSeq AlleleSeq — pipeline for detecting allele-specific expression (ASE) and allele-specific binding (ASB) from RNA-seq and ChIP-seq data at heterozygous SNP positions. Constructs a personal diploid genome	nygenome/alleleseq	Transcriptomics	8
AMRFinderPlus AMRFinderPlus (NCBI Antimicrobial Resistance Gene Finder Plus) — identifies antimicrobial resistance genes, virulence factors, and stress genes in bacterial genome assemblies and protein sequences. Ac	ncbi/amr	Metagenomics	8
ANGSD Use when working with angsd — ANGSD — Analysis of Next Generation Sequencing	ANGSD/angsd	Population Genetics	8

Browse Tools

Browse Tools