BAMtools — C++ API and command-line toolkit for reading, writing, sorting, indexing, filtering, merging, splitting, and converting BAM alignment files. Provides JSON-based filter expressions, multi-fo

10

bamutil

bamutil — C++ toolkit for reading, writing, and manipulating BAM alignment files. Provides duplicate marking (dedup), base quality score recalibration (recab), overlapping read clipping (clipOverlap),

10

Bandage

Bandage (a Bioinformatics Application for Navigating De novo Assembly Graphs Easily) — interactive visualization tool for de novo assembly graphs in GFA and FASTG formats. Visualize genome assembly gr

10

BBDuk

BBDuk (Decontamination Using Kmers) — high-performance Java tool from the BBTools suite for adapter trimming, quality trimming, contaminant filtering, sequence masking, GC filtering, and format conver

11

BBKNN -- Batch Balanced K Nearest Neighbours

BBKNN (Batch Balanced K Nearest Neighbours) -- lightweight Python batch correction method for single-cell RNA-seq that operates on the k-nearest neighbor graph rather than the expression matrix. Repla

9

BBMap / BBTools

BBMap/BBTools — suite of 265+ fast, multithreaded Java-based bioinformatics tools for DNA/RNA sequence analysis. Includes BBMap short read aligner, BBDuk adapter trimmer and quality filter, BBMerge pa

10

bcl2fastq

Use when converting Illumina BCL (Binary Base Call) files to FASTQ format, demultiplexing sequencing runs by index sequences, or trimming adapters from short-read data. Handles HiSeq, MiSeq, NextSeq,

10

BEDOPS

BEDOPS — high-performance genomic interval arithmetic suite for BED, VCF, GFF, GTF, BAM, and SAM files. Provides set operations (intersection, union, difference, complement, symmetric difference) via

9

bedToBigBed

bedToBigBed — UCSC Kent utility that converts BED/BED+ text files into indexed binary BigBed (.bb) format for fast random-access genome browser tracks. Supports BED3 through BED12, BED+N extra-field e

11

Bifrost

Use when building, querying, or manipulating colored compacted de Bruijn graphs (ccDBG) with Bifrost for genome assembly, metagenomics, variant detection, or k-mer analysis. Covers graph construction

9

bigWigMerge

bigWigMerge merges multiple bigWig signal tracks into a single bedGraph output by summing (or optionally taking the maximum of) per-base signal values. Essential for combining ChIP-seq, ATAC-seq, RNA-

9

biobambam2

biobambam2 — C++ toolkit for early-stage BAM file processing built on libmaus2. Provides bamsormadup for parallel sorting with duplicate marking, bamcollate2 for name-collation, bammarkduplicates for

11

BLASR

BLASR (Basic Local Alignment with Successive Refinement) — PacBio long-read aligner for mapping SMRT sequencing reads to reference genomes. Uses suffix array indexing and global chaining for seed-and-

10

Bowtie2

Bowtie2 — ultrafast and memory-efficient short-read aligner using FM-index with gapped alignment and local alignment modes. Standard aligner for ChIP-seq, ATAC-seq, CUT&RUN, CUT&Tag, and general short

11

BRAKER

BRAKER — fully automated gene structure annotation pipeline combining GeneMark-ES/ET/EP/ETP with AUGUSTUS training. Supports RNA-Seq evidence (BRAKER1), protein evidence (BRAKER2), and combined RNA-Se

10

BSseeker2

Use when working with bsseeker2 — BSseeker2 — bisulfite sequencing aligner

10

BUSCO

BUSCO (Benchmarking Universal Single-Copy Orthologs) — assess genome assembly, transcriptome, and proteome completeness using lineage-specific single-copy orthologs from the OrthoDB database. Reports

10

BUStools

BUStools — C toolkit for manipulating BUS (Barcode, UMI, Set) files from single-cell RNA-seq pseudoalignment. Provides sorting, barcode error correction, UMI counting, matrix generation, quality inspe

10

BWA-MEM2

BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace

12

BWA-MEM2

BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace

9

Canu

Canu — long-read de novo genome assembler for PacBio HiFi, PacBio CLR, and Oxford Nanopore reads. Performs read correction, trimming, and overlap-layout-consensus assembly in three stages. Supports di

10

CD-HIT

CD-HIT — fast sequence clustering and redundancy reduction for protein and nucleotide sequences. Clusters FASTA sequences at a user-defined identity threshold to remove redundancy from databases. Prov

10

ChAMP

Verified

ChAMP — Chip Analysis Methylation Pipeline for Illumina 450K and EPIC arrays. Integrated R/Bioconductor pipeline for DNA methylation analysis covering IDAT loading, quality control, normalization (BMI

16

Chromosight

Chromosight — Python computer-vision tool for detecting chromatin loops, borders (TAD boundaries), hairpins, and other patterns in Hi-C contact maps using template matching with Pearson correlation. R

10

chromVAR

chromVAR — R/Bioconductor package for determining variation in chromatin accessibility across sets of annotations or peaks. Designed for single-cell ATAC-seq (scATAC-seq) and sparse bulk ATAC-seq data

11

Circos

Circos — Perl-based circular visualization tool for genomics data. Creates publication-quality circular plots for genome comparisons, synteny, GC content, gene density, chromosome ideograms, link/ribb

9

CNVkit -- Copy Number Variant Detection

CNVkit -- Python toolkit for detecting copy number variants (CNVs) from targeted DNA sequencing data (hybrid capture, amplicon) and whole-genome sequencing (WGS). Leverages both on-target and off-targ

9

CONCOCT

CONCOCT — metagenomic binning tool that clusters contigs into genomes using sequence composition (k-mer frequencies) and coverage across multiple samples. Combines PCA dimensionality reduction with Ga

9

cooler

cooler — file format and Python library for Hi-C chromatin contact data. Stores sparse contact matrices in .cool (single-resolution) and .mcool (multi-resolution) HDF5 files. Supports ingestion from p

9

Cramino

Use when working with cramino — a fast QC tool for long-read alignment files (CRAM and BAM). Generates alignment statistics including N50, median length, mean quality, percent aligned, and yield from

10

csvtk

Use when working with csvtk — csvtk — cross-platform, ultrafast CSV/TSV

10

Cufflinks

Cufflinks — RNA-seq transcript assembly, quantification, and differential expression suite. Assembles aligned reads into transcripts (cufflinks), performs differential expression analysis at transcrip

10

DADA2

DADA2 — high-resolution amplicon sequence variant (ASV) inference from Illumina, 454, and Ion Torrent amplicon sequencing data. R/Bioconductor package that models sequencing errors to resolve exact bi

11

DAS Tool

DAS Tool — genome-resolved metagenomics bin refinement tool that integrates results from multiple binning algorithms to produce an optimized, non-redundant set of metagenome-assembled genomes (MAGs).

10

datamash

Use when working with datamash — datamash — GNU command-line tool for

10

DeepVariant

DeepVariant — CNN-based variant caller that converts aligned reads into pileup image tensors and classifies them with a deep neural network to produce SNP and indel calls in VCF/gVCF format. Supports

11

dnaPipeTE

Verified

dnaPipeTE (do not assemble - PipelinE for Transposable Elements) is a bioinformatics pipeline for estimating the proportion and landscape of transposable elements (TEs) and other repeats in a genome f

9

DRAGMAP

DRAGMAP — Illumina's open-source hash-table-based DNA short-read aligner, implementing the DRAGEN mapping algorithm. Produces SAM/BAM output from FASTQ inputs against a pre-built hash table index. Use

10

DRAM-v

DRAM-v (Distilled and Refined Annotation of Metabolism — Viral) — annotate viral genomes and metagenome-assembled phage sequences for auxiliary metabolic genes (AMGs) and viral hallmarks. Produces ann

10

Drimseq

DRIMSeq is a Bioconductor R package for Differential Transcript Usage (DTU) analysis using a Dirichlet-multinomial distribution to model transcript-level count proportions. Tests whether relative tran

10

Dysgu

Dysgu — structural variant caller for paired-end and long-read sequencing data (Illumina, PacBio HiFi, Oxford Nanopore). Detects deletions, insertions, duplications, inversions, and translocations fro

10

EAGLE

Use when detecting allele-specific expression (ASE), allelic imbalance, or phased allele counts from RNA-seq data with EAGLE (Empirical Analysis of Allele-Specific Gene Expression). Covers phased VCF

9

Eeglab

Use when processing EEG or MEG data with EEGLAB (MATLAB toolbox). Covers loading raw data, filtering, re-referencing, epoching, ICA, artifact rejection, channel interpolation, time-frequency analysis,

10

EIGENSOFT smartpca -- Principal Component Analysis for Population Genetics

EIGENSOFT smartpca -- C/C++ tool for principal component analysis of genome-wide SNP genotype data. Computes eigenvectors and eigenvalues for population structure analysis, ancestry inference, stratif

9

EMU

EMU (Exact Mapping-based Unambiguous) — species-level 16S rRNA taxonomic profiling for long-read sequencing data (Oxford Nanopore, PacBio). Uses an expectation-maximization algorithm to resolve multi-

10

FACETS

FACETS (Fraction and Allele-Specific Copy number Estimates from Tumor/normal Sequencing) is an R package for allele-specific somatic copy number analysis of matched tumor-normal sequencing data. It es

10

Fastq Screen

Fastq Screen — Perl tool for screening FASTQ sequencing reads against multiple reference genomes to detect sample contamination. Maps reads to user-defined genome databases (human, mouse, E. coli, Phi

9

FastQC

FastQC — universal quality control tool for high-throughput sequencing data. Performs modular analysis of FASTQ and BAM files, generating per-base quality scores, GC content distribution, adapter cont

11

featureCounts

featureCounts — ultrafast read counting program for assigning aligned reads (SAM/BAM) to genomic features such as genes, exons, promoters, and genomic bins. Part of the Subread package. Supports singl

11

fgbio

fgbio — command-line toolkit for working with genomic and next-generation sequencing data, specializing in UMI (Unique Molecular Identifier) processing, consensus read calling, duplex sequencing, and

11

Tool	Registry	Domain	Docs
BAMtools BAMtools — C++ API and command-line toolkit for reading, writing, sorting, indexing, filtering, merging, splitting, and converting BAM alignment files. Provides JSON-based filter expressions, multi-fo	pezmaster31/bamtools	Genomics	10
bamutil bamutil — C++ toolkit for reading, writing, and manipulating BAM alignment files. Provides duplicate marking (dedup), base quality score recalibration (recab), overlapping read clipping (clipOverlap),	statgen/bamUtil	Utilities & Infrastructure	10
Bandage Bandage (a Bioinformatics Application for Navigating De novo Assembly Graphs Easily) — interactive visualization tool for de novo assembly graphs in GFA and FASTG formats. Visualize genome assembly gr	rrwick/Bandage	Genomics	10
BBDuk BBDuk (Decontamination Using Kmers) — high-performance Java tool from the BBTools suite for adapter trimming, quality trimming, contaminant filtering, sequence masking, GC filtering, and format conver	BioInfoTools/BBMap	QC & Preprocessing	11
BBKNN -- Batch Balanced K Nearest Neighbours BBKNN (Batch Balanced K Nearest Neighbours) -- lightweight Python batch correction method for single-cell RNA-seq that operates on the k-nearest neighbor graph rather than the expression matrix. Repla	Teichlab/bbknn	Transcriptomics	9
BBMap / BBTools BBMap/BBTools — suite of 265+ fast, multithreaded Java-based bioinformatics tools for DNA/RNA sequence analysis. Includes BBMap short read aligner, BBDuk adapter trimmer and quality filter, BBMerge pa	bbushnell/BBTools	Genomics	10
bcl2fastq Use when converting Illumina BCL (Binary Base Call) files to FASTQ format, demultiplexing sequencing runs by index sequences, or trimming adapters from short-read data. Handles HiSeq, MiSeq, NextSeq,	manual	Workflows	10
BEDOPS BEDOPS — high-performance genomic interval arithmetic suite for BED, VCF, GFF, GTF, BAM, and SAM files. Provides set operations (intersection, union, difference, complement, symmetric difference) via	bedops/bedops	Utilities & Infrastructure	9
bedToBigBed bedToBigBed — UCSC Kent utility that converts BED/BED+ text files into indexed binary BigBed (.bb) format for fast random-access genome browser tracks. Supports BED3 through BED12, BED+N extra-field e	ucscGenomeBrowser/kent	Genomics	11
Bifrost Use when building, querying, or manipulating colored compacted de Bruijn graphs (ccDBG) with Bifrost for genome assembly, metagenomics, variant detection, or k-mer analysis. Covers graph construction	pmelsted/bifrost	Genomics	9
bigWigMerge bigWigMerge merges multiple bigWig signal tracks into a single bedGraph output by summing (or optionally taking the maximum of) per-base signal values. Essential for combining ChIP-seq, ATAC-seq, RNA-	ucscGenomeBrowser/kent	Genomics	9
biobambam2 biobambam2 — C++ toolkit for early-stage BAM file processing built on libmaus2. Provides bamsormadup for parallel sorting with duplicate marking, bamcollate2 for name-collation, bammarkduplicates for	gt1/biobambam2	Genomics	11
BLASR BLASR (Basic Local Alignment with Successive Refinement) — PacBio long-read aligner for mapping SMRT sequencing reads to reference genomes. Uses suffix array indexing and global chaining for seed-and-	PacificBiosciences/blasr	Genomics	10
Bowtie2 Bowtie2 — ultrafast and memory-efficient short-read aligner using FM-index with gapped alignment and local alignment modes. Standard aligner for ChIP-seq, ATAC-seq, CUT&RUN, CUT&Tag, and general short	BenLangmead/bowtie2	Genomics	11
BRAKER BRAKER — fully automated gene structure annotation pipeline combining GeneMark-ES/ET/EP/ETP with AUGUSTUS training. Supports RNA-Seq evidence (BRAKER1), protein evidence (BRAKER2), and combined RNA-Se	Gaius-Augustus/BRAKER	Genomics	10
BSseeker2 Use when working with bsseeker2 — BSseeker2 — bisulfite sequencing aligner	BSSeeker/BSseeker2	Epigenomics	10
BUSCO BUSCO (Benchmarking Universal Single-Copy Orthologs) — assess genome assembly, transcriptome, and proteome completeness using lineage-specific single-copy orthologs from the OrthoDB database. Reports	WenchaoLin/BUSCO	Genomics	10
BUStools BUStools — C toolkit for manipulating BUS (Barcode, UMI, Set) files from single-cell RNA-seq pseudoalignment. Provides sorting, barcode error correction, UMI counting, matrix generation, quality inspe	BUStools/bustools	Single-Cell	10
BWA-MEM2 BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace	bwa-mem2/bwa-mem2	Genomics	12
BWA-MEM2 BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace	bwa-mem2/bwa-mem2	Genomics	9
Canu Canu — long-read de novo genome assembler for PacBio HiFi, PacBio CLR, and Oxford Nanopore reads. Performs read correction, trimming, and overlap-layout-consensus assembly in three stages. Supports di	marbl/canu	Genomics	10
CD-HIT CD-HIT — fast sequence clustering and redundancy reduction for protein and nucleotide sequences. Clusters FASTA sequences at a user-defined identity threshold to remove redundancy from databases. Prov	weizhongli/cdhit	Phylogenetics	10
ChAMP Verified ChAMP — Chip Analysis Methylation Pipeline for Illumina 450K and EPIC arrays. Integrated R/Bioconductor pipeline for DNA methylation analysis covering IDAT loading, quality control, normalization (BMI	YuanTian1991/ChAMP	Epigenomics	16
Chromosight Chromosight — Python computer-vision tool for detecting chromatin loops, borders (TAD boundaries), hairpins, and other patterns in Hi-C contact maps using template matching with Pearson correlation. R	koszullab/chromosight	Genomics	10
chromVAR chromVAR — R/Bioconductor package for determining variation in chromatin accessibility across sets of annotations or peaks. Designed for single-cell ATAC-seq (scATAC-seq) and sparse bulk ATAC-seq data	GreenleafLab/chromVAR	Single-Cell	11
Circos Circos — Perl-based circular visualization tool for genomics data. Creates publication-quality circular plots for genome comparisons, synteny, GC content, gene density, chromosome ideograms, link/ribb	vigsterkr/circos	Imaging	9
CNVkit -- Copy Number Variant Detection CNVkit -- Python toolkit for detecting copy number variants (CNVs) from targeted DNA sequencing data (hybrid capture, amplicon) and whole-genome sequencing (WGS). Leverages both on-target and off-targ	etal/cnvkit	Genomics	9
CONCOCT CONCOCT — metagenomic binning tool that clusters contigs into genomes using sequence composition (k-mer frequencies) and coverage across multiple samples. Combines PCA dimensionality reduction with Ga	BinPro/CONCOCT	Metagenomics	9
cooler cooler — file format and Python library for Hi-C chromatin contact data. Stores sparse contact matrices in .cool (single-resolution) and .mcool (multi-resolution) HDF5 files. Supports ingestion from p	open2c/cooler	Genomics	9
Cramino Use when working with cramino — a fast QC tool for long-read alignment files (CRAM and BAM). Generates alignment statistics including N50, median length, mean quality, percent aligned, and yield from	wdecoster/cramino	QC & Preprocessing	10
csvtk Use when working with csvtk — csvtk — cross-platform, ultrafast CSV/TSV	shenwei356/csvtk	Utilities & Infrastructure	10
Cufflinks Cufflinks — RNA-seq transcript assembly, quantification, and differential expression suite. Assembles aligned reads into transcripts (cufflinks), performs differential expression analysis at transcrip	cole-trapnell-lab/cufflinks	Transcriptomics	10
DADA2 DADA2 — high-resolution amplicon sequence variant (ASV) inference from Illumina, 454, and Ion Torrent amplicon sequencing data. R/Bioconductor package that models sequencing errors to resolve exact bi	benjjneb/dada2	Metagenomics	11
DAS Tool DAS Tool — genome-resolved metagenomics bin refinement tool that integrates results from multiple binning algorithms to produce an optimized, non-redundant set of metagenome-assembled genomes (MAGs).	cmks/DAS_Tool	Metagenomics	10
datamash Use when working with datamash — datamash — GNU command-line tool for	manual	Utilities & Infrastructure	10
DeepVariant DeepVariant — CNN-based variant caller that converts aligned reads into pileup image tensors and classifies them with a deep neural network to produce SNP and indel calls in VCF/gVCF format. Supports	google/deepvariant	Genomics	11
dnaPipeTE Verified dnaPipeTE (do not assemble - PipelinE for Transposable Elements) is a bioinformatics pipeline for estimating the proportion and landscape of transposable elements (TEs) and other repeats in a genome f	clemgoub/dnaPipeTE	Genomics	9
DRAGMAP DRAGMAP — Illumina's open-source hash-table-based DNA short-read aligner, implementing the DRAGEN mapping algorithm. Produces SAM/BAM output from FASTQ inputs against a pre-built hash table index. Use	Illumina/DRAGMAP	Genomics	10
DRAM-v DRAM-v (Distilled and Refined Annotation of Metabolism — Viral) — annotate viral genomes and metagenome-assembled phage sequences for auxiliary metabolic genes (AMGs) and viral hallmarks. Produces ann	WrightonLabCSU/DRAM	Metagenomics	10
Drimseq DRIMSeq is a Bioconductor R package for Differential Transcript Usage (DTU) analysis using a Dirichlet-multinomial distribution to model transcript-level count proportions. Tests whether relative tran	gosianow/DRIMSeq	Transcriptomics	10
Dysgu Dysgu — structural variant caller for paired-end and long-read sequencing data (Illumina, PacBio HiFi, Oxford Nanopore). Detects deletions, insertions, duplications, inversions, and translocations fro	kcleal/dysgu	Genomics	10
EAGLE Use when detecting allele-specific expression (ASE), allelic imbalance, or phased allele counts from RNA-seq data with EAGLE (Empirical Analysis of Allele-Specific Gene Expression). Covers phased VCF	tonyd256/eagle	Transcriptomics	9
Eeglab Use when processing EEG or MEG data with EEGLAB (MATLAB toolbox). Covers loading raw data, filtering, re-referencing, epoching, ICA, artifact rejection, channel interpolation, time-frequency analysis,	sccn/eeglab	Systems Biology	10
EIGENSOFT smartpca -- Principal Component Analysis for Population Genetics EIGENSOFT smartpca -- C/C++ tool for principal component analysis of genome-wide SNP genotype data. Computes eigenvectors and eigenvalues for population structure analysis, ancestry inference, stratif	chrchang/eigensoft	Genomics	9
EMU EMU (Exact Mapping-based Unambiguous) — species-level 16S rRNA taxonomic profiling for long-read sequencing data (Oxford Nanopore, PacBio). Uses an expectation-maximization algorithm to resolve multi-	treangenlab/emu	Metagenomics	10
FACETS FACETS (Fraction and Allele-Specific Copy number Estimates from Tumor/normal Sequencing) is an R package for allele-specific somatic copy number analysis of matched tumor-normal sequencing data. It es	mskcc/facets	Clinical Genomics	10
Fastq Screen Fastq Screen — Perl tool for screening FASTQ sequencing reads against multiple reference genomes to detect sample contamination. Maps reads to user-defined genome databases (human, mouse, E. coli, Phi	StevenWingworthy/FastQ-Screen	QC & Preprocessing	9
FastQC FastQC — universal quality control tool for high-throughput sequencing data. Performs modular analysis of FASTQ and BAM files, generating per-base quality scores, GC content distribution, adapter cont	s-andrews/FastQC	QC & Preprocessing	11
featureCounts featureCounts — ultrafast read counting program for assigning aligned reads (SAM/BAM) to genomic features such as genes, exons, promoters, and genomic bins. Part of the Subread package. Supports singl	manual	Genomics	11
fgbio fgbio — command-line toolkit for working with genomic and next-generation sequencing data, specializing in UMI (Unique Molecular Identifier) processing, consensus read calling, duplex sequencing, and	fulcrumgenomics/fgbio	QC & Preprocessing	11

Tool	Registry	Domain	Docs
BAMtools BAMtools — C++ API and command-line toolkit for reading, writing, sorting, indexing, filtering, merging, splitting, and converting BAM alignment files. Provides JSON-based filter expressions, multi-fo	pezmaster31/bamtools	Genomics	10
bamutil bamutil — C++ toolkit for reading, writing, and manipulating BAM alignment files. Provides duplicate marking (dedup), base quality score recalibration (recab), overlapping read clipping (clipOverlap),	statgen/bamUtil	Utilities & Infrastructure	10
Bandage Bandage (a Bioinformatics Application for Navigating De novo Assembly Graphs Easily) — interactive visualization tool for de novo assembly graphs in GFA and FASTG formats. Visualize genome assembly gr	rrwick/Bandage	Genomics	10
BBDuk BBDuk (Decontamination Using Kmers) — high-performance Java tool from the BBTools suite for adapter trimming, quality trimming, contaminant filtering, sequence masking, GC filtering, and format conver	BioInfoTools/BBMap	QC & Preprocessing	11
BBKNN -- Batch Balanced K Nearest Neighbours BBKNN (Batch Balanced K Nearest Neighbours) -- lightweight Python batch correction method for single-cell RNA-seq that operates on the k-nearest neighbor graph rather than the expression matrix. Repla	Teichlab/bbknn	Transcriptomics	9
BBMap / BBTools BBMap/BBTools — suite of 265+ fast, multithreaded Java-based bioinformatics tools for DNA/RNA sequence analysis. Includes BBMap short read aligner, BBDuk adapter trimmer and quality filter, BBMerge pa	bbushnell/BBTools	Genomics	10
bcl2fastq Use when converting Illumina BCL (Binary Base Call) files to FASTQ format, demultiplexing sequencing runs by index sequences, or trimming adapters from short-read data. Handles HiSeq, MiSeq, NextSeq,	manual	Workflows	10
BEDOPS BEDOPS — high-performance genomic interval arithmetic suite for BED, VCF, GFF, GTF, BAM, and SAM files. Provides set operations (intersection, union, difference, complement, symmetric difference) via	bedops/bedops	Utilities & Infrastructure	9
bedToBigBed bedToBigBed — UCSC Kent utility that converts BED/BED+ text files into indexed binary BigBed (.bb) format for fast random-access genome browser tracks. Supports BED3 through BED12, BED+N extra-field e	ucscGenomeBrowser/kent	Genomics	11
Bifrost Use when building, querying, or manipulating colored compacted de Bruijn graphs (ccDBG) with Bifrost for genome assembly, metagenomics, variant detection, or k-mer analysis. Covers graph construction	pmelsted/bifrost	Genomics	9
bigWigMerge bigWigMerge merges multiple bigWig signal tracks into a single bedGraph output by summing (or optionally taking the maximum of) per-base signal values. Essential for combining ChIP-seq, ATAC-seq, RNA-	ucscGenomeBrowser/kent	Genomics	9
biobambam2 biobambam2 — C++ toolkit for early-stage BAM file processing built on libmaus2. Provides bamsormadup for parallel sorting with duplicate marking, bamcollate2 for name-collation, bammarkduplicates for	gt1/biobambam2	Genomics	11
BLASR BLASR (Basic Local Alignment with Successive Refinement) — PacBio long-read aligner for mapping SMRT sequencing reads to reference genomes. Uses suffix array indexing and global chaining for seed-and-	PacificBiosciences/blasr	Genomics	10
Bowtie2 Bowtie2 — ultrafast and memory-efficient short-read aligner using FM-index with gapped alignment and local alignment modes. Standard aligner for ChIP-seq, ATAC-seq, CUT&RUN, CUT&Tag, and general short	BenLangmead/bowtie2	Genomics	11
BRAKER BRAKER — fully automated gene structure annotation pipeline combining GeneMark-ES/ET/EP/ETP with AUGUSTUS training. Supports RNA-Seq evidence (BRAKER1), protein evidence (BRAKER2), and combined RNA-Se	Gaius-Augustus/BRAKER	Genomics	10
BSseeker2 Use when working with bsseeker2 — BSseeker2 — bisulfite sequencing aligner	BSSeeker/BSseeker2	Epigenomics	10
BUSCO BUSCO (Benchmarking Universal Single-Copy Orthologs) — assess genome assembly, transcriptome, and proteome completeness using lineage-specific single-copy orthologs from the OrthoDB database. Reports	WenchaoLin/BUSCO	Genomics	10
BUStools BUStools — C toolkit for manipulating BUS (Barcode, UMI, Set) files from single-cell RNA-seq pseudoalignment. Provides sorting, barcode error correction, UMI counting, matrix generation, quality inspe	BUStools/bustools	Single-Cell	10
BWA-MEM2 BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace	bwa-mem2/bwa-mem2	Genomics	12
BWA-MEM2 BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace	bwa-mem2/bwa-mem2	Genomics	9
Canu Canu — long-read de novo genome assembler for PacBio HiFi, PacBio CLR, and Oxford Nanopore reads. Performs read correction, trimming, and overlap-layout-consensus assembly in three stages. Supports di	marbl/canu	Genomics	10
CD-HIT CD-HIT — fast sequence clustering and redundancy reduction for protein and nucleotide sequences. Clusters FASTA sequences at a user-defined identity threshold to remove redundancy from databases. Prov	weizhongli/cdhit	Phylogenetics	10
ChAMP Verified ChAMP — Chip Analysis Methylation Pipeline for Illumina 450K and EPIC arrays. Integrated R/Bioconductor pipeline for DNA methylation analysis covering IDAT loading, quality control, normalization (BMI	YuanTian1991/ChAMP	Epigenomics	16
Chromosight Chromosight — Python computer-vision tool for detecting chromatin loops, borders (TAD boundaries), hairpins, and other patterns in Hi-C contact maps using template matching with Pearson correlation. R	koszullab/chromosight	Genomics	10
chromVAR chromVAR — R/Bioconductor package for determining variation in chromatin accessibility across sets of annotations or peaks. Designed for single-cell ATAC-seq (scATAC-seq) and sparse bulk ATAC-seq data	GreenleafLab/chromVAR	Single-Cell	11
Circos Circos — Perl-based circular visualization tool for genomics data. Creates publication-quality circular plots for genome comparisons, synteny, GC content, gene density, chromosome ideograms, link/ribb	vigsterkr/circos	Imaging	9
CNVkit -- Copy Number Variant Detection CNVkit -- Python toolkit for detecting copy number variants (CNVs) from targeted DNA sequencing data (hybrid capture, amplicon) and whole-genome sequencing (WGS). Leverages both on-target and off-targ	etal/cnvkit	Genomics	9
CONCOCT CONCOCT — metagenomic binning tool that clusters contigs into genomes using sequence composition (k-mer frequencies) and coverage across multiple samples. Combines PCA dimensionality reduction with Ga	BinPro/CONCOCT	Metagenomics	9
cooler cooler — file format and Python library for Hi-C chromatin contact data. Stores sparse contact matrices in .cool (single-resolution) and .mcool (multi-resolution) HDF5 files. Supports ingestion from p	open2c/cooler	Genomics	9
Cramino Use when working with cramino — a fast QC tool for long-read alignment files (CRAM and BAM). Generates alignment statistics including N50, median length, mean quality, percent aligned, and yield from	wdecoster/cramino	QC & Preprocessing	10
csvtk Use when working with csvtk — csvtk — cross-platform, ultrafast CSV/TSV	shenwei356/csvtk	Utilities & Infrastructure	10
Cufflinks Cufflinks — RNA-seq transcript assembly, quantification, and differential expression suite. Assembles aligned reads into transcripts (cufflinks), performs differential expression analysis at transcrip	cole-trapnell-lab/cufflinks	Transcriptomics	10
DADA2 DADA2 — high-resolution amplicon sequence variant (ASV) inference from Illumina, 454, and Ion Torrent amplicon sequencing data. R/Bioconductor package that models sequencing errors to resolve exact bi	benjjneb/dada2	Metagenomics	11
DAS Tool DAS Tool — genome-resolved metagenomics bin refinement tool that integrates results from multiple binning algorithms to produce an optimized, non-redundant set of metagenome-assembled genomes (MAGs).	cmks/DAS_Tool	Metagenomics	10
datamash Use when working with datamash — datamash — GNU command-line tool for	manual	Utilities & Infrastructure	10
DeepVariant DeepVariant — CNN-based variant caller that converts aligned reads into pileup image tensors and classifies them with a deep neural network to produce SNP and indel calls in VCF/gVCF format. Supports	google/deepvariant	Genomics	11
dnaPipeTE Verified dnaPipeTE (do not assemble - PipelinE for Transposable Elements) is a bioinformatics pipeline for estimating the proportion and landscape of transposable elements (TEs) and other repeats in a genome f	clemgoub/dnaPipeTE	Genomics	9
DRAGMAP DRAGMAP — Illumina's open-source hash-table-based DNA short-read aligner, implementing the DRAGEN mapping algorithm. Produces SAM/BAM output from FASTQ inputs against a pre-built hash table index. Use	Illumina/DRAGMAP	Genomics	10
DRAM-v DRAM-v (Distilled and Refined Annotation of Metabolism — Viral) — annotate viral genomes and metagenome-assembled phage sequences for auxiliary metabolic genes (AMGs) and viral hallmarks. Produces ann	WrightonLabCSU/DRAM	Metagenomics	10
Drimseq DRIMSeq is a Bioconductor R package for Differential Transcript Usage (DTU) analysis using a Dirichlet-multinomial distribution to model transcript-level count proportions. Tests whether relative tran	gosianow/DRIMSeq	Transcriptomics	10
Dysgu Dysgu — structural variant caller for paired-end and long-read sequencing data (Illumina, PacBio HiFi, Oxford Nanopore). Detects deletions, insertions, duplications, inversions, and translocations fro	kcleal/dysgu	Genomics	10
EAGLE Use when detecting allele-specific expression (ASE), allelic imbalance, or phased allele counts from RNA-seq data with EAGLE (Empirical Analysis of Allele-Specific Gene Expression). Covers phased VCF	tonyd256/eagle	Transcriptomics	9
Eeglab Use when processing EEG or MEG data with EEGLAB (MATLAB toolbox). Covers loading raw data, filtering, re-referencing, epoching, ICA, artifact rejection, channel interpolation, time-frequency analysis,	sccn/eeglab	Systems Biology	10
EIGENSOFT smartpca -- Principal Component Analysis for Population Genetics EIGENSOFT smartpca -- C/C++ tool for principal component analysis of genome-wide SNP genotype data. Computes eigenvectors and eigenvalues for population structure analysis, ancestry inference, stratif	chrchang/eigensoft	Genomics	9
EMU EMU (Exact Mapping-based Unambiguous) — species-level 16S rRNA taxonomic profiling for long-read sequencing data (Oxford Nanopore, PacBio). Uses an expectation-maximization algorithm to resolve multi-	treangenlab/emu	Metagenomics	10
FACETS FACETS (Fraction and Allele-Specific Copy number Estimates from Tumor/normal Sequencing) is an R package for allele-specific somatic copy number analysis of matched tumor-normal sequencing data. It es	mskcc/facets	Clinical Genomics	10
Fastq Screen Fastq Screen — Perl tool for screening FASTQ sequencing reads against multiple reference genomes to detect sample contamination. Maps reads to user-defined genome databases (human, mouse, E. coli, Phi	StevenWingworthy/FastQ-Screen	QC & Preprocessing	9
FastQC FastQC — universal quality control tool for high-throughput sequencing data. Performs modular analysis of FASTQ and BAM files, generating per-base quality scores, GC content distribution, adapter cont	s-andrews/FastQC	QC & Preprocessing	11
featureCounts featureCounts — ultrafast read counting program for assigning aligned reads (SAM/BAM) to genomic features such as genes, exons, promoters, and genomic bins. Part of the Subread package. Supports singl	manual	Genomics	11
fgbio fgbio — command-line toolkit for working with genomic and next-generation sequencing data, specializing in UMI (Unique Molecular Identifier) processing, consensus read calling, duplex sequencing, and	fulcrumgenomics/fgbio	QC & Preprocessing	11

Browse Tools

Browse Tools