medaka -- neural network-based tool from Oxford Nanopore Technologies for creating consensus sequences and calling variants from nanopore sequencing data. Polishes draft assemblies from Flye, miniasm,

10

MEME Suite

MEME Suite — comprehensive motif analysis toolkit for DNA, RNA, and protein sequences. Provides de novo motif discovery (MEME, STREME), motif scanning (FIMO), motif comparison (TOMTOM), motif enrichme

10

Merqury

Merqury — reference-free quality, completeness, and phasing assessment for genome assemblies using k-mer databases built from Illumina reads. Computes QV (Phred-scaled quality value), k-mer completene

10

MetaBAT2

MetaBAT2 — adaptive binning of metagenomic contigs using empirical probabilistic distances of genome abundance and tetranucleotide frequency. Clusters assembled contigs into genome bins (MAGs) from me

10

metaSPAdes

metaSPAdes — de novo metagenomic assembler from the SPAdes suite using iterative de Bruijn graphs with multiple k-mer sizes optimized for uneven-coverage metagenomes. Assembles short-read (Illumina pa

10

MetaWRAP

MetaWRAP — flexible pipeline for genome-resolved metagenomic data analysis. Wraps read QC (trimming, host removal), metagenomic assembly (MEGAHIT, metaSPAdes), binning (MaxBin2, metaBAT2, CONCOCT), bi

10

Mikado

Mikado -- Python pipeline for selecting the best transcript models from multiple RNA-seq assemblies. Merges transcript sets from StringTie, Trinity, CLASS2, and other assemblers, then scores and filte

10

Miniasm

Miniasm — ultrafast OLC (overlap-layout-consensus) assembler for long reads (PacBio, Oxford Nanopore). Produces unitig-level assemblies from read overlaps computed by minimap2, without error correctio

10

minimap2-Based Phylogenetics

Use when building phylogenetic trees from genomic sequences using minimap2 for all-vs-all alignment and pairwise distance computation. Covers minimap2 PAF output, distance matrix construction, tree in

9

Minipolish

Use when working with minipolish — Racon-based polishing of miniasm long-read genome assemblies. Accepts raw nanopore or PacBio reads (FASTQ/FASTA) and a miniasm GFA assembly, runs iterative minimap2

9

MIP

Use when working with MIP (Medical Information Process) — a workflow management system for clinical whole-genome and whole-exome sequencing analysis. Runs end-to-end rare disease (rd_dna, rd_rna) and

9

MMseqs2

MMseqs2 — ultra-fast protein and nucleotide sequence searching, clustering, and taxonomy assignment. Performs all-vs-all comparisons, iterative profile searches, linclust linear-time clustering, easy-

10

modkit

modkit — Oxford Nanopore Technologies Rust CLI for modified base (methylation) analysis on nanopore sequencing data. Processes modBAM files (BAM with MM/ML tags) to compute per-position modification f

10

Mustache

Mustache — multi-scale detection of chromatin loops from Hi-C and Micro-C contact maps using scale-space representation. Identifies loops across resolutions from 10 kb down to 500 bp using computer-vi

10

Mutect2 -- GATK Somatic Short Variant Caller

Mutect2 -- GATK somatic short variant caller for detecting SNVs and indels in tumor samples. Supports tumor-only and tumor-normal paired calling modes with Panel of Normals and germline resource filte

9

NanoStat

NanoStat — Python tool for calculating statistics from Oxford Nanopore and PacBio long-read sequencing data. Reads FASTQ, FASTA, BAM, and sequencing summary files to produce read length distributions,

11

NanoSV

NanoSV — structural variant caller for long-read sequencing data (Oxford Nanopore). Detects deletions, insertions, inversions, duplications, and translocations from split-read alignments in BAM files.

10

NCBI Genome Workbench

NCBI Genome Workbench — desktop application for integrated genomic data visualization, sequence analysis, and annotation. Supports interactive browsing of GenBank/RefSeq entries, multiple sequence ali

10

Nextclade CLI

Nextclade CLI — viral genome quality control, clade assignment, mutation calling, and phylogenetic placement for SARS-CoV-2, influenza (H3N2, H1N1, H5N1), RSV, mpox, and 100+ other pathogen datasets.

10

NextDenovo

NextDenovo — string graph-based de novo assembler for long reads. Performs error correction (NextCorrect) then assembly (NextGraph) from PacBio CLR, PacBio HiFi, or Oxford Nanopore reads. Config-file

11

Novoalign

Novoalign — high-accuracy short-read sequence aligner for Illumina and MGI platforms using full Needleman-Wunsch algorithm with affine gap penalties. Maps single-end and paired-end reads up to 950bp o

10

Novocraft

Novocraft suite — short-read alignment and BAM sorting for DNA-seq and RNA-seq. novoalign maps Illumina, Ion Torrent, and 454 reads to indexed reference genomes with Smith-Waterman scoring for high-ac

10

NOVOPlasty

NOVOPlasty is a seed-and-extend de novo assembler for short circular genomes including mitochondrial genomes, chloroplast/plastid genomes, and small viral circular sequences from Illumina whole-genome

10

OptiType

Use when performing precision HLA class I typing from NGS data with OptiType. Covers read mapping to HLA reference with YARA, Integer Linear Programming allele calling at 4-digit resolution for HLA-A,

9

Panaroo

Panaroo — graph-based bacterial pan-genome analysis pipeline that corrects annotation errors. Takes GFF3 files from Prokka or Bakta, builds a population-level gene graph, and outputs core/accessory ge

10

Pangolin

Pangolin — Phylogenetic Assignment of Named Global Outbreak LINeages for SARS-CoV-2. Assigns Pango lineages to consensus genome sequences using UShER (phylogenetic placement) or pangoLEARN (ML classif

11

Pavian

Verified

Pavian — interactive R/Shiny web application for metagenomics analysis and visualization. Reads classification reports from Kraken, Kraken2, Bracken, MetaPhlAn, DIAMOND+MEGAN, and Centrifuge to genera

9

Pfam

Pfam — database of protein families with curated HMM profiles for domain annotation, functional classification, and evolutionary analysis. Search protein sequences against Pfam-A using HMMER's hmmscan

9

PGAP

PGAP — NCBI Prokaryotic Genome Annotation Pipeline for comprehensive annotation of bacterial and archaeal genomes. Runs gene prediction (GeneMarkS-2+, tRNAscan-SE), functional annotation via HMM/BLAST

10

phantompeakqualtools

Use when working with phantompeakqualtools — phantompeakqualtools (spp)

9

phytools

phytools — R package for phylogenetic comparative biology and the study of trait evolution on phylogenies. Provides ancestral state reconstruction (continuous and discrete), stochastic character mappi

10

Pilon

Pilon — automated genome assembly improvement and variant detection tool. Polishes draft assemblies by correcting SNPs, indels, and misassemblies using short-read alignments (BAM files). Fills gaps in

9

PinAPL-Py

PinAPL-Py (Pooled In/out Screen Analysis Pipeline for Python) is a complete analysis pipeline for pooled CRISPR sgRNA screens. Handles raw FASTQ alignment to sgRNA libraries, read counting, normalizat

10

PlantCV

Use when analyzing plant images for phenotyping with PlantCV. Handles visible (RGB), near-infrared (NIR), fluorescence, thermal, and hyperspectral imaging workflows. Performs image segmentation, shape

10

PLINK 1.9/2

Verified

PLINK 1.9/2 — high-performance command-line toolset for whole-genome association analysis, population stratification, identity-by-descent, linkage disequilibrium computation, and genotype data managem

10

Polypolish

Use when working with Polypolish — a short-read polishing tool for long-read genome assemblies. Corrects errors in ONT or PacBio draft assemblies using paired Illumina short reads aligned with BWA. Us

10

Porechop

Porechop — adapter trimming and demultiplexing tool for Oxford Nanopore long-read sequencing data. Finds and removes adapters from read ends, splits chimeric reads containing internal adapters, and de

9

PRINSEQ++

PRINSEQ++ — high-performance C++ tool for quality control and preprocessing of FASTQ sequencing reads. Filters and trims reads by quality score, length, GC content, sequence complexity (DUST/entropy),

11

ProteoWizard msconvert

ProteoWizard msconvert — command-line mass spectrometry data converter for proteomics workflows. Use when converting vendor RAW files (Thermo, AB SCIEX, Waters, Bruker, Agilent) to open formats (mzML,

12

Purge_dups

Purge_dups — haplotypic duplication identification and removal tool for genome assemblies. Detects and removes overlapping haplotigs and heterozygous duplications from primary assemblies using read de

11

Purge Haplotigs

Purge Haplotigs — pipeline for curating heterozygous diploid genome assemblies by identifying and reassigning haplotigs (alternative haplotype contigs). Analyzes read-depth histograms to set coverage

10

pybedtools

Use when working with pybedtools — pybedtools — Python interface to BEDTools

11

PycoQC

PycoQC — Python tool for Oxford Nanopore sequencing QC that parses basecaller summary files (sequencing_summary.txt from Guppy, Dorado, or MinKNOW) and BAM alignment files to generate interactive HTML

10

Pyrodigal

Pyrodigal is a Python library binding to Prodigal for gene prediction in prokaryotic genomes and metagenomes. Provides in-memory ORF finding with SIMD-accelerated scoring, single-genome and metagenomi

11

Qualimap

Qualimap — platform-independent quality control tool for next-generation sequencing alignment data. Provides BAM QC (coverage, insert size, GC content, mapping quality), RNA-seq QC (gene body coverage

11

QUAST

QUAST (Quality Assessment Tool for Genome Assemblies) — evaluate genome assembly quality using contiguity metrics including N50, L50, total length, largest contig, misassemblies, and mismatches. Suppo

10

Racon

Racon — ultrafast consensus module for raw de novo genome assembly of long uncorrected reads (PacBio CLR, Oxford Nanopore). Uses partial order alignment (POA) to polish draft assemblies from miniasm,

10

RagTag (Ragoo/RagTag)

RagTag — reference-guided genome assembly scaffolding, misassembly correction, gap patching, and multi-scaffold merging toolkit. Successor to RaGOO. Supports minimap2, unimap, and nucmer aligners. Pro

10

REGENIE

REGENIE — whole-genome regression for biobank-scale GWAS. Two-step ridge regression with leave-one-chromosome-out (LOCO) predictions for efficient single-variant and gene-based association testing. Su

11

RepeatMasker

RepeatMasker — screens DNA sequences for interspersed repeats and low-complexity regions. Uses Tandem Repeats Finder and curated repeat libraries (Dfam, RepBase) to identify SINEs, LINEs, LTR elements

10

Tool	Registry	Domain	Docs
Medaka medaka -- neural network-based tool from Oxford Nanopore Technologies for creating consensus sequences and calling variants from nanopore sequencing data. Polishes draft assemblies from Flye, miniasm,	nanoporetech/medaka	Genomics	10
MEME Suite MEME Suite — comprehensive motif analysis toolkit for DNA, RNA, and protein sequences. Provides de novo motif discovery (MEME, STREME), motif scanning (FIMO), motif comparison (TOMTOM), motif enrichme	duocang/MEME-Suite	Epigenomics	10
Merqury Merqury — reference-free quality, completeness, and phasing assessment for genome assemblies using k-mer databases built from Illumina reads. Computes QV (Phred-scaled quality value), k-mer completene	marbl/merqury	Genomics	10
MetaBAT2 MetaBAT2 — adaptive binning of metagenomic contigs using empirical probabilistic distances of genome abundance and tetranucleotide frequency. Clusters assembled contigs into genome bins (MAGs) from me	laczko/MetaBAT	Metagenomics	10
metaSPAdes metaSPAdes — de novo metagenomic assembler from the SPAdes suite using iterative de Bruijn graphs with multiple k-mer sizes optimized for uneven-coverage metagenomes. Assembles short-read (Illumina pa	ablab/spades	Metagenomics	10
MetaWRAP MetaWRAP — flexible pipeline for genome-resolved metagenomic data analysis. Wraps read QC (trimming, host removal), metagenomic assembly (MEGAHIT, metaSPAdes), binning (MaxBin2, metaBAT2, CONCOCT), bi	bxlab/metaWRAP	Metagenomics	10
Mikado Mikado -- Python pipeline for selecting the best transcript models from multiple RNA-seq assemblies. Merges transcript sets from StringTie, Trinity, CLASS2, and other assemblers, then scores and filte	EI-CoreBioinformatics/mikado	Transcriptomics	10
Miniasm Miniasm — ultrafast OLC (overlap-layout-consensus) assembler for long reads (PacBio, Oxford Nanopore). Produces unitig-level assemblies from read overlaps computed by minimap2, without error correctio	lh3/miniasm	Genomics	10
minimap2-Based Phylogenetics Use when building phylogenetic trees from genomic sequences using minimap2 for all-vs-all alignment and pairwise distance computation. Covers minimap2 PAF output, distance matrix construction, tree in	lh3/minimap2	Phylogenetics	9
Minipolish Use when working with minipolish — Racon-based polishing of miniasm long-read genome assemblies. Accepts raw nanopore or PacBio reads (FASTQ/FASTA) and a miniasm GFA assembly, runs iterative minimap2	rrwick/Minipolish	Genomics	9
MIP Use when working with MIP (Medical Information Process) — a workflow management system for clinical whole-genome and whole-exome sequencing analysis. Runs end-to-end rare disease (rd_dna, rd_rna) and	Clinical-Genomics/MIP	Workflows	9
MMseqs2 MMseqs2 — ultra-fast protein and nucleotide sequence searching, clustering, and taxonomy assignment. Performs all-vs-all comparisons, iterative profile searches, linclust linear-time clustering, easy-	soedinglab/MMseqs2	Phylogenetics	10
modkit modkit — Oxford Nanopore Technologies Rust CLI for modified base (methylation) analysis on nanopore sequencing data. Processes modBAM files (BAM with MM/ML tags) to compute per-position modification f	nanoporetech/modkit	Genomics	10
Mustache Mustache — multi-scale detection of chromatin loops from Hi-C and Micro-C contact maps using scale-space representation. Identifies loops across resolutions from 10 kb down to 500 bp using computer-vi	ay-lab/mustache	Genomics	10
Mutect2 -- GATK Somatic Short Variant Caller Mutect2 -- GATK somatic short variant caller for detecting SNVs and indels in tumor samples. Supports tumor-only and tumor-normal paired calling modes with Panel of Normals and germline resource filte	broadinstitute/gatk	Genomics	9
NanoStat NanoStat — Python tool for calculating statistics from Oxford Nanopore and PacBio long-read sequencing data. Reads FASTQ, FASTA, BAM, and sequencing summary files to produce read length distributions,	wdecoster/nanostat	QC & Preprocessing	11
NanoSV NanoSV — structural variant caller for long-read sequencing data (Oxford Nanopore). Detects deletions, insertions, inversions, duplications, and translocations from split-read alignments in BAM files.	mroosmalen/nanosv	Genomics	10
NCBI Genome Workbench NCBI Genome Workbench — desktop application for integrated genomic data visualization, sequence analysis, and annotation. Supports interactive browsing of GenBank/RefSeq entries, multiple sequence ali	manual	Utilities & Infrastructure	10
Nextclade CLI Nextclade CLI — viral genome quality control, clade assignment, mutation calling, and phylogenetic placement for SARS-CoV-2, influenza (H3N2, H1N1, H5N1), RSV, mpox, and 100+ other pathogen datasets.	nextstrain/nextclade	Phylogenetics	10
NextDenovo NextDenovo — string graph-based de novo assembler for long reads. Performs error correction (NextCorrect) then assembly (NextGraph) from PacBio CLR, PacBio HiFi, or Oxford Nanopore reads. Config-file	Nextomics/NextDenovo	Genomics	11
Novoalign Novoalign — high-accuracy short-read sequence aligner for Illumina and MGI platforms using full Needleman-Wunsch algorithm with affine gap penalties. Maps single-end and paired-end reads up to 950bp o	manual	Genomics	10
Novocraft Novocraft suite — short-read alignment and BAM sorting for DNA-seq and RNA-seq. novoalign maps Illumina, Ion Torrent, and 454 reads to indexed reference genomes with Smith-Waterman scoring for high-ac	manual	Genomics	10
NOVOPlasty NOVOPlasty is a seed-and-extend de novo assembler for short circular genomes including mitochondrial genomes, chloroplast/plastid genomes, and small viral circular sequences from Illumina whole-genome	ndierckx/NOVOPlasty	Genomics	10
OptiType Use when performing precision HLA class I typing from NGS data with OptiType. Covers read mapping to HLA reference with YARA, Integer Linear Programming allele calling at 4-digit resolution for HLA-A,	FRED-2/OptiType	Clinical Genomics	9
Panaroo Panaroo — graph-based bacterial pan-genome analysis pipeline that corrects annotation errors. Takes GFF3 files from Prokka or Bakta, builds a population-level gene graph, and outputs core/accessory ge	gtonkinhill/panaroo	Phylogenetics	10
Pangolin Pangolin — Phylogenetic Assignment of Named Global Outbreak LINeages for SARS-CoV-2. Assigns Pango lineages to consensus genome sequences using UShER (phylogenetic placement) or pangoLEARN (ML classif	cov-lineages/pangolin	Phylogenetics	11
Pavian Verified Pavian — interactive R/Shiny web application for metagenomics analysis and visualization. Reads classification reports from Kraken, Kraken2, Bracken, MetaPhlAn, DIAMOND+MEGAN, and Centrifuge to genera	fbreitwieser/pavian	Metagenomics	9
Pfam Pfam — database of protein families with curated HMM profiles for domain annotation, functional classification, and evolutionary analysis. Search protein sequences against Pfam-A using HMMER's hmmscan	ProteinsWebTeam/Pfam	Utilities & Infrastructure	9
PGAP PGAP — NCBI Prokaryotic Genome Annotation Pipeline for comprehensive annotation of bacterial and archaeal genomes. Runs gene prediction (GeneMarkS-2+, tRNAscan-SE), functional annotation via HMM/BLAST	ncbi/pgap	Genomics	10
phantompeakqualtools Use when working with phantompeakqualtools — phantompeakqualtools (spp)	kundajelab/phantompeakqualtools	Epigenomics	9
phytools phytools — R package for phylogenetic comparative biology and the study of trait evolution on phylogenies. Provides ancestral state reconstruction (continuous and discrete), stochastic character mappi	liamrevell/phytools	Phylogenetics	10
Pilon Pilon — automated genome assembly improvement and variant detection tool. Polishes draft assemblies by correcting SNPs, indels, and misassemblies using short-read alignments (BAM files). Fills gaps in	broadinstitute/pilon	Genomics	9
PinAPL-Py PinAPL-Py (Pooled In/out Screen Analysis Pipeline for Python) is a complete analysis pipeline for pooled CRISPR sgRNA screens. Handles raw FASTQ alignment to sgRNA libraries, read counting, normalizat	LewisLabUCSD/PinAPL-Py	Systems Biology	10
PlantCV Use when analyzing plant images for phenotyping with PlantCV. Handles visible (RGB), near-infrared (NIR), fluorescence, thermal, and hyperspectral imaging workflows. Performs image segmentation, shape	danforthcenter/plantcv	Imaging	10
PLINK 1.9/2 Verified PLINK 1.9/2 — high-performance command-line toolset for whole-genome association analysis, population stratification, identity-by-descent, linkage disequilibrium computation, and genotype data managem	chrchang/plink-ng	Population Genetics	10
Polypolish Use when working with Polypolish — a short-read polishing tool for long-read genome assemblies. Corrects errors in ONT or PacBio draft assemblies using paired Illumina short reads aligned with BWA. Us	rrwick/Polypolish	Genomics	10
Porechop Porechop — adapter trimming and demultiplexing tool for Oxford Nanopore long-read sequencing data. Finds and removes adapters from read ends, splits chimeric reads containing internal adapters, and de	rrwick/Porechop	QC & Preprocessing	9
PRINSEQ++ PRINSEQ++ — high-performance C++ tool for quality control and preprocessing of FASTQ sequencing reads. Filters and trims reads by quality score, length, GC content, sequence complexity (DUST/entropy),	schallerdavid/prinseq-plus-plus	QC & Preprocessing	11
ProteoWizard msconvert ProteoWizard msconvert — command-line mass spectrometry data converter for proteomics workflows. Use when converting vendor RAW files (Thermo, AB SCIEX, Waters, Bruker, Agilent) to open formats (mzML,	ProteoWizard/pwiz	Proteomics	12
Purge_dups Purge_dups — haplotypic duplication identification and removal tool for genome assemblies. Detects and removes overlapping haplotigs and heterozygous duplications from primary assemblies using read de	dfguan/purge_dups	Genomics	11
Purge Haplotigs Purge Haplotigs — pipeline for curating heterozygous diploid genome assemblies by identifying and reassigning haplotigs (alternative haplotype contigs). Analyzes read-depth histograms to set coverage	skingan/purge_haplotigs_multiBAM	Genomics	10
pybedtools Use when working with pybedtools — pybedtools — Python interface to BEDTools	daler/pybedtools	Utilities & Infrastructure	11
PycoQC PycoQC — Python tool for Oxford Nanopore sequencing QC that parses basecaller summary files (sequencing_summary.txt from Guppy, Dorado, or MinKNOW) and BAM alignment files to generate interactive HTML	a-slide/pycoQC	QC & Preprocessing	10
Pyrodigal Pyrodigal is a Python library binding to Prodigal for gene prediction in prokaryotic genomes and metagenomes. Provides in-memory ORF finding with SIMD-accelerated scoring, single-genome and metagenomi	althonos/pyrodigal	Metagenomics	11
Qualimap Qualimap — platform-independent quality control tool for next-generation sequencing alignment data. Provides BAM QC (coverage, insert size, GC content, mapping quality), RNA-seq QC (gene body coverage	scchess/Qualimap	QC & Preprocessing	11
QUAST QUAST (Quality Assessment Tool for Genome Assemblies) — evaluate genome assembly quality using contiguity metrics including N50, L50, total length, largest contig, misassemblies, and mismatches. Suppo	ablab/quast	Genomics	10
Racon Racon — ultrafast consensus module for raw de novo genome assembly of long uncorrected reads (PacBio CLR, Oxford Nanopore). Uses partial order alignment (POA) to polish draft assemblies from miniasm,	isovic/racon	Genomics	10
RagTag (Ragoo/RagTag) RagTag — reference-guided genome assembly scaffolding, misassembly correction, gap patching, and multi-scaffold merging toolkit. Successor to RaGOO. Supports minimap2, unimap, and nucmer aligners. Pro	malonge/RagTag	Genomics	10
REGENIE REGENIE — whole-genome regression for biobank-scale GWAS. Two-step ridge regression with leave-one-chromosome-out (LOCO) predictions for efficient single-variant and gene-based association testing. Su	rgcgithub/regenie	Population Genetics	11
RepeatMasker RepeatMasker — screens DNA sequences for interspersed repeats and low-complexity regions. Uses Tandem Repeats Finder and curated repeat libraries (Dfam, RepBase) to identify SINEs, LINEs, LTR elements	rmhubley/RepeatMasker	Genomics	10

Tool	Registry	Domain	Docs
Medaka medaka -- neural network-based tool from Oxford Nanopore Technologies for creating consensus sequences and calling variants from nanopore sequencing data. Polishes draft assemblies from Flye, miniasm,	nanoporetech/medaka	Genomics	10
MEME Suite MEME Suite — comprehensive motif analysis toolkit for DNA, RNA, and protein sequences. Provides de novo motif discovery (MEME, STREME), motif scanning (FIMO), motif comparison (TOMTOM), motif enrichme	duocang/MEME-Suite	Epigenomics	10
Merqury Merqury — reference-free quality, completeness, and phasing assessment for genome assemblies using k-mer databases built from Illumina reads. Computes QV (Phred-scaled quality value), k-mer completene	marbl/merqury	Genomics	10
MetaBAT2 MetaBAT2 — adaptive binning of metagenomic contigs using empirical probabilistic distances of genome abundance and tetranucleotide frequency. Clusters assembled contigs into genome bins (MAGs) from me	laczko/MetaBAT	Metagenomics	10
metaSPAdes metaSPAdes — de novo metagenomic assembler from the SPAdes suite using iterative de Bruijn graphs with multiple k-mer sizes optimized for uneven-coverage metagenomes. Assembles short-read (Illumina pa	ablab/spades	Metagenomics	10
MetaWRAP MetaWRAP — flexible pipeline for genome-resolved metagenomic data analysis. Wraps read QC (trimming, host removal), metagenomic assembly (MEGAHIT, metaSPAdes), binning (MaxBin2, metaBAT2, CONCOCT), bi	bxlab/metaWRAP	Metagenomics	10
Mikado Mikado -- Python pipeline for selecting the best transcript models from multiple RNA-seq assemblies. Merges transcript sets from StringTie, Trinity, CLASS2, and other assemblers, then scores and filte	EI-CoreBioinformatics/mikado	Transcriptomics	10
Miniasm Miniasm — ultrafast OLC (overlap-layout-consensus) assembler for long reads (PacBio, Oxford Nanopore). Produces unitig-level assemblies from read overlaps computed by minimap2, without error correctio	lh3/miniasm	Genomics	10
minimap2-Based Phylogenetics Use when building phylogenetic trees from genomic sequences using minimap2 for all-vs-all alignment and pairwise distance computation. Covers minimap2 PAF output, distance matrix construction, tree in	lh3/minimap2	Phylogenetics	9
Minipolish Use when working with minipolish — Racon-based polishing of miniasm long-read genome assemblies. Accepts raw nanopore or PacBio reads (FASTQ/FASTA) and a miniasm GFA assembly, runs iterative minimap2	rrwick/Minipolish	Genomics	9
MIP Use when working with MIP (Medical Information Process) — a workflow management system for clinical whole-genome and whole-exome sequencing analysis. Runs end-to-end rare disease (rd_dna, rd_rna) and	Clinical-Genomics/MIP	Workflows	9
MMseqs2 MMseqs2 — ultra-fast protein and nucleotide sequence searching, clustering, and taxonomy assignment. Performs all-vs-all comparisons, iterative profile searches, linclust linear-time clustering, easy-	soedinglab/MMseqs2	Phylogenetics	10
modkit modkit — Oxford Nanopore Technologies Rust CLI for modified base (methylation) analysis on nanopore sequencing data. Processes modBAM files (BAM with MM/ML tags) to compute per-position modification f	nanoporetech/modkit	Genomics	10
Mustache Mustache — multi-scale detection of chromatin loops from Hi-C and Micro-C contact maps using scale-space representation. Identifies loops across resolutions from 10 kb down to 500 bp using computer-vi	ay-lab/mustache	Genomics	10
Mutect2 -- GATK Somatic Short Variant Caller Mutect2 -- GATK somatic short variant caller for detecting SNVs and indels in tumor samples. Supports tumor-only and tumor-normal paired calling modes with Panel of Normals and germline resource filte	broadinstitute/gatk	Genomics	9
NanoStat NanoStat — Python tool for calculating statistics from Oxford Nanopore and PacBio long-read sequencing data. Reads FASTQ, FASTA, BAM, and sequencing summary files to produce read length distributions,	wdecoster/nanostat	QC & Preprocessing	11
NanoSV NanoSV — structural variant caller for long-read sequencing data (Oxford Nanopore). Detects deletions, insertions, inversions, duplications, and translocations from split-read alignments in BAM files.	mroosmalen/nanosv	Genomics	10
NCBI Genome Workbench NCBI Genome Workbench — desktop application for integrated genomic data visualization, sequence analysis, and annotation. Supports interactive browsing of GenBank/RefSeq entries, multiple sequence ali	manual	Utilities & Infrastructure	10
Nextclade CLI Nextclade CLI — viral genome quality control, clade assignment, mutation calling, and phylogenetic placement for SARS-CoV-2, influenza (H3N2, H1N1, H5N1), RSV, mpox, and 100+ other pathogen datasets.	nextstrain/nextclade	Phylogenetics	10
NextDenovo NextDenovo — string graph-based de novo assembler for long reads. Performs error correction (NextCorrect) then assembly (NextGraph) from PacBio CLR, PacBio HiFi, or Oxford Nanopore reads. Config-file	Nextomics/NextDenovo	Genomics	11
Novoalign Novoalign — high-accuracy short-read sequence aligner for Illumina and MGI platforms using full Needleman-Wunsch algorithm with affine gap penalties. Maps single-end and paired-end reads up to 950bp o	manual	Genomics	10
Novocraft Novocraft suite — short-read alignment and BAM sorting for DNA-seq and RNA-seq. novoalign maps Illumina, Ion Torrent, and 454 reads to indexed reference genomes with Smith-Waterman scoring for high-ac	manual	Genomics	10
NOVOPlasty NOVOPlasty is a seed-and-extend de novo assembler for short circular genomes including mitochondrial genomes, chloroplast/plastid genomes, and small viral circular sequences from Illumina whole-genome	ndierckx/NOVOPlasty	Genomics	10
OptiType Use when performing precision HLA class I typing from NGS data with OptiType. Covers read mapping to HLA reference with YARA, Integer Linear Programming allele calling at 4-digit resolution for HLA-A,	FRED-2/OptiType	Clinical Genomics	9
Panaroo Panaroo — graph-based bacterial pan-genome analysis pipeline that corrects annotation errors. Takes GFF3 files from Prokka or Bakta, builds a population-level gene graph, and outputs core/accessory ge	gtonkinhill/panaroo	Phylogenetics	10
Pangolin Pangolin — Phylogenetic Assignment of Named Global Outbreak LINeages for SARS-CoV-2. Assigns Pango lineages to consensus genome sequences using UShER (phylogenetic placement) or pangoLEARN (ML classif	cov-lineages/pangolin	Phylogenetics	11
Pavian Verified Pavian — interactive R/Shiny web application for metagenomics analysis and visualization. Reads classification reports from Kraken, Kraken2, Bracken, MetaPhlAn, DIAMOND+MEGAN, and Centrifuge to genera	fbreitwieser/pavian	Metagenomics	9
Pfam Pfam — database of protein families with curated HMM profiles for domain annotation, functional classification, and evolutionary analysis. Search protein sequences against Pfam-A using HMMER's hmmscan	ProteinsWebTeam/Pfam	Utilities & Infrastructure	9
PGAP PGAP — NCBI Prokaryotic Genome Annotation Pipeline for comprehensive annotation of bacterial and archaeal genomes. Runs gene prediction (GeneMarkS-2+, tRNAscan-SE), functional annotation via HMM/BLAST	ncbi/pgap	Genomics	10
phantompeakqualtools Use when working with phantompeakqualtools — phantompeakqualtools (spp)	kundajelab/phantompeakqualtools	Epigenomics	9
phytools phytools — R package for phylogenetic comparative biology and the study of trait evolution on phylogenies. Provides ancestral state reconstruction (continuous and discrete), stochastic character mappi	liamrevell/phytools	Phylogenetics	10
Pilon Pilon — automated genome assembly improvement and variant detection tool. Polishes draft assemblies by correcting SNPs, indels, and misassemblies using short-read alignments (BAM files). Fills gaps in	broadinstitute/pilon	Genomics	9
PinAPL-Py PinAPL-Py (Pooled In/out Screen Analysis Pipeline for Python) is a complete analysis pipeline for pooled CRISPR sgRNA screens. Handles raw FASTQ alignment to sgRNA libraries, read counting, normalizat	LewisLabUCSD/PinAPL-Py	Systems Biology	10
PlantCV Use when analyzing plant images for phenotyping with PlantCV. Handles visible (RGB), near-infrared (NIR), fluorescence, thermal, and hyperspectral imaging workflows. Performs image segmentation, shape	danforthcenter/plantcv	Imaging	10
PLINK 1.9/2 Verified PLINK 1.9/2 — high-performance command-line toolset for whole-genome association analysis, population stratification, identity-by-descent, linkage disequilibrium computation, and genotype data managem	chrchang/plink-ng	Population Genetics	10
Polypolish Use when working with Polypolish — a short-read polishing tool for long-read genome assemblies. Corrects errors in ONT or PacBio draft assemblies using paired Illumina short reads aligned with BWA. Us	rrwick/Polypolish	Genomics	10
Porechop Porechop — adapter trimming and demultiplexing tool for Oxford Nanopore long-read sequencing data. Finds and removes adapters from read ends, splits chimeric reads containing internal adapters, and de	rrwick/Porechop	QC & Preprocessing	9
PRINSEQ++ PRINSEQ++ — high-performance C++ tool for quality control and preprocessing of FASTQ sequencing reads. Filters and trims reads by quality score, length, GC content, sequence complexity (DUST/entropy),	schallerdavid/prinseq-plus-plus	QC & Preprocessing	11
ProteoWizard msconvert ProteoWizard msconvert — command-line mass spectrometry data converter for proteomics workflows. Use when converting vendor RAW files (Thermo, AB SCIEX, Waters, Bruker, Agilent) to open formats (mzML,	ProteoWizard/pwiz	Proteomics	12
Purge_dups Purge_dups — haplotypic duplication identification and removal tool for genome assemblies. Detects and removes overlapping haplotigs and heterozygous duplications from primary assemblies using read de	dfguan/purge_dups	Genomics	11
Purge Haplotigs Purge Haplotigs — pipeline for curating heterozygous diploid genome assemblies by identifying and reassigning haplotigs (alternative haplotype contigs). Analyzes read-depth histograms to set coverage	skingan/purge_haplotigs_multiBAM	Genomics	10
pybedtools Use when working with pybedtools — pybedtools — Python interface to BEDTools	daler/pybedtools	Utilities & Infrastructure	11
PycoQC PycoQC — Python tool for Oxford Nanopore sequencing QC that parses basecaller summary files (sequencing_summary.txt from Guppy, Dorado, or MinKNOW) and BAM alignment files to generate interactive HTML	a-slide/pycoQC	QC & Preprocessing	10
Pyrodigal Pyrodigal is a Python library binding to Prodigal for gene prediction in prokaryotic genomes and metagenomes. Provides in-memory ORF finding with SIMD-accelerated scoring, single-genome and metagenomi	althonos/pyrodigal	Metagenomics	11
Qualimap Qualimap — platform-independent quality control tool for next-generation sequencing alignment data. Provides BAM QC (coverage, insert size, GC content, mapping quality), RNA-seq QC (gene body coverage	scchess/Qualimap	QC & Preprocessing	11
QUAST QUAST (Quality Assessment Tool for Genome Assemblies) — evaluate genome assembly quality using contiguity metrics including N50, L50, total length, largest contig, misassemblies, and mismatches. Suppo	ablab/quast	Genomics	10
Racon Racon — ultrafast consensus module for raw de novo genome assembly of long uncorrected reads (PacBio CLR, Oxford Nanopore). Uses partial order alignment (POA) to polish draft assemblies from miniasm,	isovic/racon	Genomics	10
RagTag (Ragoo/RagTag) RagTag — reference-guided genome assembly scaffolding, misassembly correction, gap patching, and multi-scaffold merging toolkit. Successor to RaGOO. Supports minimap2, unimap, and nucmer aligners. Pro	malonge/RagTag	Genomics	10
REGENIE REGENIE — whole-genome regression for biobank-scale GWAS. Two-step ridge regression with leave-one-chromosome-out (LOCO) predictions for efficient single-variant and gene-based association testing. Su	rgcgithub/regenie	Population Genetics	11
RepeatMasker RepeatMasker — screens DNA sequences for interspersed repeats and low-complexity regions. Uses Tandem Repeats Finder and curated repeat libraries (Dfam, RepBase) to identify SINEs, LINEs, LTR elements	rmhubley/RepeatMasker	Genomics	10

Browse Tools

Browse Tools