Browse the BioContext7 deep skill library. Tool pages surface documentation, registry links, and install details for agent-facing workflows.
2,064 tools — page 4 of 42
| Tool | Registry | Domain | Docs |
|---|---|---|---|
BAMtools — C++ API and command-line toolkit for reading, writing, sorting, indexing, filtering, merging, splitting, and converting BAM alignment files. Provides JSON-based filter expressions, multi-fo | pezmaster31/bamtools | Genomics | 10 |
bamutil — C++ toolkit for reading, writing, and manipulating BAM alignment files. Provides duplicate marking (dedup), base quality score recalibration (recab), overlapping read clipping (clipOverlap), | statgen/bamUtil | Utilities & Infrastructure | 10 |
Bandage (a Bioinformatics Application for Navigating De novo Assembly Graphs Easily) — interactive visualization tool for de novo assembly graphs in GFA and FASTG formats. Visualize genome assembly gr | rrwick/Bandage | Genomics | 10 |
BBDuk (Decontamination Using Kmers) — high-performance Java tool from the BBTools suite for adapter trimming, quality trimming, contaminant filtering, sequence masking, GC filtering, and format conver | BioInfoTools/BBMap | QC & Preprocessing | 11 |
BBKNN (Batch Balanced K Nearest Neighbours) -- lightweight Python batch correction method for single-cell RNA-seq that operates on the k-nearest neighbor graph rather than the expression matrix. Repla | Teichlab/bbknn | Transcriptomics | 9 |
BBMap/BBTools — suite of 265+ fast, multithreaded Java-based bioinformatics tools for DNA/RNA sequence analysis. Includes BBMap short read aligner, BBDuk adapter trimmer and quality filter, BBMerge pa | bbushnell/BBTools | Genomics | 10 |
Use when converting Illumina BCL (Binary Base Call) files to FASTQ format, demultiplexing sequencing runs by index sequences, or trimming adapters from short-read data. Handles HiSeq, MiSeq, NextSeq, | manual | Workflows | 10 |
BEDOPS — high-performance genomic interval arithmetic suite for BED, VCF, GFF, GTF, BAM, and SAM files. Provides set operations (intersection, union, difference, complement, symmetric difference) via | bedops/bedops | Utilities & Infrastructure | 9 |
bedToBigBed — UCSC Kent utility that converts BED/BED+ text files into indexed binary BigBed (.bb) format for fast random-access genome browser tracks. Supports BED3 through BED12, BED+N extra-field e | ucscGenomeBrowser/kent | Genomics | 11 |
Use when building, querying, or manipulating colored compacted de Bruijn graphs (ccDBG) with Bifrost for genome assembly, metagenomics, variant detection, or k-mer analysis. Covers graph construction | pmelsted/bifrost | Genomics | 9 |
bigWigMerge merges multiple bigWig signal tracks into a single bedGraph output by summing (or optionally taking the maximum of) per-base signal values. Essential for combining ChIP-seq, ATAC-seq, RNA- | ucscGenomeBrowser/kent | Genomics | 9 |
biobambam2 — C++ toolkit for early-stage BAM file processing built on libmaus2. Provides bamsormadup for parallel sorting with duplicate marking, bamcollate2 for name-collation, bammarkduplicates for | gt1/biobambam2 | Genomics | 11 |
BLASR (Basic Local Alignment with Successive Refinement) — PacBio long-read aligner for mapping SMRT sequencing reads to reference genomes. Uses suffix array indexing and global chaining for seed-and- | PacificBiosciences/blasr | Genomics | 10 |
Bowtie2 — ultrafast and memory-efficient short-read aligner using FM-index with gapped alignment and local alignment modes. Standard aligner for ChIP-seq, ATAC-seq, CUT&RUN, CUT&Tag, and general short | BenLangmead/bowtie2 | Genomics | 11 |
BRAKER — fully automated gene structure annotation pipeline combining GeneMark-ES/ET/EP/ETP with AUGUSTUS training. Supports RNA-Seq evidence (BRAKER1), protein evidence (BRAKER2), and combined RNA-Se | Gaius-Augustus/BRAKER | Genomics | 10 |
Use when working with bsseeker2 — BSseeker2 — bisulfite sequencing aligner | BSSeeker/BSseeker2 | Epigenomics | 10 |
BUSCO (Benchmarking Universal Single-Copy Orthologs) — assess genome assembly, transcriptome, and proteome completeness using lineage-specific single-copy orthologs from the OrthoDB database. Reports | WenchaoLin/BUSCO | Genomics | 10 |
BUStools — C toolkit for manipulating BUS (Barcode, UMI, Set) files from single-cell RNA-seq pseudoalignment. Provides sorting, barcode error correction, UMI counting, matrix generation, quality inspe | BUStools/bustools | Single-Cell | 10 |
BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace | bwa-mem2/bwa-mem2 | Genomics | 12 |
BWA-MEM2 — accelerated Burrows-Wheeler Aligner for mapping short DNA reads to reference genomes. Use when aligning Illumina WGS, WES, targeted panel, ChIP-seq, or ATAC-seq FASTQ reads. Drop-in replace | bwa-mem2/bwa-mem2 | Genomics | 9 |
Canu — long-read de novo genome assembler for PacBio HiFi, PacBio CLR, and Oxford Nanopore reads. Performs read correction, trimming, and overlap-layout-consensus assembly in three stages. Supports di | marbl/canu | Genomics | 10 |
CD-HIT — fast sequence clustering and redundancy reduction for protein and nucleotide sequences. Clusters FASTA sequences at a user-defined identity threshold to remove redundancy from databases. Prov | weizhongli/cdhit | Phylogenetics | 10 |
ChAMP Verified ChAMP — Chip Analysis Methylation Pipeline for Illumina 450K and EPIC arrays. Integrated R/Bioconductor pipeline for DNA methylation analysis covering IDAT loading, quality control, normalization (BMI | YuanTian1991/ChAMP | Epigenomics | 16 |
Chromosight — Python computer-vision tool for detecting chromatin loops, borders (TAD boundaries), hairpins, and other patterns in Hi-C contact maps using template matching with Pearson correlation. R | koszullab/chromosight | Genomics | 10 |
chromVAR — R/Bioconductor package for determining variation in chromatin accessibility across sets of annotations or peaks. Designed for single-cell ATAC-seq (scATAC-seq) and sparse bulk ATAC-seq data | GreenleafLab/chromVAR | Single-Cell | 11 |
Circos — Perl-based circular visualization tool for genomics data. Creates publication-quality circular plots for genome comparisons, synteny, GC content, gene density, chromosome ideograms, link/ribb | vigsterkr/circos | Imaging | 9 |
CNVkit -- Python toolkit for detecting copy number variants (CNVs) from targeted DNA sequencing data (hybrid capture, amplicon) and whole-genome sequencing (WGS). Leverages both on-target and off-targ | etal/cnvkit | Genomics | 9 |
CONCOCT — metagenomic binning tool that clusters contigs into genomes using sequence composition (k-mer frequencies) and coverage across multiple samples. Combines PCA dimensionality reduction with Ga | BinPro/CONCOCT | Metagenomics | 9 |
cooler — file format and Python library for Hi-C chromatin contact data. Stores sparse contact matrices in .cool (single-resolution) and .mcool (multi-resolution) HDF5 files. Supports ingestion from p | open2c/cooler | Genomics | 9 |
Use when working with cramino — a fast QC tool for long-read alignment files (CRAM and BAM). Generates alignment statistics including N50, median length, mean quality, percent aligned, and yield from | wdecoster/cramino | QC & Preprocessing | 10 |
Use when working with csvtk — csvtk — cross-platform, ultrafast CSV/TSV | shenwei356/csvtk | Utilities & Infrastructure | 10 |
Cufflinks — RNA-seq transcript assembly, quantification, and differential expression suite. Assembles aligned reads into transcripts (cufflinks), performs differential expression analysis at transcrip | cole-trapnell-lab/cufflinks | Transcriptomics | 10 |
DADA2 — high-resolution amplicon sequence variant (ASV) inference from Illumina, 454, and Ion Torrent amplicon sequencing data. R/Bioconductor package that models sequencing errors to resolve exact bi | benjjneb/dada2 | Metagenomics | 11 |
DAS Tool — genome-resolved metagenomics bin refinement tool that integrates results from multiple binning algorithms to produce an optimized, non-redundant set of metagenome-assembled genomes (MAGs). | cmks/DAS_Tool | Metagenomics | 10 |
Use when working with datamash — datamash — GNU command-line tool for | manual | Utilities & Infrastructure | 10 |
DeepVariant — CNN-based variant caller that converts aligned reads into pileup image tensors and classifies them with a deep neural network to produce SNP and indel calls in VCF/gVCF format. Supports | google/deepvariant | Genomics | 11 |
dnaPipeTE Verified dnaPipeTE (do not assemble - PipelinE for Transposable Elements) is a bioinformatics pipeline for estimating the proportion and landscape of transposable elements (TEs) and other repeats in a genome f | clemgoub/dnaPipeTE | Genomics | 9 |
DRAGMAP — Illumina's open-source hash-table-based DNA short-read aligner, implementing the DRAGEN mapping algorithm. Produces SAM/BAM output from FASTQ inputs against a pre-built hash table index. Use | Illumina/DRAGMAP | Genomics | 10 |
DRAM-v (Distilled and Refined Annotation of Metabolism — Viral) — annotate viral genomes and metagenome-assembled phage sequences for auxiliary metabolic genes (AMGs) and viral hallmarks. Produces ann | WrightonLabCSU/DRAM | Metagenomics | 10 |
DRIMSeq is a Bioconductor R package for Differential Transcript Usage (DTU) analysis using a Dirichlet-multinomial distribution to model transcript-level count proportions. Tests whether relative tran | gosianow/DRIMSeq | Transcriptomics | 10 |
Dysgu — structural variant caller for paired-end and long-read sequencing data (Illumina, PacBio HiFi, Oxford Nanopore). Detects deletions, insertions, duplications, inversions, and translocations fro | kcleal/dysgu | Genomics | 10 |
Use when detecting allele-specific expression (ASE), allelic imbalance, or phased allele counts from RNA-seq data with EAGLE (Empirical Analysis of Allele-Specific Gene Expression). Covers phased VCF | tonyd256/eagle | Transcriptomics | 9 |
Use when processing EEG or MEG data with EEGLAB (MATLAB toolbox). Covers loading raw data, filtering, re-referencing, epoching, ICA, artifact rejection, channel interpolation, time-frequency analysis, | sccn/eeglab | Systems Biology | 10 |
EIGENSOFT smartpca -- C/C++ tool for principal component analysis of genome-wide SNP genotype data. Computes eigenvectors and eigenvalues for population structure analysis, ancestry inference, stratif | chrchang/eigensoft | Genomics | 9 |
EMU (Exact Mapping-based Unambiguous) — species-level 16S rRNA taxonomic profiling for long-read sequencing data (Oxford Nanopore, PacBio). Uses an expectation-maximization algorithm to resolve multi- | treangenlab/emu | Metagenomics | 10 |
FACETS (Fraction and Allele-Specific Copy number Estimates from Tumor/normal Sequencing) is an R package for allele-specific somatic copy number analysis of matched tumor-normal sequencing data. It es | mskcc/facets | Clinical Genomics | 10 |
Fastq Screen — Perl tool for screening FASTQ sequencing reads against multiple reference genomes to detect sample contamination. Maps reads to user-defined genome databases (human, mouse, E. coli, Phi | StevenWingworthy/FastQ-Screen | QC & Preprocessing | 9 |
FastQC — universal quality control tool for high-throughput sequencing data. Performs modular analysis of FASTQ and BAM files, generating per-base quality scores, GC content distribution, adapter cont | s-andrews/FastQC | QC & Preprocessing | 11 |
featureCounts — ultrafast read counting program for assigning aligned reads (SAM/BAM) to genomic features such as genes, exons, promoters, and genomic bins. Part of the Subread package. Supports singl | manual | Genomics | 11 |
fgbio — command-line toolkit for working with genomic and next-generation sequencing data, specializing in UMI (Unique Molecular Identifier) processing, consensus read calling, duplex sequencing, and | fulcrumgenomics/fgbio | QC & Preprocessing | 11 |