Browse the BioContext7 deep skill library. Tool pages surface documentation, registry links, and install details for agent-facing workflows.
2,064 tools — page 1 of 42
| Tool | Registry | Domain | Docs |
|---|---|---|---|
Tools for manipulating next-generation sequencing data stored in SAM/BAM/CRAM format, including sorting, indexing, and format conversion. | samtools/samtools | Genomics | 11 |
Ensembl VEP (Variant Effect Predictor) — annotates genomic variants with gene, transcript, protein, and regulatory consequences using Ensembl's annotation framework. Supports VCF, HGVS, and genomic co | Ensembl/ensembl-vep | Genomics | 16 |
Differential expression analysis for RNA-seq data using negative binomial generalized linear models with size factor normalization and empirical Bayes shrinkage. | thelovelab/DESeq2 | Transcriptomics | 8 |
Deep learning system for protein structure prediction from amino acid sequence with atomic accuracy — revolutionized structural biology. | google-deepmind/alphafold | Structure Prediction | 13 |
Spliced Transcripts Alignment to a Reference — ultrafast RNA-seq aligner with splice junction detection and novel transcript discovery. | alexdobin/STAR | Transcriptomics | 8 |
Differential expression analysis of RNA-seq and other count data using empirical Bayes estimation of the dispersion parameter for the negative binomial model. | manual | Transcriptomics | 8 |
Data-driven computational pipeline framework enabling scalable and reproducible scientific workflows using software containers. | nextflow-io/nextflow | Workflows | 8 |
MultiQC Verified Aggregate bioinformatics analysis results from many samples and tools into a single report — supports 150+ tools including FastQC, STAR, HISAT2, and Samtools. | MultiQC/MultiQC | QC & Preprocessing | 10 |
Java command-line tools from the Broad Institute for manipulating high-throughput sequencing data and formats including BAM, CRAM, and VCF. | broadinstitute/picard | Genomics | 8 |
HMMER Verified Biosequence analysis using profile hidden Markov models — for detecting remote homologs, protein domain identification, and multiple sequence alignments. | EddyRivasLab/hmmer | Genomics | 8 |
Versatile sequence alignment program for mapping long reads (PacBio/Oxford Nanopore), splice-aware RNA-seq alignment, and assembly-to-assembly alignment. | lh3/minimap2 | Genomics | 8 |
HISAT2 Verified Graph-based alignment of reads to a population of genomes using hierarchical indexing — successor to TopHat and HISAT for RNA-seq alignment. | DaehwanKimLab/hisat2 | Genomics | 8 |
Kallisto Verified Near-optimal probabilistic RNA-seq quantification using pseudoalignment against transcriptome index — orders of magnitude faster than traditional aligners. | pachterlab/kallisto | Transcriptomics | 8 |
BEDTools Verified BEDTools — Swiss-army knife for genome arithmetic on genomic intervals. Provides intersect, merge, subtract, complement, closest, window, coverage, genomecov, slop, flank, sort, groupby, getfasta, mas | arq5x/bedtools2 | Genomics | 15 |
Removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads. | marcelm/cutadapt | QC & Preprocessing | 8 |
Rapid prokaryotic genome annotation tool producing standards-compliant GFF3, GenBank, and FASTA output for submission and downstream analysis. | tseemann/prokka | Genomics | 10 |
Metagenomic phylogenetic analysis tool for profiling microbial community composition from shotgun metagenomics data using clade-specific markers. | biobakery/MetaPhlAn | Metagenomics | 8 |
Multiple sequence alignment tool with high accuracy and high throughput — one of the best-performing multiple alignment programs. | rcedgar/muscle | Phylogenetics | 8 |
Use when performing GWAS fine-mapping with PAINTOR (Probabilistic Annotation INTegratOR). Computes posterior inclusion probabilities (PIPs) for causal variants by integrating summary statistics (Z-sco | gkichaev/PAINTOR_V3.0 | Population Genetics | 10 |
SBayesR and SBayesRC are Bayesian methods for polygenic risk score (PRS) computation, SNP heritability estimation, and genetic architecture analysis using GWAS summary statistics and an LD reference p | zhilizheng/SBayesRC | Population Genetics | 8 |
> | manual | Utilities & Infrastructure | 9 |
Efficient transcript assembly and quantification from RNA-seq alignments using a network flow algorithm with optional guide annotation. | gpertea/stringtie | Transcriptomics | 8 |
Cvxpy Verified Use when working with CVXPY — a Python-embedded domain-specific language for convex optimization problems. Covers problem construction with the DCP (Disciplined Convex Programming) ruleset, all built- | cvxpy/cvxpy | Statistics | 14 |
Use when filtering or subsampling long reads (Oxford Nanopore, PacBio) by quality, length, or target coverage depth. Filtlong selects the best reads from a set rather than random reads, using quality | rrwick/Filtlong | QC & Preprocessing | 9 |
Seurat Verified Seurat — comprehensive R toolkit for single-cell genomics enabling QC, normalization (LogNormalize, SCTransform), feature selection, dimensionality reduction (PCA, UMAP, t-SNE), graph-based clustering | satijalab/seurat | Single-Cell | 15 |
rioxarray is a geospatial xarray extension powered by rasterio that adds a `.rio` accessor to DataArray and Dataset objects, enabling CRS management, reprojection, clipping, resampling, and GeoTIFF/Ne | corteva/rioxarray | Other | 8 |
nc-time-axis provides cftime-aware axis support for matplotlib, enabling the plotting of time series that use non-standard calendars (360_day, noleap, all_leap, julian, etc.) common in climate, weathe | SciTools/nc-time-axis | Other | 13 |
Crux Verified Use when working with Crux — the University of Washington Noble Lab's open-source toolkit for bottom-up tandem mass spectrometry (MS/MS) proteomics analysis. Covers the full peptide identification pip | limey-bean/CRUX_Creating-Reference-libraries-Using-eXisting-tools | Metagenomics | 13 |
PyDESeq2 Verified PyDESeq2 — Python implementation of DESeq2 for differential gene expression analysis from bulk RNA-seq count data. Performs size factor normalization, genewise dispersion estimation, Wald tests with B | owkin/PyDESeq2 | Transcriptomics | 11 |
M-CAP (Mendelian Clinically Applicable Pathogenicity) is a clinical-grade pathogenicity classifier for rare human missense variants. Developed at the Bejerano Lab (Stanford), it applies a gradient-boo | bejerano-lab/MCAP | Population Genetics | 12 |
openTSNE — modular Python library for t-distributed Stochastic Neighbor Embedding. Implements Barnes-Hut and FIt-SNE (interpolation-based) acceleration for scalable O(n log n) / near-linear t-SNE on l | pavlin-policar/openTSNE | Statistics | 13 |
Satpy is a Python library for reading, manipulating, and writing meteorological remote sensing data from various satellite instruments. It is part of the PyTroll project. | pytroll/satpy | Other | 8 |
sdm (R) is a reproducible framework for species distribution modelling (SDM) that unifies data preparation, multi-method model fitting, evaluation, ensemble forecasting, and spatial/temporal predictio | babaknaimi/sdm | Clinical Genomics | 11 |
Sentinel Hub is a cloud-based satellite imagery access and processing platform that provides programmatic access to Copernicus Sentinel, Landsat, MODIS, and commercial satellite data collections throu | sentinel-hub/sentinelhub-py | Visualization | 11 |
SourceTracker2 is a Bayesian microbial source-tracking tool that estimates source contributions to sink communities using Gibbs sampling. Use this skill for SourceTracker/SourceTracker2 workflows, mic | biota/sourcetracker2 | Other | 12 |
BCFtools Verified BCFtools — comprehensive toolkit for manipulating variant calls in VCF and BCF format. Provides variant calling (mpileup + call), filtering, querying, merging, concatenation, normalization, annotation | samtools/bcftools | Genomics | 9 |
Apollo Verified Apollo (Genome Architect) — collaborative, web-based genome annotation editor for manual curation of gene models. Embeds a JBrowse-based genome browser with real-time multi-user annotation editing via | GMOD/Apollo | Other | 10 |
BLAT (BLAST-Like Alignment Tool) is a rapid sequence alignment tool from the UCSC Genome Browser group for aligning mRNA, EST, or DNA sequences to a reference genome. Optimized for highly similar sequ | ucscGenomeBrowser/kent | Genomics | 9 |
ConsensusClusterPlus — R/Bioconductor package for determining cluster count and membership by stability evidence in unsupervised analysis, implementing the Monti et al. (2003) consensus clustering alg | manual | Statistics | 12 |
GATK (Genome Analysis Toolkit) Verified GATK (Genome Analysis Toolkit) — the industry-standard framework for variant discovery in high-throughput sequencing data. Covers the gatk command-line wrapper, Java/JVM tuning, resource bundle manage | broadinstitute/gatk | Genomics | 11 |
Molecular Transformer is an open-source sequence-to-sequence Transformer model (OpenNMT-py) for data-driven chemical reaction prediction and retrosynthesis planning. Trained on USPTO reaction datasets | pschwllr/MolecularTransformer | Drug Discovery | 13 |
nebula is an R package for large-scale differential gene expression (DGE) analysis of single-cell RNA-seq data using negative binomial mixed models (NBMM). Implements the NEBULA method (NEgative Binom | lhe17/nebula | Transcriptomics | 8 |
Open Data Cube (ODC) is an open-source geospatial data management and analysis framework that indexes Earth observation satellite imagery and gridded datasets into a PostgreSQL catalogue and exposes t | opendatacube/datacube-core | Other | 13 |
Use when working with Trycycler — a tool for generating consensus long-read genome assemblies from multiple assemblies of the same sample. Combines outputs from assemblers such as Flye, Miniasm, Raven | rrwick/Trycycler | Genomics | 9 |
Use when working with Convex.jl — a Julia package for Disciplined Convex Programming (DCP) that formulates and solves convex optimization problems with natural mathematical syntax. Covers problem cons | jump-dev/Convex.jl | Statistics | 13 |
rjags is the canonical R interface to JAGS (Just Another Gibbs Sampler), providing R functions for compiling BUGS-language models, running adaptive and burn-in phases, and drawing posterior samples as | andrewcparnell/rjags | Statistics | 11 |
iClusterPlus — Bioconductor R package for integrative clustering of multi-omics data. Performs joint latent variable modeling of multiple genomic data types (e.g., SNP copy number, methylation, gene e | manual | Systems Biology | 10 |
Query Reactome REST API for pathway analysis, enrichment, gene-pathway mapping, disease pathways, molecular interactions, expression analysis, for systems biology studies. | manual | Proteomics | 16 |
Spectral Verified Spectral Python (SPy) routing skill for hyperspectral image data representation, manipulation, and processing. Use this skill when users mention SPy, spectralpython, hyperspectral imagery, ENVI header | spectralpython/spectral | Other | 14 |
Xesmf Verified xESMF — Regrid geospatial and climate datasets between structured grids using ESMF (Earth System Modeling Framework). Use when working with climate model output (CMIP6, ERA5, CESM), satellite data, or | pangeo-data/xesmf | Other | 17 |