Browse the BioContext7 deep skill library. Tool pages surface documentation, registry links, and install details for agent-facing workflows.
2,064 tools — page 20 of 42
| Tool | Registry | Domain | Docs |
|---|---|---|---|
RevBayes — Bayesian phylogenetic inference and evolutionary analysis using graphical probabilistic models and the Rev scripting language. Performs Bayesian MCMC-based phylogenetic tree estimation, div | revbayes/revbayes | Phylogenetics | 10 |
Rosetta — comprehensive macromolecular modeling suite for protein structure prediction, protein design, docking, loop modeling, and enzyme engineering. Includes RosettaFold for deep-learning structure | RosettaCommons/rosetta | Structure Prediction | 10 |
SAbDab (Structural Antibody Database) and ANARCI antibody numbering toolkit. Query the curated database of all antibody structures from the PDB with consistent Chothia, IMGT, Kabat, and Martin numberi | oxpig/ANARCI | Structure Prediction | 10 |
Scanorama -- efficient batch correction and integration of heterogeneous single-cell RNA-seq datasets using panoramic stitching. Aligns and merges multiple scRNA-seq experiments via mutual nearest nei | brianhie/scanorama | Transcriptomics | 10 |
scCRISPR — single-cell multiomics framework for CRISPR perturbation analysis combining scRNA-seq and scATAC-seq readouts. Built on the MPAL-Single-Cell-2019 codebase from the Greenleaf Lab, it integra | GreenleafLab/MPAL-Single-Cell-2019 | Systems Biology | 10 |
scGen — deep learning framework for predicting single-cell perturbation responses using variational autoencoders. Predicts how cells respond to unseen conditions (drug treatment, genetic knockout, dis | theislab/scgen | Single-Cell | 10 |
Use when working with scPrediXcan — a TWAS (Transcriptome-Wide Association Study) framework that extends PrediXcan to single-cell resolution using sc-eQTL prediction models. Maps GWAS loci to cell-typ | hakyimlab/scPrediXcan | Population Genetics | 9 |
Use when working with scvi-hub — scvi-hub — the HuggingFace Hub integration | scverse/scvi-tools | Single-Cell | 9 |
Sickle — sliding-window quality trimming tool for short-read Illumina FASTQ data. Trims low-quality bases from 3' and 5' ends using a windowed adaptive algorithm. Supports paired-end (pe) and single-e | najoshi/sickle | QC & Preprocessing | 9 |
softImpute — R package for matrix completion via iterative soft-thresholded SVD with nuclear-norm regularization. Implements ALS and SVD algorithms for imputing missing entries in large incomplete mat | cran/softImpute | Population Genetics | 9 |
CmdStan — command-line interface to the Stan probabilistic programming language for Bayesian statistical modeling and high-performance inference. Compiles Stan programs (.stan) to C++ executables via | stan-dev/cmdstan | Clinical Genomics | 13 |
STdeconvolve — reference-free cell-type deconvolution for spatial transcriptomics using Latent Dirichlet Allocation (LDA). Decomposes multi-cellular spatial pixels into cell-type proportions and trans | JEFworks-Lab/STdeconvolve | Single-Cell | 9 |
TCGA/GDC tools — NCI Genomic Data Commons Data Transfer Tool and API client for downloading cancer genomics data from TCGA, TARGET, CGCI, and related programs. Provides high-performance file downloads | manual | Utilities & Infrastructure | 9 |
Use when performing Transcriptome-Wide Association Studies (TWAS) with TIGAR, a nonparametric Bayesian tool from Yang Lab for training gene expression prediction models and testing gene-trait associat | yanglab-emory/TIGAR | Population Genetics | 9 |
TileDB for dense and sparse multi-dimensional arrays, data management, provenance-aware storage, and cloud/hybrid object-store access. Use this skill when users ask about TileDB Embedded, TileDB-Py, a | TileDB-Inc/TileDB | Workflows | 9 |
TMB/MSI analysis tools — Tumor Mutational Burden (TMB) estimation and Microsatellite Instability (MSI) detection from paired tumor-normal sequencing data. Covers MSIsensor for MSI scoring from BAM fil | ding-lab/msisensor | Clinical Genomics | 9 |
Trackpy is a Python package for fast, Pythonic particle tracking in 2D and 3D image sequences. Implements the Crocker-Grier algorithm to locate bright features (particles, colloids, cells, droplets) a | soft-matter/trackpy | Imaging | 10 |
Use when working with TreeMix for population genetics admixture graph inference, including input preparation from PLINK or VCF files, running maximum-likelihood tree fitting with migration events, and | manual | Other | 9 |
ukbREST — a REST API server for UK Biobank (UKB) phenotype data that enables programmatic querying of biobank-scale phenotype fields via HTTP endpoints. Backed by PostgreSQL, it supports phenotype ext | hakyimlab/ukbrest | Clinical Genomics | 9 |
Weights & Biases (wandb) — ML experiment tracking, hyperparameter sweeps, artifact versioning, and collaborative model management platform. Log training metrics, gradients, system stats, and media (im | wandb/wandb | Machine Learning | 9 |
Xarray — N-dimensional labelled arrays and datasets for scientific Python. Provides a toolkit for reading, writing, and manipulating multi-dimensional arrays with labels (dimensions, coordinates, attr | pydata/xarray | Other | 9 |
X!Tandem — open-source proteomics search engine for matching tandem mass spectra (MS/MS) to peptide sequences in protein databases. Performs database searching with configurable scoring, supports post | mhiley/xtandem | Proteomics | 10 |
Tools and workflows for accessing, querying, and analyzing data from the 1000 Genomes Project (1KGP) — the largest public catalog of human genetic variation with whole-genome sequences from 3,202 indi | manual | Utilities & Infrastructure | 11 |
10x Genomics — commercial platform for single-cell and spatial genomics assays including Chromium single-cell RNA-seq (scRNA-seq), single-cell ATAC-seq, single-cell multiome (RNA+ATAC), Visium spatial | manual | Utilities & Infrastructure | 10 |
Arima-HiC mapping pipeline — shell-based workflow for aligning and processing Hi-C data generated with Arima Genomics proximity ligation kits. Maps paired-end FASTQ reads with BWA-MEM using Hi-C-speci | ArimaGenomics/mapping_pipeline | Genomics | 9 |
Benchmark and compare bioinformatics tools for a given analysis task using published benchmark data (accuracy, runtime, memory), PrecisionFDA and CAMI challenge results, bc7score community metrics, an | Hordago-Labs/biocontext7 | Transcriptomics | 11 |
Use when querying ClinGen — the Clinical Genome Resource — for expert-curated gene-disease validity classifications, variant pathogenicity interpretations, dosage sensitivity assessments, and clinical | manual | Utilities & Infrastructure | 10 |
Use when working with cram-tools — CRAM tools — reference-based compression | samtools/htslib | Utilities & Infrastructure | 10 |
dbSNP tools — NCBI toolkit for querying, downloading, and processing dbSNP variant data. Look up rsIDs via REST API, bulk-download VCF/JSON releases, annotate variant lists with allele frequencies and | ncbi/dbsnp | Utilities & Infrastructure | 9 |
EBSeq is an R/Bioconductor package implementing an empirical Bayes hierarchical model for identifying differentially expressed (DE) genes and isoforms in RNA-seq experiments. It supports both two-cond | lengning/EBSeq | Transcriptomics | 11 |
Ensembl Database — EMBL-EBI's comprehensive genome annotation database covering 300+ species with gene models, variants, regulatory features, and comparative genomics. Query via REST API at rest.ensem | manual | Genomics | 16 |
FD — CRAN R package by Laliberté, Legendre, and Shipley for measuring functional diversity (FD) from multiple traits. Implements the distance-based multidimensional FD framework (functional richness F | cran/FD | Other | 12 |
FirePy — ECMWF's Python toolbox for global wildfire risk forecasting and fire emissions analysis. Covers the AI-Probability-of-Fire (PoF) XGBoost framework for 1 km / 10-day fire danger probability, C | ecmwf/fire-python | Other | 9 |
FlyBase is the primary model-organism knowledgebase for Drosophila melanogaster and related Drosophila species. Use this skill when users need FlyBase gene records, allele and phenotype data, transgen | manual | Utilities & Infrastructure | 9 |
Franklin — web-based clinical variant interpretation platform by Genoox for ACMG/AMP-based pathogenicity classification of SNVs, indels, and CNVs. Provides a community-curated variant database, REST A | WGLab/InterVar | Clinical Genomics | 11 |
GA4GH Schemas — the Global Alliance for Genomics and Health data models and APIs for standardized genomic data exchange. Defines Protobuf/JSON schemas for Reads (SAM-equivalent), Variants (VCF-equival | ga4gh/ga4gh-schemas | Utilities & Infrastructure | 9 |
Getz Lab tools (ABSOLUTE, MuTect/MuTect2) from the Broad Institute for cancer genomics. ABSOLUTE infers tumor purity, ploidy, and absolute copy numbers from segmented copy number data. MuTect2 (GATK4) | broadinstitute/ABSOLUTE | Genomics | 10 |
htsget — GA4GH REST streaming API for accessing genomic reads and variants from remote servers. Fetch BAM, CRAM, VCF, or BCF data by region using HTTP ticket-based retrieval. Supports Bearer token aut | ga4gh/htsget | Utilities & Infrastructure | 9 |
Hymenoptera Genome Database (HGD) — community resource for ant, bee, and wasp genome sequences, gene annotations, and comparative genomics. Provides BLAST search, genome browsers (JBrowse), gene expre | Hordago-Labs/biocontext7 | Genomics | 10 |
INSaFLU-TELEVIR is a web-based (and locally installable) bioinformatics suite for viral genomic surveillance and metagenomics-based virus detection. It processes Illumina, Ion Torrent, and Oxford Nano | INSaFLU/INSaFLU | Metagenomics | 11 |
UCSC Kent Utilities — command-line toolkit for genome coordinate conversion (liftOver), binary genome formats (.2bit, BigBed, BigWig), BLAT alignment, annotation format interconversion (GTF↔GenePred), | ucscGenomeBrowser/kent | Genomics | 10 |
Microreact — web-based visualization platform for genomic epidemiology that combines interactive phylogenetic trees, geographic maps, and timelines with metadata tables synchronized through client-sid | nickloman/microreact | Clinical Genomics | 12 |
MIDORI Reference 2 (MIDORI2) is a curated reference database of eukaryotic mitochondrial DNA and amino acid sequences used for taxonomic assignment of environmental and metabarcoding samples. It provi | VascoElbworthy/MIDORI | Metagenomics | 11 |
MissForest is a non-parametric imputation method for handling missing values in datasets with mixed-type data (continuous and categorical) based on Random Forests. It iteratively imputes missing value | stekhoven/missForest | Population Genetics | 11 |
mvabund is an R package providing a model-based approach to analyzing multivariate abundance data in ecology. It offers tools for data visualization, fitting predictive generalized linear models (GLMs | eco-stats/mvabund | Other | 13 |
PLIP (Protein-Ligand Interaction Profiler) — rule-based detection, profiling, and visualization of non-covalent protein-ligand interactions in 3D structures. Identifies hydrogen bonds, hydrophobic con | pharmai/plip | Proteomics | 10 |
RASQUAL (Robust Allele Specific QUantification and quality controL) — joint QTL mapping tool that integrates total read counts and allele-specific reads for cis-regulatory variant discovery. Detects i | natsuhiko/rasqual | Population Genetics | 11 |
Scrublet — Python toolkit for computational identification of cell doublets in single-cell RNA-seq data. Simulates synthetic doublets from observed transcriptomes and scores cells via k-nearest-neighb | swolock/scrublet | Single-Cell | 11 |
Search publications for a bioinformatics tool — find peer-reviewed papers that use, cite, or benchmark a tool using PubMed E-utilities, CrossRef REST API, and OpenAlex scholarly graph. Aggregates cita | manual | Utilities & Infrastructure | 11 |
UCSC Table Browser API — programmatic access to the UCSC Genome Browser's track data via REST endpoints. Retrieve genomic sequences, annotation tracks, gene models, conservation scores, and regulatory | ucscGenomeBrowser/kent | Genomics | 10 |