Browse the BioContext7 deep skill library. Tool pages surface documentation, registry links, and install details for agent-facing workflows.
2,064 tools — page 3 of 42
| Tool | Registry | Domain | Docs |
|---|---|---|---|
Orfeo Toolbox Verified Orfeo ToolBox (OTB) is an open-source library for remote sensing image processing. It provides a wide range of state-of-the-art algorithms for high-resolution optical, multispectral, and radar images. | orfeotoolbox/OTB | Other | 15 |
bioio — plugin-based Python library for reading and writing biological microscopy and imaging files. Supports OME-TIFF, CZI (Zeiss), LIF (Leica), nd2 (Nikon), TIFF, PNG, and many more formats via form | bioio-devs/bioio | Imaging | 10 |
Search, discover, and download climate model data (CMIP6, CORDEX) from the Earth System Grid Federation. Access global datasets programmatically using the ESGF Search API via pyesgf. Handles authentic
| ESGF/esgf-pyclient |
| Utilities & Infrastructure |
| 9 |
Latch SDK — Python-native bioinformatics workflow platform for registering, executing, and sharing reproducible workflows on cloud infrastructure. Define workflows with @workflow and @task decorators, | latchbio/latch | Workflows | 10 |
Use when working with moveVis — an R package designed to visualize movement data (e.g., GPS tracking from animals or vehicles) and synchronous environmental data (e.g., remote sensing) through video a | 16EAGLE/moveVis | Other | 7 |
Haploflow is a C++ de novo assembler for strain-resolved viral haplotype reconstruction from mixed quasispecies sequencing data. It applies minimum-flow decomposition on a de Bruijn graph to recover i | hzi-bifo/Haploflow | Metagenomics | 12 |
matUtils Verified Use when working with matUtils — a toolkit for querying, interpreting, and manipulating mutation-annotated trees (MATs) in protobuf format. Part of the UShER project. Supports subcommands for tree sum | yatisht/usher | Phylogenetics | 10 |
odgi Verified Use when working with odgi — the optimized dynamic genome/graph implementation toolkit — for pangenome variation graph manipulation, sorting, visualization, and analysis. Converts GFA graphs to effici | pangenome/odgi | Genomics | 10 |
RepeatModeler Verified RepeatModeler — automated de novo repeat family identification and modeling pipeline for genomic sequences. Combines RECON and RepeatScout for consensus building, with optional LTR structural detectio | Dfam-consortium/RepeatModeler | Genomics | 11 |
RTG Tools Verified RTG Tools — Java-based toolkit from Real Time Genomics for haplotype-aware variant call comparison, VCF filtering, statistics, and pedigree analysis. Primary use: vcfeval for benchmarking variant call | RealTimeGenomics/rtg-tools | Genomics | 11 |
SeqKit Verified SeqKit — cross-platform ultrafast toolkit for FASTA/Q file manipulation. 38 subcommands for sequence statistics (stats), filtering by length or quality (seq), pattern searching with mismatches (grep, | shenwei356/seqkit | Genomics | 11 |
T Sne Verified t-SNE (t-distributed Stochastic Neighbor Embedding) for nonlinear dimensionality reduction and visualization of high-dimensional data. Covers Barnes-Hut and exact modes, FIt-SNE/openTSNE interpolation | jkrijthe/Rtsne | Transcriptomics | 16 |
ThermoRawFileParser Verified ThermoRawFileParser — cross-platform converter for Thermo Fisher RAW mass spectrometry files to open formats (mzML, MGF, Parquet). Supports batch conversion, metadata extraction, native peak picking, | compomics/ThermoRawFileParser | Proteomics | 10 |
Direct REST API access to UniProt (250M+ protein sequences, 570K+ reviewed Swiss-Prot entries). Protein search, FASTA/JSON retrieval, ID mapping across 200+ databases, batch retrieval, streaming, fiel | multimeric/Unipressed | Proteomics | 17 |
UShER Verified Use when working with UShER (Ultrafast Sample placement on Existing tRees) for phylogenetic placement of new samples onto existing mutation-annotated trees. Places sequences via maximum parsimony on l | yatisht/usher | Phylogenetics | 10 |
vg Verified Use when working with vg — the variation graph toolkit — for pangenome graph construction, read mapping to variation graphs, and graph-aware variant calling. Builds genome graphs from VCF + reference | vgteam/vg | Genomics | 10 |
Use when working with PolyFun — functionally-informed polygenic fine-mapping from GWAS summary statistics. Computes per-SNP heritability priors using S-LDSC with functional annotations, then runs SuSi | omerwe/polyfun | Population Genetics | 10 |
Use when working with poplddecay — popLDdecay — fast, memory-efficient | BGI-shenzhen/PopLDdecay | Population Genetics | 10 |
Use when working with SNeP — a tool for estimating effective population size (Ne) trajectories from genome-wide SNP linkage disequilibrium (LD) data. Reconstructs historical Ne by inverting the LD-Ne | jbarbero/SNeP | Other | 10 |
COMETS (Computation Of Microbial Ecosystems in Time and Space) — dynamic flux balance analysis (dFBA) platform for simulating individual microbes and multi-species microbial communities in spatially-s | segrelab/comets | Other | 11 |
GRITS Toolbox is a free, open-source desktop application for processing, annotating, and archiving glycomics mass spectrometry data. It automates MS/MS spectral annotation of glycan fragments against | manual | Proteomics | 11 |
HyenaDNA long-context genomic foundation model workflows for sequence classification, embedding extraction, and long-range regulatory analysis. Includes checkpoint selection by context window (1k to 1 | HazyResearch/hyena-dna | Genomics | 10 |
PDAL (Point Data Abstraction Library) — C/C++ toolkit for translating, filtering, and processing point cloud data from LiDAR, photogrammetry, and depth sensors. Provides JSON pipeline architecture wit | PDAL/PDAL | Imaging | 9 |
Nextstrain — an open-source project to harness the scientific and public health potential of pathogen genome data. Provides a continually-updated view of publicly available data alongside powerful ana | nextstrain/cli | Clinical Genomics | 7 |
SEACells — single-cell metacell identification using kernel archetypal analysis. Aggregates highly similar single cells into metacells that preserve transcriptional and epigenomic heterogeneity while | dpeerlab/SEACells | Single-Cell | 10 |
The 'spocc' (Species Occurrence Data) R package provides a unified interface for querying and retrieving species occurrence data from multiple major biodiversity databases (GBIF, iNaturalist, eBird, O | ropensci/spocc | Utilities & Infrastructure | 10 |
SQANTI3 (Structural and Quality Annotation of Novel Transcript Isoforms) is a tool for the quality control, structural characterization, and curation of long-read defined transcriptomes. It classifies | ConesaLab/SQANTI3 | Transcriptomics | 9 |
Toxcast Verified Programmatic access to the U.S. EPA ToxCast and Tox21 high-throughput screening (HTS) bioactivity data via the CTX APIs. Use to query chemical hazard profiles, AC50 values, and hit-calls across thousa | manual | Other | 17 |
Xorca Verified xorca (eXtensible ORCA Grid Adapter) preprocesses NEMO ocean model NetCDF outputs into XGCM-compatible Xarray datasets for grid-aware operations on curvilinear ORCA grids. Use when working with NEMO s | willirath/xorca | Other | 16 |
Estimate Compute Requirements Verified estimate-compute-requirements — predict CPU, memory, wall time, and storage for bioinformatics pipelines given input data size and tool selection. Generates per-step resource tables, platform-specific | manual | Single-Cell | 15 |
Use for deterministic conversion of workflow specifications into Nextflow DSL2, Snakemake 8+, WDL 1.0, and CWL v1.2 pipelines with container-aware resource routing and validation. | Hordago-Labs/biocontext7 | Utilities & Infrastructure | 9 |
cftime — Python library for CF-convention date/time handling and non-standard calendar arithmetic. Decode and encode NetCDF time coordinates using num2date, date2num, num2pydate, date2index, and the c | Unidata/cftime | Other | 8 |
Use when working with Nebulosa, the R/Bioconductor package for recovering single-cell gene expression signals via kernel density estimation (KDE). Covers plot_density(), weighted KDE (WKDE), joint den | powellgenomicslab/Nebulosa | Single-Cell | 8 |
NetCoMi (Network Construction and Comparison for Microbiome Data) — R package for constructing, analyzing, and comparing microbial association and dissimilarity networks from compositional count data. | stefpeschel/NetCoMi | Other | 9 |
Infernal Verified Infernal — RNA covariance model (CM) toolkit for homology search and structural alignment of RNA sequences. Builds profile CMs from Stockholm alignments (cmbuild), calibrates E-values (cmcalibrate), s | EddyRivasLab/infernal | Structure Prediction | 9 |
MRtrix3 Verified MRtrix3 — open-source neuroimaging toolkit for diffusion MRI (dMRI) analysis, including DWI preprocessing, fiber orientation distribution (FOD) estimation, probabilistic and deterministic tractography | MRtrix3/mrtrix3 | Imaging | 10 |
Bio Context Lookup Verified Look up documentation and code examples for bioinformatics tools using BioContext7 | manual | Transcriptomics | 10 |
A Python library for verifying weather and climate forecasts. | pangeo-data/climpred | Other | 8 |
ABySS — de novo genome assembler for short paired-end reads and genomes of all sizes. Uses Bloom filter mode for memory-efficient assembly (10x reduction vs hash table). Supports paired-end, mate-pair | bcgsc/abyss | Genomics | 10 |
Use when analyzing fMRI or MRI neuroimaging data with AFNI (Analysis of Functional NeuroImages). Covers full preprocessing pipelines, GLM-based activation analysis, group statistics, registration, sku | afni/afni | Imaging | 9 |
AGAT (Another Gff Analysis Toolkit) — comprehensive Perl toolkit for reading, writing, validating, fixing, and manipulating genome annotation files in GFF3, GTF, and GFF2 formats. Converts between for | NBISweden/AGAT | Genomics | 9 |
Use when working with Aldy — a pharmacogenomics star-allele genotyper for pharmacogenes such as CYP2D6, CYP2C19, CYP2C9, DPYD, TPMT, G6PD, and others. Aldy detects allelic decomposition and performs e | inumanag/aldy | Drug Discovery | 9 |
alevin-fry — fast, accurate, and memory-frugal Rust-based tool for single-cell and single-nucleus RNA-seq quantification. Processes RAD (Reduced Alignment Data) files from salmon alevin to generate ce | COMBINE-lab/alevin-fry | Transcriptomics | 10 |
AlphaPept — Python-based, open-source proteomics pipeline for DDA mass spectrometry data analysis. Provides feature detection, peptide identification via database search, deep learning-based scoring, | MannLabs/alphapept | Proteomics | 10 |
AlphaTims — open-source Python package for fast indexing, slicing, and visualization of unprocessed Bruker timsTOF Pro LC-TIMS-Q-TOF mass spectrometry data. Loads raw .d folders into a sparse five-dim | MannLabs/alphatims | Proteomics | 11 |
AnnotSV — integrated tool for structural variation annotation and ranking. Annotates DEL, DUP, INS, INV, and translocation SVs from VCF or BED input against 30+ databases (gnomAD-SV, DGV, ClinVar, OMI | lgmgeo/AnnotSV | Genomics | 10 |
Anvi'o — integrated analysis and visualization platform for 'omics data. Supports metagenomics (MAG binning, refinement, read recruitment), pangenomics (gene cluster analysis, ANI computation), phylog | merenlab/anvio | Metagenomics | 9 |
Artemis and ACT (Artemis Comparison Tool) — Java-based genome browser and annotation suite from the Wellcome Sanger Institute for viewing, creating, and editing sequence features and annotations in EM | sanger-pathogens/Artemis | Utilities & Infrastructure | 10 |
ascatNgs — allele-specific copy number analysis of tumour and matched normal whole-genome sequencing data. Implements the ASCAT algorithm for NGS data, estimating tumour purity, ploidy, and allele-spe | cancerit/ascatNgs | Clinical Genomics | 10 |
bamAlignCleaner — removes unaligned references from BAM/CRAM alignment files. Cleans alignment files by filtering out reference sequences with no mapped reads, reducing file size and speeding downstre | maxulysse/bamaligncleaner | Genomics | 11 |