Browse the BioContext7 deep skill library. Tool pages surface documentation, registry links, and install details for agent-facing workflows.
2,064 tools — page 25 of 42
| Tool | Registry | Domain | Docs |
|---|---|---|---|
dittoSeq — universal single-cell and bulk RNA-seq visualization for R. Create UMAP/tSNE dim plots, bar plots, scatter plots, heatmaps, bubble plots, violin plots, and ridge plots from Seurat, SingleCe | dtm2451/dittoSeq | Visualization | 8 |
DKFZ OTP (One Touch Pipeline) — the German Cancer Research Center ODCF bioinformatics platform for large-scale genomic sequencing data management and analysis. Provides Roddy-based workflows for align | DKFZ-ODCF/OTP | Utilities & Infrastructure | 8 |
DMRcate — R/Bioconductor package for identifying differentially methylated regions (DMRs) from Illumina Infinium arrays (450K, EPIC, EPICv2) and whole genome bisulfite sequencing (WGBS/RRBS) data. Use |
| Hordago-Labs/biocontext7 |
| Epigenomics |
| 8 |
Use when working with DoRothEA or CollecTRI for transcription factor activity inference, gene regulatory network analysis, or regulon-based enrichment in single-cell or bulk RNA-seq data. DoRothEA pro | saezlab/dorothea | Systems Biology | 8 |
DOSE — Disease Ontology Semantic and Enrichment analysis R/Bioconductor package for gene set enrichment against disease ontologies. Performs over-representation analysis (ORA) and gene set enrichment | YuLab-SMU/DOSE | Transcriptomics | 8 |
DoubletFinder — R package for detecting neotypic doublets in single-cell RNA-seq data using artificial nearest neighbor (pANN) scoring. Interfaces with Seurat objects (v2–v5) to identify heterotypic d | chris-mcginnis-ucsf/DoubletFinder | Transcriptomics | 8 |
dplyr — grammar of data manipulation R package from the tidyverse. Provides consistent, expressive verbs for filtering rows, selecting columns, mutating values, summarising groups, and joining tables. | tidyverse/dplyr | Utilities & Infrastructure | 8 |
Use when working with dream, variancePartition differential expression in R, repeated measures RNA-seq analysis, mixed model differential expression, or linear mixed model testing across biological re | GabrielHoffman/variancePartition | Transcriptomics | 9 |
dreamlet is a Bioconductor R package for scalable pseudobulk differential expression analysis of single-cell and spatial RNA-seq data using mixed models. It handles repeated measures and complex desig | GabrielHoffman/dreamlet | Transcriptomics | 8 |
Use when working with DropletUtils for processing droplet-based single-cell RNA-seq data from 10x Genomics CellRanger output. DropletUtils provides read10xCounts() for loading MEX/HDF5 output, emptyDr | MarioniLab/DropletUtils | Single-Cell | 8 |
Use when working with DSS (Dispersion Shrinkage for Sequencing) — a Bioconductor R package for differential methylation analysis of bisulfite sequencing data (WGBS, RRBS). Covers the full DSS pipeline | haowulab/DSS | Epigenomics | 8 |
Duplex-Seq-Pipeline (duplex-seq-tools) is a Snakemake-based pipeline for ultra-accurate error-corrected sequencing using duplex sequencing (DS). Removes PCR and sequencing errors by collapsing read fa | Kennedy-Lab-UW/Duplex-Seq-Pipeline | Genomics | 8 |
EAGER2 / nf-core/eager — reproducible Nextflow pipeline for ancient DNA (aDNA) and palaeogenomics. Handles adapter trimming with AdapterRemoval2, read merging, low-complexity filtering, mapping with B | nf-core/eager | Genomics | 8 |
eCAVIAR — GWAS and eQTL colocalization tool that computes the Colocalization Posterior Probability (CLPP) for each variant to identify shared causal variants across two traits. Extends CAVIAR to joint | fhormoz/caviar | Population Genetics | 8 |
ECMWF API Client — Python library for programmatic access to the European Centre for Medium-Range Weather Forecasts (ECMWF) data archives. Enables retrieval of weather, climate, and atmospheric compos | ecmwf/ecmwf-api-client | Other | 8 |
eemont extends the Google Earth Engine Python API with 250+ spectral index computation (spectralIndices), automatic cloud masking (maskClouds), scale and offset correction (scaleAndOffset), time serie | davemlz/eemont | Other | 8 |
elPrep is a high-performance, multi-pass-fused preprocessing tool for SAM/BAM/CRAM files in NGS variant-calling pipelines. Use when users need single-pass duplicate marking, base quality score recalib | ExaScience/elprep | Utilities & Infrastructure | 8 |
EMBL-EBI (European Bioinformatics Institute) — access bioinformatics databases and analysis services from EMBL-EBI including EBI Search for cross-database discovery, EMBL-EBI Tools Job Dispatcher for | manual | Utilities & Infrastructure | 8 |
ENCODE DCC ATAC-seq pipeline — ENCODE consortium-standard WDL/Cromwell pipeline (run via Caper) for processing bulk ATAC-seq data from FASTQ through alignment, deduplication, peak calling (MACS2), IDR | ENCODE-DCC/atac-seq-pipeline | Epigenomics | 8 |
Enformer — transformer-based deep learning model from DeepMind that predicts gene expression, chromatin accessibility, histone modifications, and TF binding from 196,608 bp raw DNA sequence at 128 bp | google-deepmind/deepmind-research | Machine Learning | 8 |
ENIGMA Toolbox — Python/MATLAB framework for cross-disorder neuroimaging and genomics analysis using ENIGMA consortium summary statistics. Loads precomputed subcortical and cortical disorder effect si | MICA-MNI/ENIGMA | Systems Biology | 8 |
Ensembl Regulation (ensembl-funcgen) is the EMBL-EBI framework for annotating functional regulatory genomic elements including regulatory features, enhancers, promoters, transcription factor binding s | Ensembl/ensembl-funcgen | Utilities & Infrastructure | 8 |
Ensembl REST API — HTTP/JSON web service for programmatic access to Ensembl genomic data without installing local databases. Supports gene/transcript/protein lookup by stable ID or gene symbol, sequen | Ensembl/ensembl-rest | Utilities & Infrastructure | 8 |
Use when working with Ensembl VEP (Variant Effect Predictor) to annotate SNPs, indels, and structural variation with gene, transcript, and protein-level consequences. Supports VCF, HGVS, and coordinat | Ensembl/ensembl-vep | Genomics | 8 |
Ensembl Vet Tools — bioinformatics resources for veterinary and livestock genomics using the Ensembl infrastructure. Covers genome browsing for FAO Priority Livestock Species (cattle, pig, chicken, sh | manual | Other | 8 |
Use when working with Epiclomal — a Python tool for probabilistic clustering of sparse single-cell DNA methylation data to infer epigenetic clonal structure. Covers the full Epiclomal pipeline: prepar | shahcompbio/epiclomal | Epigenomics | 8 |
Use when working with eQTL Catalogue — a uniformly processed, open-access resource of expression quantitative trait loci (eQTL) summary statistics from 100+ studies across bulk tissues, primary cell t | eQTL-Catalogue/eQTLcatalogueR | Single-Cell | 9 |
ESM-2 — Meta AI protein language model family (8M–15B parameters) for sequence embedding, contact prediction, and structure prediction via ESMFold. Generates per-residue and per-sequence representatio | facebookresearch/esm | Proteomics | 8 |
ESM-2/ESM-IF — Meta AI protein language models for embeddings, structure prediction, and inverse folding. ESM-2 provides learned representations (up to 15B parameters) for zero-shot variant effect pre | facebookresearch/esm | Structure Prediction | 8 |
ESM-1v zero-shot variant-effect prediction for protein mutations from sequence. Use when users ask for ESM1v, mutation effect scoring, deep mutational scan labeling, or Meier et al. 2021 workflows wit | facebookresearch/esm | Genomics | 8 |
Use when predicting variant pathogenicity with EVE (Evolutionary model of Variant Effect) — an unsupervised deep generative model from the Marks Lab (OATML/Harvard) that scores missense variants using | OATML-Markslab/EVE | Genomics | 8 |
Use when working with EvoAug -- a PyTorch library for evolution-inspired data augmentation of DNA sequences for training genomic deep learning models. EvoAug applies biologically motivated transforms: | p-koo/evoaug | Genomics | 8 |
Use when working with ExpansionHunter — Illumina's tool for genotyping short tandem repeats (STRs) and repeat expansions from PCR-free whole-genome sequencing data. Detects pathogenic repeat expansion | Illumina/ExpansionHunter | Clinical Genomics | 8 |
FarmCPU (Fixed and random model Circulating Probability Unification) — fast, powerful GWAS method for plant and agricultural genomics, implemented in the rMVP R package. Performs genome-wide associati | xiaolei-lab/rMVP | Other | 8 |
Use when performing Bayesian colocalization analysis between GWAS and eQTL data using fastENLOC (fast enrichment estimation-based co-localization). fastENLOC integrates eQTL probabilistic annotations | xqwen/fastenloc | Population Genetics | 8 |
Use when mapping expression quantitative trait loci (eQTL), splicing QTL (sQTL), or protein QTL (pQTL) with FastQTL. Covers nominal pass, permutation pass, conditional analysis, beta-approximation FDR | francois-a/fastqtl | Population Genetics | 8 |
Fiji/ImageJ — batteries-included distribution of ImageJ2 for scientific bioimage analysis. Batch macro scripting for cell counting, fluorescence quantification, and particle tracking. Bio-Formats supp | fiji/fiji | Imaging | 8 |
Fiji/ImageJ — open-source scientific image analysis platform for life sciences. Provides image segmentation, particle analysis, cell tracking, colocalization, 3D visualization, and batch processing of | fiji/fiji | Imaging | 8 |
Use when working with the rfishbase R package to query the FishBase database for fish species records, ecology, morphology, length-weight parameters, trophic levels, taxonomy, distribution, and reprod | ropensci/rfishbase | Metagenomics | 8 |
FLAMES — Full-Length Analysis of Mutations and Splicing for bulk and single-cell long-read RNA-seq data. R/Bioconductor pipeline for barcode demultiplexing, isoform discovery, transcript quantificatio | OliverVoogworthy/FLAMES | Transcriptomics | 8 |
FlowSOM — high-performance self-organizing map (SOM) clustering for flow cytometry and mass cytometry (CyTOF) data. Build a SOM grid from multi-parameter single-cell marker expression, then apply cons | SofieVG/FlowSOM | Single-Cell | 8 |
Use when performing TWAS fine-mapping or colocalization with FOCUS (Fine-mapping Of CaUsal Sets) or MA-FOCUS (Multiple Ancestry). Covers focus finemap, focus munge, focus import, posterior inclusion p | mancusolab/ma-focus | Population Genetics | 8 |
FoodMicrobionet routing skill for food microbiome network analysis, food microbial ecology, and co-occurrence network construction from 16S rRNA amplicon data. Use this skill when users mention FoodMi | epsiloneridani/FoodMicrobionet | Other | 8 |
Use this skill for FPSim2, the RDKit-based Python toolkit for high-throughput molecular fingerprint similarity search. Covers fingerprint database creation with fpsim2-create-db or create_db_file, in- | chembl/FPSim2 | Drug Discovery | 8 |
Funannotate is a eukaryotic genome annotation pipeline that integrates ab initio gene prediction, homology-based evidence, and functional annotation. Use when users ask about funannotate, eukaryotic g | nextgenusfs/funannotate | Genomics | 8 |
FUNGuild — ecological guild annotation tool for fungal OTU/ASV tables from amplicon metagenomics. Assigns trophic modes (Saprotrophic, Pathotroph, Symbiotroph) and ecological guilds (Ectomycorrhizal, | UMNFuN/FUNGuild | Metagenomics | 8 |
GA4GH Beacon — federated genomic variant discovery API standard developed by the Global Alliance for Genomics and Health. Query whether a specific genomic variant (SNV, indel, CNV, structural variant) | ga4gh-beacon/beacon-v2 | Clinical Genomics | 8 |
GA4GH VRS (Variation Representation Specification) is a standard and Python library for representing biological sequence variants in a computable, interoperable form. Generates globally unique GA4GH V | ga4gh/vrs | Clinical Genomics | 8 |
Use this skill for GATK copy-number variation workflows including germline gCNV cohorts and somatic copy-ratio segmentation from WES or WGS data. Trigger on requests about CollectReadCounts, Determine | broadinstitute/gatk | Genomics | 8 |
Use when working with GATK MarkDuplicates, Picard MarkDuplicates through the GATK wrapper, duplicate marking in BAM or CRAM files, duplicate metrics, or optical duplicate detection in short-read DNA s | broadinstitute/gatk | Utilities & Infrastructure | 8 |