Browse the BioContext7 deep skill library. Tool pages surface documentation, registry links, and install details for agent-facing workflows.
2,064 tools — page 31 of 42
| Tool | Registry | Domain | Docs |
|---|---|---|---|
Trim Galore — convenience wrapper around Cutadapt and FastQC for consistent, automated adapter trimming, quality filtering, and post-trimming QC of high-throughput sequencing reads. Auto-detects Illum | FelixKrueger/TrimGalore | Transcriptomics | 8 |
TriTrypDB is a VEuPathDB genomics platform for kinetoplastid parasites including Trypanosoma and Leishmania species. Use this skill for gene discovery, functional annotation lookup, orthology queries, | manual | Utilities & Infrastructure | 8 |
TriTrypDB — integrated functional genomics resource for kinetoplastid parasites (Trypanosoma, Leishmania, and related species). Part of the VEuPathDB Bioinformatics Resource Center, hosting 83+ annota |
| manual |
| Utilities & Infrastructure |
| 8 |
tRNAscan-SE — genome-wide tRNA gene prediction tool that uses covariance models via Infernal to identify, score, and classify transfer RNA genes in eukaryotic, bacterial, archaeal, and mitochondrial s | UCSC-LoweLab/tRNAscan-SE | Structure Prediction | 8 |
Use when working with Bioconductor TxDb annotation packages, especially TxDb.Hsapiens.UCSC.hg38.knownGene, GenomicFeatures transcript models, UCSC knownGene annotations, gene and transcript coordinate | manual | Utilities & Infrastructure | 8 |
Use when working with UCell, an R/Bioconductor package for gene-set enrichment scoring in single-cell RNA-seq data using the AUCell rank-based method. Computes per-cell UCell scores for custom gene si | carmonalab/UCell | Transcriptomics | 8 |
Use when working with UINMF in `rliger` for mosaic single-cell integration with shared and unshared features. UINMF extends LIGER's iNMF workflow to keep modality-specific genes, intergenic accessibil | welch-lab/liger | Single-Cell | 8 |
UMAP (Uniform Manifold Approximation and Projection) — nonlinear dimensionality reduction for visualization and general-purpose embedding. Constructs fuzzy simplicial complex from high-dimensional dat | manual | Statistics | 8 |
UpSetR is an R package for visualizing intersections of multiple sets using UpSet plots — a scalable alternative to Venn diagrams for 3+ sets. Use when you need to compare membership across gene lists | hms-dbmi/UpSetR | Imaging | 8 |
Use uwot for R-native UMAP and related manifold learning methods (`umap`, `umap2`, `tumap`, `lvish`, and `umap_transform`) on dense, sparse, and distance inputs. Route this skill when users ask for no | jlmelville/uwot | Statistics | 8 |
Use when working with VADR, the NCBI Viral Annotation DefineR suite for validating, classifying, and annotating viral genome sequences against VADR model libraries. Covers `v-scan.pl` for library-awar | ncbi/vadr | Metagenomics | 8 |
Use when working with varbvs, the R package for large-scale Bayesian variable selection in genome-wide association studies (GWAS). Covers fitting variational Bayes models for continuous and binary phe | pcarbo/varbvs | Population Genetics | 8 |
Use when working with the R/Bioconductor package variancePartition for quantifying the contribution of biological and technical factors to gene expression variability, fitting linear mixed models to R | GabrielHoffman/variancePartition | Transcriptomics | 8 |
VARITY -- machine-learning variant effect predictor for missense variants. Predicts pathogenicity of single amino acid substitutions using gradient boosting trained on ClinVar. Provides two models: VA | nmaus/varity | Proteomics | 8 |
Use when working with VAST-TOOLS for alternative splicing analysis in RNA-seq data. Routes to references and scripts for VASTDB setup, splicing event quantification (PSI/PIR), differential splicing wi | vastgroup/vast-tools | Transcriptomics | 8 |
VectorBase — bioinformatics resource for invertebrate vector genomics providing gene search, BLAST, genome browsing (JBrowse), population biology mapping (MapVEu), and REST API access for arthropod ve | manual | Utilities & Infrastructure | 8 |
velocyto.R skill for RNA velocity analysis in single-cell RNA-seq data. Use when users ask about velocyto.R, RNA velocity in R, deterministic velocity models, or working with .loom files containing sp | velocyto-team/velocyto.R | Single-Cell | 9 |
Use when working with VILOCA, the viral haplotype reconstruction and mutation calling tool for short-read and long-read sequencing data. Covers the `viloca` CLI, `run` and `snv` subcommands, quality-s | cbg-ethz/VILOCA | Metagenomics | 8 |
Use when working with viral-ngs, Broad Institute viral genomics tooling, viral sequencing pipelines, pathogen read depletion, metagenomic classification, viral assembly, or intrahost/interhost variant | broadinstitute/viral-ngs | Genomics | 8 |
VISION (Viewpoint Informed Sequencing Analysis of Networks) is an R package for interpreting single-cell RNA-seq data by linking gene expression signatures to cell metadata through an interactive web | YosefLab/VISION | Single-Cell | 8 |
Vitessce — interactive web-based visualization tool for exploration of multimodal and spatially resolved single-cell data. Use for configuring Jupyter widget layouts with VitessceConfig, loading OME-T | vitessce/vitessce | Imaging | 8 |
Use this skill for wallace workflows in R when you need a Shiny-based platform for reproducible species niche and distribution modeling, including occurrence-data retrieval, WorldClim environmental la | wallaceEcoMod/wallace | Other | 8 |
WeatherBench 2 is a benchmarking framework for next-generation data-driven global weather models. It provides tools for accessing re-forecast and observational data, computing performance metrics (RMS | google-research/weatherbench2 | Machine Learning | 8 |
xcms preprocesses LC-MS and GC-MS data in Bioconductor workflows. Use it for peak detection, retention-time alignment, feature grouping, and gap-filling when you need reproducible feature-by-sample in | sneumann/xcms | Workflows | 8 |
ZFIN (Zebrafish Information Network) — the model organism database for Danio rerio (zebrafish). Provides curated gene records, expression data (in situ hybridization, antibody staining), mutant and tr | rtaylorzfin/zfin | Utilities & Infrastructure | 8 |
ZINB-WaVE (Zero-Inflated Negative Binomial Wave) for dimensionality reduction and normalization of single-cell RNA-seq data. Fits a zero-inflated negative binomial (ZINB) model to scRNA-seq count matr | drisso/zinbwave | Single-Cell | 8 |
Use when working with clinvar-tools — clinVar tools — NCBI utilities | ncbi/clinvar | Clinical Genomics | 9 |
EGA tools — EMBL-EBI European Genome-phenome Archive client and submission utilities. Covers encrypted download (pyEGA3), file encryption (EGA Cryptor), metadata schema validation, submission API inte | EGA-archive/ega-download-client | Utilities & Infrastructure | 9 |
generate-cwl-from-nextflow — cross-compile Nextflow DSL2 process and workflow definitions into CWL v1.2 CommandLineTool and Workflow YAML documents for portable execution on CWL-compatible platforms ( | common-workflow-language/cwltool | Utilities & Infrastructure | 11 |
GSEA — Gene Set Enrichment Analysis determines whether a priori defined gene sets show statistically significant, concordant differences between two biological states. Computes enrichment scores via a | ctlab/fgsea | Transcriptomics | 10 |
H3 hexagonal hierarchical geospatial indexing for mapping latitude/longitude points and polygons to stable multi-resolution cells, running neighborhood traversal (grid disk/ring/distance), compacting | uber/h3 | Other | 9 |
INLA (Integrated Nested Laplace Approximations) — R package for fast approximate Bayesian inference on latent Gaussian models. Provides deterministic approximations to posterior marginals as an altern | hrue/r-inla | Statistics | 13 |
JACKS — Joint Analysis of CRISPR/Cas9 Knock-out Screens. Bayesian method for jointly estimating gene essentiality and gRNA efficacy across multiple CRISPR knockout screens. Shares information across c | felicityallen/JACKS | Systems Biology | 11 |
Limnology (freshwater science) — computational toolkit for freshwater ecosystem analysis including lake thermal stratification (rLakeAnalyzer), lake metabolism estimation (LakeMetabolizer), eDNA metab | AquaticEcoDynamics/GLM | Utilities & Infrastructure | 11 |
MPRADesignTools — R package for designing Massively Parallel Reporter Assay (MPRA) experiments. Provides oligo library design, sample size estimation, barcode generation, sequence tiling, and statisti | andrewGhazi/mpradesigntools | Transcriptomics | 9 |
Use when working with OBO Foundry ontologies or the OBO tooling ecosystem: pronto (Python OBO/OWL parser), ROBOT (Java CLI for ontology operations), oaklib/OAK (Ontology Access Kit), or fastobo. Cover | manual | Systems Biology | 10 |
Orphanet — global reference portal for rare diseases and orphan drugs. Provides ORPHA identifiers (ORPHA codes), the Orphanet Rare Disease Ontology (ORDO), gene-disease associations, clinical trial li | Orphanet/ORDO | Utilities & Infrastructure | 10 |
p.adjust — Multiple testing p-value correction in base R (stats package). Adjusts a vector of p-values using one of seven methods: Bonferroni, Holm (1979), Hochberg (1988), Hommel (1988), Benjamini-Ho | manual | Transcriptomics | 9 |
Wellcome Sanger Institute tools and resources — access COSMIC somatic mutation data, run Cancer Genome Project (CGP) WGS somatic variant calling pipelines (CaVEMan SNVs, Pindel indels, BRASS SVs, ASCA | cancerit/CaVEMan | Genomics | 10 |
SGD (Saccharomyces Genome Database) — the authoritative curated resource for Saccharomyces cerevisiae genome biology. Provides a REST API for querying genes, ORFs, phenotypes, GO annotations, genetic | manual | Utilities & Infrastructure | 9 |
Use this skill for systems biology model simulation with Tellurium when users ask to run Antimony or SBML models, generate time-course simulations, switch between deterministic and stochastic integrat | sys-bio/tellurium | Systems Biology | 9 |
> | Gaius-Augustus/Augustus | Transcriptomics | 10 |
IQ-TREE — efficient phylogenomic software for maximum likelihood tree inference. Integrates ModelFinder for automatic model selection (10-100x faster than jModelTest/ProtTest), ultrafast bootstrap (UF | iqtree/iqtree2 | Phylogenetics | 8 |
Pierian Dx (now Velsera) Clinical Genomics Workspace (CGW) — comprehensive clinical interpretation and analysis platform for NGS data. Supports variant annotation, clinical significance assessment, re | manual | Genomics | 8 |
Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) is a fast, accurate gene prediction tool for prokaryotic genomes and metagenomes. Predicts protein-coding genes in bacterial and archae | hyattpd/Prodigal | Metagenomics | 8 |
AlphaFold Database — public repository of 200M+ AI-predicted protein structures maintained by DeepMind and EMBL-EBI. Access predictions by UniProt accession via REST API at alphafold.ebi.ac.uk. Downlo | google-deepmind/alphafold | Proteomics | 15 |
HL7 FHIR Genomics — clinical genomic data exchange using FHIR R4/R5 profiles. Represent variants, haplotypes, copy-number alterations, and pharmacogenomics findings as interoperable FHIR resources (Mo | elimuinformatics/vcf2fhir | Clinical Genomics | 9 |
MetaPhlAn 4 + StrainPhlAn 4 — marker-gene-based metagenomic profiling and strain-level phylogenetics. MetaPhlAn profiles microbial communities at species level from shotgun metagenomes using ~5.1M uni | biobakery/MetaPhlAn> | Phylogenetics | 10 |
Use when working with NetworkX, graph analysis in Python, biological network analysis, protein-protein interaction networks, gene regulatory networks, metabolic networks, pathway analysis, network top | networkx/networkx | Machine Learning | 10 |
Use this skill when working with Movebank's animal-tracking platform, Movebank REST API, attribute dictionary, and official integration examples. Covers study discovery, authenticated and public data | movebank/movebank-api-doc | Other | 7 |