CD-HIT — fast sequence clustering and redundancy reduction for protein and nucleotide sequences. Clusters FASTA sequences at a user-defined identity threshold to remove redundancy from databases. Prov
Use with AI
Install the MCP server or CLI to instantly fetch CD-HIT documentation:
Install command
claude mcp add biocontext7 -- npx @biocontext7/mcpOr share this page: biocontext7.com/tools/cd-hit
Kipoi — unified Python API and CLI for 2000+ pre-trained machine learning models for genomic sequence analysis. Load, run, and interpret models for variant effect prediction, transcription factor bind
3 shared topics
MutPred2 — machine learning pathogenicity predictor for amino acid substitutions (missense variants). Generates a pathogenicity score (0–1) and ranked molecular mechanism hypotheses (loss/gain of PTMs
3 shared topics
Boltz-1 — open-source deep learning model for predicting biomolecular 3D structures and interactions, approaching AlphaFold3-level accuracy. Supports protein, DNA, RNA, and small-molecule ligand struc
2 shared topics
Ensembl Database — EMBL-EBI's comprehensive genome annotation database covering 300+ species with gene models, variants, regulatory features, and comparative genomics. Query via REST API at rest.ensem
2 shared topics
OrthoFinder — fast, accurate ortholog inference for comparative genomics. Identifies orthogroups, orthologs, gene trees, rooted species trees, and gene duplication events from protein or nucleotide se
2 shared topics