tooluniverse-protein-therapeutic-design

Design novel protein therapeutics (binders, enzymes, scaffolds) using AI-guided de novo design. Uses RFdiffusion for backbone generation, ProteinMPNN for sequence design, ESMFold/AlphaFold2 for validation. Use when asked to design protein binders, therapeutic proteins, or engineer protein function.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "tooluniverse-protein-therapeutic-design" with this command: npx skills add mims-harvard/tooluniverse/mims-harvard-tooluniverse-tooluniverse-protein-therapeutic-design

Therapeutic Protein Designer

AI-guided de novo protein design using RFdiffusion backbone generation, ProteinMPNN sequence optimization, and structure validation for therapeutic protein development.

KEY PRINCIPLES:

  1. Structure-first - Generate backbone geometry before sequence
  2. Target-guided - Design binders with target structure in mind
  3. Iterative validation - Predict structure to validate designs
  4. Developability-aware - Consider aggregation, immunogenicity, expression
  5. Evidence-graded - Grade designs by confidence metrics
  6. Actionable output - Provide sequences ready for experimental testing
  7. English-first queries - Always use English terms in tool calls

When to Use

Apply when user asks to:

  • Design a protein binder, therapeutic protein, or scaffold
  • Optimize a protein sequence for function
  • Design a de novo enzyme
  • Generate protein variants for target binding

Workflow Overview

Phase 1: Target Characterization
  Get structure (PDB, EMDB cryo-EM, AlphaFold), identify binding epitope

Phase 2: Backbone Generation (RFdiffusion)
  Define constraints, generate >= 5 backbones, filter by geometry

Phase 3: Sequence Design (ProteinMPNN)
  Design >= 8 sequences per backbone, sample with temperature control

Phase 4: Structure Validation (ESMFold/AlphaFold2)
  Predict structure, compare to backbone, assess pLDDT/pTM

Phase 5: Developability Assessment
  Aggregation, pI, expression prediction

Phase 6: Report Synthesis
  Ranked candidates, FASTA, experimental recommendations

Critical Requirements

Report-First Approach (MANDATORY)

  1. Create [TARGET]_protein_design_report.md first with section headers
  2. Progressively update as designs are generated
  3. Output [TARGET]_designed_sequences.fasta and [TARGET]_top_candidates.csv

Design Documentation (MANDATORY)

Every design MUST include: Sequence, Length, Target, Method, and Quality Metrics (pLDDT, pTM, MPNN score, binding prediction).


NVIDIA NIM Tools

ToolPurposeKey Parameter
NvidiaNIM_rfdiffusionBackbone generationdiffusion_steps (NOT num_steps)
NvidiaNIM_proteinmpnnSequence designpdb_string (NOT pdb)
NvidiaNIM_esmfoldFast validationsequence (NOT seq)
NvidiaNIM_alphafold2High-accuracy validationsequence, algorithm
NvidiaNIM_esm2_650mSequence embeddingssequences, format

Common Parameter Mistakes

ToolWrongCorrect
NvidiaNIM_rfdiffusionnum_steps=50diffusion_steps=50
NvidiaNIM_proteinmpnnpdb=contentpdb_string=content
NvidiaNIM_esmfoldseq="MVLS..."sequence="MVLS..."
NvidiaNIM_alphafold2seq="MVLS..."sequence="MVLS..."

NVIDIA NIM Requirements

  • API Key: NVIDIA_API_KEY environment variable required
  • Rate limits: 40 RPM (1.5 second minimum between calls)
  • AlphaFold2 may return 202 (polling required); RFdiffusion and ESMFold are synchronous

Supporting Tools

ToolPurposeKey Parameters
PDB_search_by_uniprotFind PDB structuresuniprot_id
PDB_get_structureDownload PDB filepdb_id
alphafold_get_predictionGet AlphaFold DB structureaccession
emdb_searchSearch cryo-EM mapsquery
emdb_get_entryGet entry detailsentry_id
UniProt_get_protein_sequenceGet target sequenceaccession
InterPro_get_protein_domainsGet domainsaccession

Evidence Grading

TierCriteria
T1 (best)pLDDT >85, pTM >0.8, low aggregation, neutral pI
T2pLDDT >75, pTM >0.7, acceptable developability
T3pLDDT >70, pTM >0.65, developability concerns
T4Failed validation or major developability issues

Completeness Checklist

  • Target structure obtained (PDB or predicted)
  • Binding epitope identified
  • >= 5 backbones generated, top 3-5 selected
  • >= 8 sequences per backbone, MPNN scores reported
  • All sequences validated (ESMFold), pLDDT/pTM reported, >= 3 passing
  • Developability assessed (aggregation, pI, expression)
  • Ranked candidate list, FASTA file, experimental recommendations

Reference Files

  • DESIGN_PROCEDURES.md - Phase-by-phase code examples, sampling parameters, fallback chains
  • TOOLS_REFERENCE.md - Complete tool documentation with code examples
  • EXAMPLES.md - Sample design workflows and outputs
  • CHECKLIST.md - Detailed phase checklists and quality metrics
  • design_templates.md - Report templates and output format examples

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

tooluniverse-sequence-retrieval

No summary provided by upstream source.

Repository SourceNeeds Review
General

tooluniverse

No summary provided by upstream source.

Repository SourceNeeds Review
General

setup-tooluniverse

No summary provided by upstream source.

Repository SourceNeeds Review
General

tooluniverse-chemical-compound-retrieval

No summary provided by upstream source.

Repository SourceNeeds Review