bio-binning-qc

Perform metagenomic binning, refinement, and QC with completeness/contamination checks.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "bio-binning-qc" with this command: npx skills add fmschulz/omics-skills/fmschulz-omics-skills-bio-binning-qc

Bio Binning QC

Perform metagenomic binning, refinement, and QC with completeness/contamination checks.

Instructions

  1. Compute depth/coverage per sample.
  2. Run multiple binners (MetaBAT2, SemiBin2, QuickBin).
  3. Classify bins by domain (bacteria/archaea vs eukaryotes).
  4. Run domain-specific QC:
  5. CheckM2 for bacterial and archaeal bins
  6. EukCC for eukaryotic bins
  7. GUNC for contamination detection (all domains).

Quick Reference

TaskAction
Run workflowFollow the steps in this skill and capture outputs.
Validate inputsConfirm required inputs and reference data exist.
Review outputsInspect reports and QC gates before proceeding.
Tool docsSee docs/README.md.
References- See ../bio-skills-references.md

Input Requirements

Prerequisites:

  • Tools available in the active environment (Pixi/conda/system). See docs/README.md for expected tools.
  • Reference DB root: set BIO_DB_ROOT (default /media/shared-expansion/db/ on WSU).
  • Coverage/depth tables or reads available to compute coverage. Inputs:
  • contigs.fasta
  • coverage.tsv (per-sample depth table)

Output

  • results/bio-binning-qc/bins/
  • results/bio-binning-qc/bin_metrics.tsv
  • results/bio-binning-qc/bin_qc_report.html
  • results/bio-binning-qc/logs/

Quality Gates

  • Completeness and contamination meet project thresholds.
  • Chimera and contamination flags are below thresholds.
  • On failure: retry with alternative parameters; if still failing, record in report and exit non-zero.
  • Verify contigs.fasta and coverage.tsv are non-empty.
  • Verify reference DBs for QC tools exist under the reference root.

Examples

Example 1: Expected input layout

contigs.fasta
coverage.tsv (per-sample depth table)

Troubleshooting

Issue: Missing inputs or reference databases Solution: Verify paths and permissions before running the workflow.

Issue: Low-quality results or failed QC gates Solution: Review reports, adjust parameters, and re-run the affected step.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

bio-protein-clustering-pangenome

No summary provided by upstream source.

Repository SourceNeeds Review
General

science-writing

No summary provided by upstream source.

Repository SourceNeeds Review
General

bio-phylogenomics

No summary provided by upstream source.

Repository SourceNeeds Review