bio-prefect-dask-nextflow

Design and scaffold bioinformatics pipelines using Prefect+Dask for local/distributed execution or Nextflow for HPC schedulers.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "bio-prefect-dask-nextflow" with this command: npx skills add fmschulz/omics-skills/fmschulz-omics-skills-bio-prefect-dask-nextflow

Bio Prefect + Dask + Nextflow

Choose and scaffold the right workflow engine for local, distributed, or HPC bioinformatics pipelines.

Instructions

  1. Collect requirements (scheduler, container policy, data location, scale).
  2. Choose engine: Prefect+Dask, Nextflow, or Hybrid.
  3. Generate a runnable scaffold with clear data layout and resources.
  4. Validate with a small test and resume/retry checks.

Quick Reference

TaskAction
Engine choiceSee decision-matrix.md
Prefect+Dask scaffoldSee prefect-dask.md
Prefect on SlurmSee prefect-hpc-slurm.md
Nextflow on HPCSee nextflow-hpc.md
ExamplesSee examples.md

Input Requirements

  • Workflow requirements and steps
  • Target environment (local, cluster, cloud)
  • Scheduler and container constraints
  • Data locations and expected volumes

Output

  • Engine recommendation with rationale
  • Runnable scaffold (files + commands)
  • Resource plan per step
  • Validation plan and checkpoints

Quality Gates

  • Tiny test run completes end-to-end
  • Resume/retry behavior verified
  • Resource plan matches cluster limits

Examples

Example 1: Engine recommendation

Choice: Nextflow
Why: CLI-heavy pipeline, HPC scheduler required, reproducible cache/resume needed.

Troubleshooting

Issue: Workflow fails on HPC due to environment mismatch Solution: Pin container/conda versions and validate with a minimal test dataset.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

bio-workflow-methods-docwriter

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

agent-browser

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

notebook-ai-agents-skill

No summary provided by upstream source.

Repository SourceNeeds Review
General

bio-phylogenomics

No summary provided by upstream source.

Repository SourceNeeds Review