funsloth-hfjobs

Training manager for Hugging Face Jobs - launch fine-tuning on HF cloud GPUs with optional WandB monitoring

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "funsloth-hfjobs" with this command: npx skills add chrisvoncsefalvay/funsloth/chrisvoncsefalvay-funsloth-funsloth-hfjobs

Hugging Face Jobs Training Manager

Run Unsloth training on Hugging Face Jobs (cloud GPU training).

Prerequisites

  1. HF Authentication: huggingface-cli whoami (login if needed)
  2. HF Jobs Access: Requires PRO subscription or org compute access
  3. Training notebook/script: From funsloth-train

Workflow

1. Select Hardware

GPUVRAMCostBest For
A10G24GB~$1.50/hr7-14B LoRA
A100 40GB40GB~$4/hr14-34B
A100 80GB80GB~$6/hr70B
H10080GB~$8/hrFastest

See references/HARDWARE_GUIDE.md for model-to-GPU mapping.

2. Convert Notebook to Script

HF Jobs requires PEP 723 script format:

# /// script
# requires-python = ">=3.10"
# dependencies = [
#     "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git",
#     "torch>=2.0",
#     "transformers>=4.45",
#     "trl>=0.12",
#     "peft>=0.13",
#     "datasets>=2.18",
# ]
# ///

Use scripts/train_sft.py as a template.

3. Optional: WandB Integration

Add to script:

import wandb
wandb.init(project="funsloth-training")
# Add report_to="wandb" in TrainingArguments

Set: export WANDB_API_KEY="your-key"

4. Estimate Costs

Use the cost estimator:

python scripts/estimate_cost.py --tokens {total_tokens} --platform hfjobs

5. Launch Job

# Create job config
cat > job_config.yaml << 'EOF'
compute:
  gpu: {gpu_type}
  gpu_count: 1
script: train_hfjobs.py
outputs:
  - /outputs/*
EOF

# Submit
huggingface-cli jobs create --config job_config.yaml

6. Monitor Progress

huggingface-cli jobs status {job_id}
huggingface-cli jobs logs {job_id} --follow

WandB: https://wandb.ai/{username}/funsloth-training

7. Download Artifacts

from huggingface_hub import snapshot_download
snapshot_download(repo_id="{username}/funsloth-job", local_dir="./outputs")

8. Handoff

Offer funsloth-upload for Hub upload with model card.

Error Handling

ErrorResolution
No HF Jobs accessGet PRO subscription
OOMReduce batch size or upgrade GPU
Job timeoutEnable checkpointing
Script errorCheck PEP 723 dependencies

Bundled Resources

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

funsloth-check

No summary provided by upstream source.

Repository SourceNeeds Review
General

funsloth-local

No summary provided by upstream source.

Repository SourceNeeds Review
General

d3-viz

No summary provided by upstream source.

Repository SourceNeeds Review
General

Zip

Zip - command-line tool for everyday use

Registry SourceRecently Updated