troubleshooting

Dataiku Troubleshooting Guide

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "troubleshooting" with this command: npx skills add jediv/dataiku-chat-control/jediv-dataiku-chat-control-troubleshooting

Dataiku Troubleshooting Guide

Debugging Checklist

  • Environment activated? which python should show dataiku-env

  • Variables set? echo $DSS_URL

  • Can connect? Run scripts/bootstrap.py

  • Recipe saved? Check for settings.save()

  • Job ran? Check for recipe.run()

  • Job succeeded? Check job.get_status()

  • Schema correct? Run autodetect_settings()

Top-10 Error Quick Reference

Error Cause Solution

Connection refused

Wrong DSS_URL or instance down Verify URL, check instance status

401 Unauthorized

Invalid or expired API key Regenerate key in Dataiku UI

Project not found

Wrong project key or no access client.list_project_keys() to verify

Settings not saved Missing settings.save()

Always call settings.save() after changes

Recipe ran but no data Filter/join removed all rows Check inputs, join keys, filters

Job failed Schema mismatch, missing inputs Inspect job status and logs

invalid identifier (quoted) Lowercase column names in SQL schema Normalize schema to UPPERCASE

table does not exist

Upstream dataset not built Build datasets in dependency order

Insert value list mismatch

Output schema doesn't match recipe output Run recipe.compute_schema_updates() and apply

ModuleNotFoundError: dataikuapi

Virtual environment not activated source ~/dataiku-env/bin/activate

Job Failure Investigation Pattern

Get the most recent job and extract error details

jobs = project.list_jobs() job = project.get_job(jobs[0]['def']['id']) status = job.get_status() state = status.get("baseStatus", {}).get("state") # "DONE" or "FAILED"

if state == "FAILED": activities = status.get("baseStatus", {}).get("activities", {}) for name, info in activities.items(): if info.get("firstFailure"): print(f"Error: {info['firstFailure'].get('message')}")

# Or get full log
print(job.get_log())

Important: recipe.run() already waits for completion internally. Use recipe.run(no_fail=True) to prevent exceptions on failure, then inspect the returned job object.

Detailed Error References

For full details on each error category including causes, code examples, and solutions:

  • references/connection-errors.md — Connection refused, 401 Unauthorized, Project not found

  • references/recipe-errors.md — Settings not saved, empty output, job failures, job API usage patterns

  • references/sql-errors.md — Invalid identifier (quoted/general), table does not exist, pre-join computed columns, insert value list mismatch

  • references/environment-errors.md — ModuleNotFoundError, missing env vars, getting more help

Scripts

  • scripts/debug_job.py — Standalone script to debug the most recent failed job

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

recipe-patterns

No summary provided by upstream source.

Repository SourceNeeds Review
General

dataset-management

No summary provided by upstream source.

Repository SourceNeeds Review
General

data-catalog

No summary provided by upstream source.

Repository SourceNeeds Review