data-analytics

Create data pipeline and analytics architecture diagrams using PlantUML syntax with database/analytics stencil icons. Best for ETL pipelines, data lakes, real-time streaming, data warehousing, and BI dashboard design.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "data-analytics" with this command: npx skills add markdown-viewer/skills/markdown-viewer-skills-data-analytics

Data Analytics Diagram Generator

Quick Start: Define data sources → Declare ingestion/ETL icons → Connect to storage/warehouse → Add BI/visualization → Wrap in ```plantuml fence.

⚠️ IMPORTANT: Always use ```plantuml or ```puml code fence. NEVER use ```text — it will NOT render as a diagram.

Critical Rules

  • Every diagram starts with @startuml and ends with @enduml
  • Use left to right direction for data pipelines (Source → Ingest → Transform → Store → Visualize)
  • Use mxgraph.aws4.* stencil syntax for analytics, database, and storage icons
  • Default colors are applied automatically — you do NOT need to specify fillColor or strokeColor
  • Use rectangle "Zone" { ... } or package "Layer" { ... } for grouping pipeline stages
  • Directed flows use -->, async/streaming flows use ..> (dashed)

Full stencil reference: See stencils/README.md for 9500+ available icons.

Mxgraph Stencil Syntax

mxgraph.aws4.<icon> "Label" as <alias>

Analytics & ETL Stencils

CategoryStencilsPurpose
Query Engineathena, athena_data_source_connectorsServerless SQL on S3 data
ETLglue, glue_crawlers, glue_data_catalog, aws_glue_data_quality, aws_glue_for_rayData integration & cataloging
Streamingkinesis, kinesis_data_streams, kinesis_data_firehose, kinesis_data_analytics, kinesis_video_streamsReal-time data streaming
MapReduceemr, emr_engine, emr_engine_mapr_m3, emr_engine_mapr_m5Big data processing (Spark, Hive)
Data Warehouseredshift, redshift_ra3, redshift_streaming_ingestion, redshift_mlColumnar analytics warehouse
Searchopensearch_service_data_node, opensearch_ingestion, cloudsearchFull-text search & log analytics
BIquicksightDashboards & visualizations
Data Lakelake_formation, s3, glacier, glacier_deep_archiveGoverned data lake storage
Catalogdatazone_custom_asset_type, data_exchangeData governance & sharing
Streaming Kafkamsk, msk_connectManaged Kafka streaming

Database Stencils

CategoryStencilsPurpose
Relationalaurora, aurora_instance, rds, rds_instance, rds_mysql_instance, rds_postgresql_instanceTransactional databases
NoSQLdynamodb, dynamodb_table, dynamodb_global_secondary_index, dynamodb_streamKey-value & document store
GraphneptuneGraph database
In-Memoryelasticache, elasticache_for_redis, elasticache_for_memcachedCache & session store
Documentdocumentdb, documentdb_with_mongodb_compatibilityDocument database
Ledgerquantum_ledger_databaseImmutable transaction log
Wide-ColumnkeyspacesCassandra-compatible

Connection Types

SyntaxMeaningUse Case
A --> BSolid arrowBatch data flow / API call
A ..> BDashed arrowStreaming / async / CDC
A -- BSolid lineBidirectional sync
A --> B : "label"Labeled connectionDescribe data format or volume

Quick Example

@startuml
left to right direction
mxgraph.aws4.s3 "Data Lake\n(S3)" as s3
mxgraph.aws4.glue "Glue\nETL" as glue
mxgraph.aws4.redshift "Redshift" as rs
mxgraph.aws4.quicksight "QuickSight" as qs

s3 --> glue
glue --> rs
rs --> qs
@enduml

Data Analytics Architecture Types

TypePurposeKey StencilsExample
Data LakeCentralized raw data stores3, lake_formation, glue, athenadata-lake.md
Real-time StreamingEvent stream processingkinesis, msk, lambda_function, opensearch_servicereal-time-streaming.md
Data WarehouseStar-schema analyticsredshift, glue, quicksightdata-warehouse.md
ETL PipelineExtract-transform-loadglue, glue_crawlers, glue_data_catalog, s3etl-pipeline.md
Log AnalyticsCentralized loggingkinesis_data_firehose, opensearch_service, lambda_functionlog-analytics.md
ML Feature StoreFeature engineering pipelineglue, s3, athena, emrml-feature-pipeline.md
CDC PipelineDatabase change capturedynamodb_streams, kinesis, lambda_function, redshiftcdc-pipeline.md
Multi-source BICross-database reportingaurora, dynamodb, redshift, quicksightmulti-source-bi.md

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

architecture

No summary provided by upstream source.

Repository SourceNeeds Review
General

uml

No summary provided by upstream source.

Repository SourceNeeds Review
General

infographic

No summary provided by upstream source.

Repository SourceNeeds Review
General

graphviz

No summary provided by upstream source.

Repository SourceNeeds Review