The Challenge
Teams need to understand data engineering, ETL pipelines, and analytics infrastructure with data lake architecture, schema discovery, and serverless analytics.
What You'll Build
- → S3 data lake with structured folders
- → Glue crawlers for schema discovery
- → Glue ETL jobs for data transformation
- → Athena for SQL queries
- → Lifecycle policies for cost optimization
Try This Prompt in Pulumi Neo
Edit the prompt below and run it directly in Neo to deploy your infrastructure.
Best For
Use this prompt to implement data engineering, ETL, analytics, and cost optimization. Perfect for teams building data lake patterns and serverless analytics.
Key Features
This deployment includes:
- Data Lakes - Multi-stage architecture
- ETL - Extract, transform, load pipelines
- Schema Discovery - Automatic cataloging
- Analytics - SQL queries with Athena
- Cost Optimization - Lifecycle policies
Implements data engineering fundamentals!