It’s the teams final day of Dashboard week, does that mean I’m going to make it easy? Hell no. This time the focus is on their SQL and visualisation skills.
For today’s challenge, DS50 will explore summer-themed LEGO sets using the complete Rebrickable LEGO CSV files (sets, parts, colors, inventories, themes, etc.).
Your Task
- Download all the tables and upload them individually to your own schema on Snowflake
- Clean, join and create views in Snowflake
- Identify summer-themed LEGO sets and themes using keywords in the datasets.
- Analyze relationships between sets, parts, colors, and themes etc with a summer theme.
- Explore trends such as:
- Set releases over time
- Popular parts and colors
- Set sizes and part categories
- Highlight rare or unique parts exclusive to summer sets.
- Connect to your newly created data from Snowflake and Create an interactive dashboard using a visualization tool different from your previous Monday challenge.
- Document your findings clearly with visuals and summaries.
- Organize your cleaned data files, SQL queries, dashboards, and documentation in a GitHub repo.
Data Source
Access the full Rebrickable LEGO CSV datasets here
Deliverables
- Interactive dashboard
- GitHub repository containing:
- Cleaned and prepared datasets
- SQL scripts or data prep files
- Documentation and summaries
Presentations are at 3pm, and I expect a blog by 5pm, you know the drill by now.
Good luck DS50! Unleash your creativity and data skills to make summer LEGO sets shine in your dashboards.