Data can be cleaned in many ways, but two of the most practical tools you’ll use are Tableau Prep and Alteryx. Both help you take messy, unpredictable data and turn it into something organized, reliable, and ready for analysis. During Week 1, I learnt the basics of cleaning data in Tableau Prep.
Why do we need to clean data?
If the data has errors, duplicates, or missing values, the charts can be wrong or confusing. Cleaning makes sure everything’s accurate, consistent, and ready to tell the real story.
1. Connect to the data
Think of Tableau Prep as an all‑access stage manager. You tell it where the instruments (data) are stored—maybe in Spotify reports, Excel sheets, or cloud databases. It brings them all to the same stage.
Example:
You load two files: “User_Accounts.xlsx” and “Listening_History.csv.”
2. Profile the data
Now the sound check begins. Tableau Prep lights up your data with visuals that show what’s offbeat—missing notes, odd characters, or wrong tempos.
Example:
You notice:
“Beyoncé,” “Beyoncé,” and “BEYONCE” appear as three separate artists.
Some users have no country listed.
“Listening time” has strange negative numbers.
3. Clean the data
Time to tune your instruments. You polish and fix anything that sounds wrong.
Common fixes:
- Group & Replace: Merge all spellings of “Beyoncé.”
- Remove Duplicates: Delete repeated records for the same song play.
- Fix Data Types: Convert “play_date” text (string) into an actual date.
- Handle Missing Values: Fill unknown user country with “Unknown” or leave it blank if needed.
Example:
A user played a song for “‑120 seconds”? You correct or remove that record.
4. Reshape the data
This is remix time—arranging your data for the best rhythm in Tableau.
You might:
Join (makes your table wider): Combine’ “User_Accounts” with “Listening_History” so each song plays links to user details.
Union (makes your table longer): Stack monthly activity files into one big playlist.
Pivot: Switch rows and columns—maybe compare artists across regions.
Aggregate: Summarize total listening time per artist or per country.
5. Output the final dataset
The show’s ready! Export your “Music_Data_Clean_Final.csv” — your clean, perfectly‑tuned dataset — and send it straight to Tableau Desktop to create dashboards about top artists, listening trends, or user habits.
To learn more, explore Tableau’s official guides for data prep best practices:
https://www.tableau.com/learn/whitepapers/data-prep-best-practices
https://www.tableau.com/blog/best-practices-authoring-data-preparation-flow
