Dashboard Week Day 2: Web Scraping

Today is day 2 of dashboard week. We were given the challenge to web scrape data from The Great British Bakeoff website and create a dashboard from it. Looking at the website HTML, I noticed the tab of 'All Recipes' would be a great place to start. This section of the website has hundreds of recipes by contestants of the show broken down into different pages. I decided to go through all the pages on this part of the site to get all the recipe details.

To start, I looked at the HTML to figure out what I could scrape successfully. I made a basic plan with screenshots from the HTML

I then took the website URL and brought it into Alteryx. I used some HTML parsing to find exactly how many pages were on the site so I knew how many I had to go through to find all the recipes:

I then scraped each page of the GBB recipe site. I was able to locate all of the pieces of information I was interested in, with the exception of images of the bakers. My full Alteryx flow looks like:

The final dataset looks like:

To download the images of each of the recipes, I used a method from another data school blog, written by Robbin Vernooij: https://thedataschool.com/a/robbin-vernooij/dashboard-week-day-3-api-image-extraction-with-alteryx-and-assign-multiple-images-as-custom-shapes-in-tableau/. This blog is very helpful for scraping a bunch of images in Alteryx and assigning them to the correct fields in Tableau.

I also decided I wanted to include a little bit of data on how the contestants performed in the competition, so I created a short dataset for Season 11. This included the contestant name and placement. I was able to relate that dataset to my web scraped one on contestant name.

I wanted to zoom in on the season 11 recipes, so I created a Tableau dashboard to see who created which recipe and to show the details on the ones that the dashboard user is most interested in. I came up with the final result of:

To view the dashboard, head to my Tableau Public: https://public.tableau.com/app/profile/george.chachkes/viz/HowtoBakeLikeaWinnerGBBRecipes/Dashboard1#1.

Can't wait for tomorrows challenge!

Author:
George Chachkes
Powered by The Information Lab
1st Floor, 25 Watling Street, London, EC4M 9BR
Subscribe
to our Newsletter
Get the lastest news about The Data School and application tips
Subscribe now
© 2026 The Information Lab