Today is day 2 of dashboard week. We were given the challenge to web scrape data from The Great British Bakeoff website and create a dashboard from it. Looking at the website HTML, I noticed the tab of 'All Recipes' would be a great place to start. This section of the website has hundreds of recipes by contestants of the show broken down into different pages. I decided to go through all the pages on this part of the site to get all the recipe details.
To start, I looked at the HTML to figure out what I could scrape successfully. I made a basic plan with screenshots from the HTML

I then took the website URL and brought it into Alteryx. I used some HTML parsing to find exactly how many pages were on the site so I knew how many I had to go through to find all the recipes:

I then scraped each page of the GBB recipe site. I was able to locate all of the pieces of information I was interested in, with the exception of images of the bakers. My full Alteryx flow looks like:

The final dataset looks like:

To download the images of each of the recipes, I used a method from another data school blog, written by Robbin Vernooij: https://thedataschool.com/a/robbin-vernooij/dashboard-week-day-3-api-image-extraction-with-alteryx-and-assign-multiple-images-as-custom-shapes-in-tableau/. This blog is very helpful for scraping a bunch of images in Alteryx and assigning them to the correct fields in Tableau.
I also decided I wanted to include a little bit of data on how the contestants performed in the competition, so I created a short dataset for Season 11. This included the contestant name and placement. I was able to relate that dataset to my web scraped one on contestant name.
I wanted to zoom in on the season 11 recipes, so I created a Tableau dashboard to see who created which recipe and to show the details on the ones that the dashboard user is most interested in. I came up with the final result of:

To view the dashboard, head to my Tableau Public: https://public.tableau.com/app/profile/george.chachkes/viz/HowtoBakeLikeaWinnerGBBRecipes/Dashboard1#1.
Can't wait for tomorrows challenge!
