Wednesday brought in a bit of web scraping, which is a powerful way of accessing information from the web. We covered web scraping a couple of times during training with PGB, where we scraped IMDb to set up a data source with the top 250 movies. Web scraping in Alteryx (and in general) usually involves three steps:
data:image/s3,"s3://crabby-images/187ee/187ee3d5b0c499da789113fa3ce0522052a418e1" alt=""
For today's project, we took a look at NCD Risk Factor Collaboration data, and I focused specifically on the data pertaining to diabetes. The data I needed sits behind the individual countries tab, so I first had to create a list of countries. Because that list is stored in a JavaScript pop-up, I inspected the page and opened the section containing all the countries as HTML so I could copy it.
data:image/s3,"s3://crabby-images/35f31/35f31bac0c895a71e3c6831f6b5e33ffbe200db3" alt=""
Once I had copied this list, I added it as a text input in Alteryx and parsed it out in order to get a clean list of countries:
data:image/s3,"s3://crabby-images/d8778/d877807c1d3d835a72e35dc69a921d2f5cf7dc10" alt=""
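For anyone who wants to try the same parsing step outside Alteryx, here is a minimal Python sketch. It assumes the copied HTML is a list of `<li>` elements with one country each; the real markup on the page may differ, and `SAMPLE_HTML`, `CountryListParser` and `parse_countries` are names made up for this illustration.

```python
from html.parser import HTMLParser


class CountryListParser(HTMLParser):
    """Collects the text found inside every <li> element."""

    def __init__(self):
        super().__init__()
        self._in_item = False
        self._buffer = []
        self.countries = []

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_item = True
            self._buffer = []

    def handle_data(self, data):
        # Text can arrive in several chunks, so buffer it until </li>.
        if self._in_item:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "li" and self._in_item:
            name = "".join(self._buffer).strip()
            if name:
                self.countries.append(name)
            self._in_item = False


def parse_countries(html_snippet):
    """Return a de-duplicated list of country names, in page order."""
    parser = CountryListParser()
    parser.feed(html_snippet)
    parser.close()
    return list(dict.fromkeys(parser.countries))


# Hypothetical snippet standing in for the HTML copied from the page.
SAMPLE_HTML = """
<ul class="country-list">
  <li><a href="#">Afghanistan</a></li>
  <li><a href="#">Albania</a></li>
  <li><a href="#">Afghanistan</a></li>
</ul>
"""

if __name__ == "__main__":
    print(parse_countries(SAMPLE_HTML))
```

In Alteryx the equivalent is a Text Input followed by some parsing tools; the idea is the same: strip the tags, keep the names, drop duplicates.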
I then used the URL of the page to request the data for each specific country:
data:image/s3,"s3://crabby-images/84004/840042e6361c8357b8f33254c784f9e938837a9b" alt=""
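The per-country request can be sketched in Python roughly as follows. The URL pattern here is a placeholder (`example.org`), not the real address of the site, and `build_url` and `fetch_all` are hypothetical helper names. The sketch assumes each country's data lives at a URL that differs only by the country name.

```python
import time
import urllib.request
from urllib.parse import quote

# Placeholder pattern: swap in the real page URL, with the country
# name inserted wherever the site expects it.
BASE_URL = "https://example.org/country-data/{country}"


def build_url(country):
    """Percent-encode the country name and drop it into the URL pattern."""
    return BASE_URL.format(country=quote(country.strip(), safe=""))


def fetch_all(countries, delay_seconds=1.0, opener=urllib.request.urlopen):
    """Download one page per country and return {country: page text}."""
    pages = {}
    for country in countries:
        with opener(build_url(country), timeout=30) as response:
            pages[country] = response.read().decode("utf-8", errors="replace")
        # Pause between requests so the site is not hammered.
        time.sleep(delay_seconds)
    return pages


if __name__ == "__main__":
    print(build_url("Costa Rica"))
```

The encoding step matters for names with spaces or accents, which would otherwise produce broken URLs.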
I then did a bit of reshaping and added in some population data to enrich the data set. Finally, I output the data to a .hyper file so I could begin visualizing it.
data:image/s3,"s3://crabby-images/2ae87/2ae876a22a7f9d1b7712dcf7aa22cb1d840d1f3a" alt=""
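As a rough illustration of the reshaping and the population join, here is a small Python sketch. The layout is an assumption: one row per country with one column per year, turned into one row per country and year, then matched to a population lookup. The field names and the numbers in the example are invented for the demo and are not values from the NCD-RisC data.

```python
def wide_to_long(rows, id_field="country", value_field="prevalence"):
    """Turn one-column-per-year rows into one row per country and year."""
    long_rows = []
    for row in rows:
        for column, value in row.items():
            if column == id_field:
                continue
            long_rows.append(
                {id_field: row[id_field], "year": int(column), value_field: value}
            )
    return long_rows


def add_population(long_rows, population):
    """Attach population by (country, year); None when there is no match."""
    return [
        dict(row, population=population.get((row["country"], row["year"])))
        for row in long_rows
    ]


if __name__ == "__main__":
    # Invented sample values, for illustration only.
    wide = [{"country": "Albania", "2013": 0.10, "2014": 0.11}]
    pop = {("Albania", 2013): 2900000}
    for record in add_population(wide_to_long(wide), pop):
        print(record)
```

Keeping unmatched rows with an empty population, rather than dropping them, makes gaps in the population data easy to spot later in the dashboard.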
For the dashboarding part, I wanted to work on my dashboard design and incorporate some new ideas. I decided to try setting up my dashboard as a 'Patient Form', where each patient would be a country and you could look at their status in terms of diabetes. The outcome is as follows:
data:image/s3,"s3://crabby-images/15aa4/15aa4cff1bd76f1ab82c134930e5f53d08b12848" alt=""
It's a simple dashboard but gives quality insight into the diabetes status of specific countries!