I procrastinated on starting this assignment quite a bit. My course load for Software Engineering has overshadowed most of my other classes, and I’m doing my best.
Anyways, this weekend I sought out to make some good progress on this assignment. Previously, I found a data source, and I wanted to put it to use. The next day, I found an appropriate source for another set of data that would go well with the first.
The first set of data I found was an excel file containing the estimated percentage of internet users per country by year for 2001-2009 from the International Telecommunications Union. Overall, this data set contained information from about 220 countries or so.
To actually use this data, I had to do a fair amount of scripting.
ITU’s data was simple enough, I copied the data (by selecting the rows I needed) into Python, and after some string replacement, I had some usable lists of data.
The data from the IDB was a bit harder to grab. First of all, the data was on 10 different pages (one for each year), and it was not so friendly. My steps for getting the data went something like:
- Select all data and paste into python.
- Run the string of data through a parsing function that I wrote.
Now that the data was in a usable form, I had a bit more mashing to do to get the data to make sense with each other. I’m not going to go into too much detail, but after much reorganization, I ended up with two data sets. The first set the population data from 188 countries over the course of ten years (2000-2009). The second set is the estimated population of the same 188 countries over the same ten years.
In the end, yeah, I lost about 40 countries or so. Honestly, both sets of data probably contained the same countries (or at least were probably closer than they ended up), but due to differences in names and naming conventions, some countries were dropped from the set.
Now, to the visualization.
That’s about all I have to say before I throw up the link to the final version. Oh, users beware, I can only vouch for this working in Chrome on Windows 7. I’m pretty sure this doesn’t do much of anything in Firefox (it doesn’t in 3.6, can’t speak on 4). IE? Good luck.
Oh, and if you are on Chrome, then be sure to click around. You can change the year that is currently shown and you can also change which countries you wish to see visualized.
..also.. its really best to have a large screen and view it full-screen if you can.
Anyways, here it is: Population vs Estimated Internet Users 2000-2009.
If I get a chance I may be updating this, but that chance is looking slim at the moment.