The main data set used in this project is the awesome Olympic_history by rgriff23.
Other complementary data sets:
continent.csv is used for extracting the ISO-3166 three letter country code and the corresponding continent name.
host_countries.csv is used in vis 3.
continent_4.xlsx is used to extract the IOC (International Olympic Committee) country codes and the ISO-3166 three letter country codes. We merge
continent_4withcontinentto get the corresponding continent name of each IOC code. We then merge this with summer.csv to produce continent_percentage_tidy and continent_percentage_untidy, two of which we used to visualize changes in female particiation in the Olympics in different continents.
For more detailed information about our data sources, and our codes for data manipulation and visualization, please check out our olymvis-data repository on GitHub.