International College of Digital Innovation, CMU
July 2, 2025
This dataset provides economic and health indicators for countries around the world from the year 1952 to 2007. The data comes from the Gapminder Foundation, an organization that develops educational tools for understanding global development and sustainability.
The dataset contains 6 key variables:
country
: Name of the country (e.g., Afghanistan, Brazil, China)
continent
: Continent where the country is located (Africa, Americas, Asia, Europe, Oceania)
year
: Year the data was recorded (1952, 1957, …, 2007)
lifeExp
: Life Expectancy: the
average number of years a person is expected to live in each country
pop
: Population: total number of people living in the country
gdpPercap
: GDP per Capita: Gross Domestic Product divided by population (in US dollars)
You can download the Gapminder dataset from Google Drive: Click here to download
Why use Gapminder?
The Gapminder dataset has a rich and structured format, making it ideal for:
Creating visualizations in Excel
Analyzing global development trends
Comparing data across countries, continents, and time
Use Case: Visualizing the trend of life expectancy (lifeExp) for a specific country over time.
Goal: Show the trend of life expectancy in Thailand from 1952 to 2007.
Step-by-Step Instructions
Filter the data for Thailand
Select two columns:
year
(X-axis)
lifeExp
(Y-axis)
Go to Insert → choose Line Chart
Choose chart type: Line with Markers is recommended
Adjust axis labels:
X-axis = Year
Y-axis = Life Expectancy
A smooth line chart showing how Thailand’s life expectancy has increased from 1952 to 2007.
Use Case: Compare GDP per capita (gdpPercap) among countries in a specific year.
Goal: Compare GDP per capita of countries in Asia for the year 2007.
Step-by-Step Instructions
Filter the data for:
continent == "Asia"
year == 2007
Select two columns:
country
(X-axis)
gdpPercap
(Y-axis)
Go to Insert → choose Bar Chart
Choose the chart type: Clustered Bar or Clustered Column
Adjust the axis:
X-axis = Country
Y-axis = GDP per Capita
(Optional) Sort GDP values from highest to lowest for better communication.
A bar chart that shows the economic disparity among Asian countries in 2007, highlighting countries with high or low GDP per capita.
Use Case: Visualize the population share (pop) of countries within a specific continent and year.
Goal: Show the population proportions of countries in Africa in the year [2007.
Step-by-Step Instructions
Filter the data for:
continent == "Africa"
year == 2007
Select two columns:
country
(as categories)
pop
(as values)
Go to Insert → choose Pie Chart
Choose a chart type: 2D Pie or 3D Pie
Add Data Labels: Show either percentage or actual population
A pie chart showing the distribution of population among African countries in 2007, giving a clear view of how population is concentrated.
Use Case: Explore the relationship between GDP per capita and Life Expectancy.
Goal: Visualize the relationship between GDP per capita and Life Expectancy
for all countries in the year [2007.
Step-by-Step Instructions
Filter the data for: year == 2007
Select two columns:
gdpPercap
(X-axis)
lifeExp
(Y-axis)
Go to Insert → choose Scatter Chart
Choose a chart type:
Scatter with Markers
(Optional) Scatter with Smooth Lines
Adjust axis labels:
X-axis = GDP per Capita
Y-axis = Life Expectancy
💡 Tip: Sort your data by gdpPercap
or lifeExp
(ascending or descending) before plotting for clearer insights and cleaner visuals.
The chart shows how countries with higher GDP per capita often have longer life expectancy — a sign of better health outcomes.
Use Case: Show the trend of population (pop) over time for a specific continent.
Goal: Visualize the population growth in Europe from [1952 to 2007.
Step-by-Step Instructions
Filter the data for: continent == "Europe"
Summarize population by year: Use a PivotTable to sum pop
for all countries in each year
Go to Insert → choose Area Chart (found under the Line Chart options)
Choose chart type:
Stacked Area
or 100% Stacked Area (for proportion comparison)
An area chart that shows how the population of Europe has changed over time,
emphasizing both magnitude and growth trend.
💡 Tip
Use Stacked Area when comparing parts of a whole (e.g., population by region)
Use Standard Area when focusing on total growth over time.
Use Case: Compare population (pop) across countries in the same continent, separated by year.
Goal: Compare the population of countries in Americas between the years 1952 and [2007.
Step-by-Step Instructions
Use a PivotTable and Slicer to filter:
continent == "Americas"
Years 1952
and 2007
Select data:
country
as categories
pop
for both years
Go to Insert → choose Stacked Bar Chart
Choose chart type:
Clustered Column (grouped bars for each year)
or Stacked Column (stacked segments for each year)
A grouped or stacked bar chart comparing how populations have changed in countries across the Americas from 1952 to 2007.
💡 Tip: Use Clustered Column for side-by-side year comparison, and Stacked Column for visualizing the contribution of each country to the total population.
Use Case: Visualize the relationship between [GDP per capita, [Life Expectancy, and Population size using a Bubble Chart.
Goal: Show the relationship between GDP per capita and Life Expectancy in 2007,
with bubble size representing the population.
Step-by-Step Instructions
Filter the data for:
year == 2007
Select three columns:
gdpPercap
→ X-axis
lifeExp
→ Y-axis
pop
→ Bubble size
Go to Insert → choose Bubble Chart
Set bubble size to represent population (pop
)
(Optional) Add country labels or tooltips for clarity
A bubble chart showing:
Countries with high GDP and high life expectancy in the upper-right
Bubble sizes indicating how population varies among them
💡 Tip: Sort or filter out countries with extremely high populations if bubbles overlap too much. You may also normalize bubble sizes for better visualization.
Use Case: Highlight trends and variations in values (e.g., life expectancy) using color gradients.
Goal: Create a heatmap showing Life Expectancy (lifeExp) of each country in Asia
from 1952 to 2007 using Conditional Formatting in Excel.
Step-by-Step Instructions
Filter the data for: continent == "Asia"
Use a PivotTable:
Rows = country
Columns = year
Values = average of lifeExp
Select the PivotTable cells with numeric values
Go to Home → Conditional Formatting → choose Color Scales
Pick a color scale (e.g., green-yellow-red) to indicate value differences
A color-coded table where:
Green = Higher life expectancy
Red = Lower life expectancy
Clear visual patterns over time across countries
💡 Tip: Use a diverging color scale to emphasize differences and make comparisons easier.
This approach works well for time-series or multivariate country data.
Use Case: Compare the same variable (e.g., life expectancy) across multiple countries over time.
Goal: Compare Life Expectancy trends of countries in Asia from 1952 to 2007 using multiple line plots.
Step-by-Step Instructions
Use a [PivotTable:
Rows = year
Columns = country
Values = average of lifeExp
(Filter only for Asia)
Highlight the data table
Go to Insert → choose Line Chart
Each line will represent one country’s life expectancy trend
A chart with multiple lines showing:
How life expectancy changed over time
Differences and similarities between Asian countries
💡 Tip: Use consistent colors or label key countries directly on the lines to enhance readability and avoid clutter.