Friday, December 27, 2024
Google search engine
HomeLanguagesPlotting Geospatial Data using GeoPandas

Plotting Geospatial Data using GeoPandas

GeoPandas is an open source tool to add support for geographic data to Pandas objects. In this, article we are going to use GeoPandas and Matplotlib for plotting geospatial data.

Installation

We are going to install GeoPandas, Matplotlib, NumPy and Pandas.

pip install geopandas
pip install matplotlib
pip install numpy
pip install pandas

Note: If you don’t want to install these modules locally on your computer, use Jupyter Notebook or Google Colab.

Getting Started

Importing modules and dataset

We are going to import Pandas for the dataframe data structure, NumPy for some mathematical functions, GeoPandas for supporting and handling geospatial data and Matplotlib for actually plotting the maps.

import pandas as pd
import geopandas as gpd
import numpy as np
import matplotlib.pyplot as plt

GeoPandas gives us some default datasets along with its installation to play around with. Let’s read one of the datasets.

Python3




import pandas as pd
import geopandas as gpd
import numpy as np
import matplotlib.pyplot as plt
  
world = gpd.read_file(gpd.datasets.get_path('naturalearth_lowres'))
world.head()


Output:

world.head()

Some of the other datasets to play with are ‘naturalearth_cities’ and ‘nybb’. Feel free to experiment with them later. We can use world and plot the same using Matplotlib.

Python3




world.plot()


Output:

World Plot

Analyse the datasets

Now, if we see world, we have a lot of fields. One of them is GDP estimate(or gdp_md_est). However, to show how easily data can be filtered in or out in pandas, let’s filter out all continents except Asia. 

Python3




worldfiltered = world[world.continent == "Asia"]
worldfiltered.plot(column ='gdp_md_est', cmap ='Reds')


GDP of Countries in Asia

cmap property is used to plot the data in the shade specified. The darker shades mean higher value while the lighter shades means lower value. Now, let’s analyse the data for population estimate(pop_est).

Python3




world.plot(column ='pop_est')


Output:

Population Estimate

The above image is not very good in conveying the data. So let’s change some properties to make it more comprehensible. First, let’s increase the size of the figure and then set an axis for it. We first plot the world map without any data to on the axis and then we overlay the plot with the data on it with the shade red. This way the map is more clear and dark and makes the data more understandable. However, this map is still a little vague and won’t tell us what the shades mean.

Python3




fig, ax = plt.subplots(1, figsize =(16, 8))
world.plot(ax = ax, color ='black')
world.plot(ax = ax, column ='pop_est', cmap ='Reds')


Output:

World Population 

Let’s import the toolkits that allow us to make dividers within the plot. After this we are going to plot the graph as we did before, but this time we are going to add a facecolor. The facecolor property is going to change the background to a color it is set to(in this case, light blue). Now we need to create a divider for creating the color box within the graph, much like dividers in HTML. We are creating a divider and setting its properties like size, justification etc. 

Then we need to create the color box in the divider we created. So obviously, the highest value in the color box is going to be the highest population in the dataset and the lowest value is going to be zero. 

Python3




from mpl_toolkits.axes_grid1 import make_axes_locatable
  
  
fig, ax = plt.subplots(1, figsize =(16, 8),
                       facecolor ='lightblue')
  
world.plot(ax = ax, color ='black')
world.plot(ax = ax, column ='pop_est', cmap ='Reds',
           edgecolors ='grey')
  
# axis for the color bar
div = make_axes_locatable(ax)
cax = div.append_axes("right", size ="3 %", pad = 0.05)
  
# color bar
vmax = world.pop_est.max()
mappable = plt.cm.ScalarMappable(cmap ='Reds',
                                 norm = plt.Normalize(vmin = 0, vmax = vmax))
cbar = fig.colorbar(mappable, cax)
  
ax.axis('off')
plt.show()


Output:

World Population

Thus in this article we have seen how we can use GeoPandas to get geospatial data and plot it using Matplotlib. Custom datasets can be used to analyse specific data and city-wise data can also be used. Also, GeoPandas can be used with Open Street Maps, which provides very specific geospatial data(example, streets, hospitals in a city etc., ). The same knowledge can be extended further and can be used for specific statistical and data analysis. 

RELATED ARTICLES

Most Popular

Recent Comments