Thursday, December 18, 2025

Pandas Groupby and Computing Median

Pandas is one of the most popular and powerful Python libraries for data analysis, largely because it makes it easy to work with a large dataset as a whole or with subsets of it. This article shows how the pandas groupby() functionality works and how much effort it saves on a large dataset. It then uses groupby() together with median() to solve two real-world problems.

Pandas groupby()

The groupby() method in pandas splits a dataset into subsets so that computations can be carried out per subset. In general, a groupby operation splits the data, applies a function to each group, and then combines the results. For example, given data on alcohol consumption in different countries, a continent-wise analysis becomes simple with groupby(): it splits the data by continent, and median() then calculates the median for each continent.
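
The split, apply, and combine steps described above can be sketched on a tiny hand-made table. The numbers below are made up for illustration and are not taken from drinksbycountry.csv; only the column names follow the dataset used later in this article.

```python
import pandas as pd

# Made-up values for illustration only (not real consumption figures).
df = pd.DataFrame({
    "continent": ["Asia", "Europe", "Asia", "Europe", "Asia"],
    "total_litres_of_pure_alcohol": [0.0, 4.0, 2.0, 12.0, 5.0],
})

# Split: iterate over one sub-frame per continent.
# Apply: take the median of the column inside each sub-frame.
# Combine: collect the per-group results into a single Series.
manual = pd.Series({
    name: group["total_litres_of_pure_alcohol"].median()
    for name, group in df.groupby("continent")
})

# groupby() followed by median() performs the same three steps in one call.
medians = df.groupby("continent")["total_litres_of_pure_alcohol"].median()
print(medians)
```

Both approaches give 2.0 for Asia and 8.0 for Europe; the one-line form is what the examples in this article use.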

Syntax (pandas 1.x; the squeeze parameter was removed in pandas 2.0):

DataFrame.groupby(by=None, axis=0, level=None, as_index=True, sort=True, group_keys=True, squeeze=<no_default>, observed=False, dropna=True)
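
To see what two of these parameters do in practice, here is a small sketch of as_index and dropna. The table is hand-made for illustration (the values are not from any real dataset), and it deliberately contains missing group keys.

```python
import numpy as np
import pandas as pd

# Made-up values; two rows have no continent at all.
df = pd.DataFrame({
    "continent": ["Asia", "Europe", np.nan, "Asia", np.nan],
    "wine_servings": [0, 100, 10, 8, 30],
})

# as_index=False keeps the group key as an ordinary column
# instead of moving it into the index of the result.
flat = df.groupby("continent", as_index=False)["wine_servings"].median()
print(flat)

# dropna=False keeps rows with a missing key as a group of their own;
# with the default dropna=True those rows are silently left out.
with_missing = df.groupby("continent", dropna=False)["wine_servings"].median()
print(with_missing)
```

With the default settings, the two rows without a continent would not contribute to any median, so dropna=False is worth considering whenever the grouping column may contain missing values.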
 

Example 1: Find the median of alcohol consumption continent-wise on a given dataset.

Dataset: drinksbycountry.csv

Python3

# import the packages
import pandas as pd

# read the dataset and preview the first rows
data = pd.read_csv("drinksbycountry.csv")
print(data.head())

# group by continent and find the median
# of total_litres_of_pure_alcohol
print(data.groupby("continent")["total_litres_of_pure_alcohol"].median())

# group by continent and find the median
# of wine_servings
print(data.groupby("continent")["wine_servings"].median())


Output:

[Output image: median of total_litres_of_pure_alcohol by continent]

[Output image: median of wine_servings by continent]
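
The two medians above can also be computed in a single pass by selecting both columns before calling median(). The sketch below uses a small hand-made table with the same column names as the dataset; the values are invented for illustration.

```python
import pandas as pd

# Made-up values; only the column names mirror drinksbycountry.csv.
df = pd.DataFrame({
    "continent": ["Africa", "Africa", "Africa", "Europe", "Europe"],
    "total_litres_of_pure_alcohol": [0.5, 1.5, 6.0, 10.0, 12.0],
    "wine_servings": [1, 3, 20, 150, 250],
})

# Selecting a list of columns returns a DataFrame
# with one median column per selected column.
cols = ["total_litres_of_pure_alcohol", "wine_servings"]
both = df.groupby("continent")[cols].median()
print(both)
```

The result has one row per continent and one column per selected measure, which is convenient when several medians need to be compared side by side.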

Example 2: Find the median of the total population, grouped by age group, on a given dataset.

Dataset: WorldPopulationByAge2020.csv

Python3

# import packages
import pandas as pd

# read the dataset and preview the first rows
data = pd.read_csv("WorldPopulationByAge2020.csv")
print(data.head())

# group by AgeGrp and find the median of PopTotal
print(data.groupby("AgeGrp")["PopTotal"].median())


Output:

[Output image: Group by Age]
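
One detail to watch when grouping by age labels: groupby() sorts the group keys by default, and string labels sort lexicographically rather than numerically. The sketch below assumes AgeGrp holds string labels such as "0-4", "5-9", and "10-14" (an assumption about the file's format), and the population numbers are made up.

```python
import pandas as pd

# Made-up values; the label format of AgeGrp is an assumption.
df = pd.DataFrame({
    "AgeGrp": ["0-4", "5-9", "10-14", "0-4", "5-9", "10-14"],
    "PopTotal": [100.0, 90.0, 80.0, 300.0, 330.0, 360.0],
})

# Default sort=True orders the keys as strings, so "10-14" lands before "5-9".
sorted_keys = df.groupby("AgeGrp")["PopTotal"].median()
print(list(sorted_keys.index))

# sort=False keeps the groups in order of first appearance in the data.
file_order = df.groupby("AgeGrp", sort=False)["PopTotal"].median()
print(list(file_order.index))
```

The medians themselves are identical either way; only the order of the rows in the result changes, which matters mainly for display and plotting.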
