Collapse multiple Columns in Pandas

27 July 2024

0

While operating dataframes in Pandas, we might encounter a situation to collapse the columns. Let it be
cumulated data of multiple columns or collapse based on some other requirement. Let’s see how to collapse multiple columns in Pandas.

Following steps are to be followed to collapse multiple columns in Pandas:

Step #1: Load numpy and Pandas.
Step #2: Create random data and use them to create a pandas dataframe.
Step #3: Convert multiple lists into a single data frame, by creating a dictionary for each list with a name.
Step #4: Then use Pandas dataframe into dict. A data frame with columns of data and column for names is ready.
Step #5: Specify which columns are to be collapsed. That can be done by specifying the mapping as a dictionary, where the keys are the names of columns to be combined or collapsed and the values are the names of the resulting column.

Example 1:

# Python program to collapse 
# multiple Columns using Pandas 
import pandas as pd 
  
# sample data 
n = 3
Sample_1 = [57, 51, 6] 
Sample_2 = [92, 16, 19] 
Sample_3 = [15, 93, 71] 
Sample_4 = [28, 73, 31] 
  
sample_id = zip(["S"]*n, list(range(1, n + 1))) 
  
s_names = [''.join([w[0], str(w[1])]) for w in sample_id] 
  
d = {'s_names': s_names, 'Sample_1': Sample_1,  
     'Sample_2': Sample_2, 'Sample_3': Sample_3, 
     'Sample_4': Sample_4} 
  
df_1 = pd.DataFrame(d) 
  
mapping = {'Sample_1': 'Result_1', 
           'Sample_2': 'Result_1',  
           'Sample_3': 'Result_2',  
           'Sample_4': 'Result_2'} 
  
df = df_1.set_index('s_names').groupby(mapping, axis = 1).sum() 
  
df.reset_index(level = 0) 

Output:

Example 2:

# Python program to collapse 
# multiple Columns using Pandas 
import pandas as pd 
df = pd.DataFrame({'First': ['Manan ', 'Raghav ', 'Sunny '], 
                   'Last': ['Goel', 'Sharma', 'Chawla'], 
                   'Age':[12, 24, 56]}) 
  
mapping = {'First': 'Full Name', 'Last': 'Full Name'} 
  
df = df.set_index('Age').groupby(mapping, axis = 1).sum() 
  
df.reset_index(level = 0) 

Output:

Collapse multiple Columns in Pandas

Java Program for Longest Common Subsequence

Maximum height of Tree when any Node can be considered as Root

Print Fibonacci sequence using 2 variables

LEAVE A REPLY Cancel reply

Most Popular

Samsung offers free screen replacements for users still suffering green line issues

7 Best Free Antiviruses for Mac in 2024: Are They Any Good? by Katarina Glamoslija

Is Microsoft Teams Secure? Use Teams Safely in 2024 by Tyler Cross

Interview With Willem Dewulf – CEO of ProBackup by Shauli Zacks

Recent Comments

EDITOR PICKS

Samsung offers free screen replacements for users still suffering green line issues

7 Best Free Antiviruses for Mac in 2024: Are They Any Good? by Katarina Glamoslija

Is Microsoft Teams Secure? Use Teams Safely in 2024 by Tyler Cross

POPULAR POSTS

Samsung offers free screen replacements for users still suffering green line issues

7 Best Free Antiviruses for Mac in 2024: Are They Any Good? by Katarina Glamoslija

Is Microsoft Teams Secure? Use Teams Safely in 2024 by Tyler Cross

POPULAR CATEGORY

ABOUT US

FOLLOW US