Thursday, December 26, 2024
Google search engine
HomeLanguagesHow to Count the NaN Occurrences in a Column in Pandas Dataframe?

How to Count the NaN Occurrences in a Column in Pandas Dataframe?

The data frame is divided into cells, which can store a value belonging to some data structure as well as it may contain missing or NA values. The pandas package contains various in-built functions, to check if the value in the cell of a data frame is either NA or not, and also to perform aggregations over these NA values.

Method #1: Using In-built methods isna() and sum() on the dataframe.

The isna() function is used to detect missing/none values and return a boolean array of length equal to the data frame element over which it is applied and the sum() method is used to calculate a total of these missing values.

Python3




# importing necessary packages
import pandas as pd
import numpy as np
  
# creating data
data = [[1, "M", np.nan], [5, "A", 3.2], [
    np.nan, np.nan, 4.6], [1, "D", np.nan]]
  
# converting data to data frame
data_frame = pd.DataFrame(data, 
                          columns=["col1", "col2", "col3"])
  
# printing original data frame
print("\nOriginal Data Frame:")
print(data_frame)
  
# counting NaN values of col1
cnt = data_frame["col1"].isna().sum()
  
# printing count of NaN values
print("\nNan values in col1:", cnt)


Output:

Method #2: Using the length of the dataframe

The count of the values contained in any particular column of the data frame is subtracted from the length of dataframe, that is the number of rows in the data frame. The count() method gives us the total number of NaN values in a specified column and the length(dataframe) gives us the length of the data frame, that is the total number of rows in the frame. 

Python3




# importing necessary packages
import pandas as pd
import numpy as np
  
# creating data
data = [[1, "M", np.nan], [5, "A", 3.2],
        [np.nan, np.nan, 4.6], [1, "D", np.nan]]
  
# converting data to data frame
data_frame = pd.DataFrame(data, columns=["col1", "col2", "col3"])
  
# printing original data frame
print("\nOriginal Data Frame:")
print(data_frame)
  
# counting NaN values of col1
length = len(data_frame)
count_in_col3 = data_frame['col3'].count()
cnt = length - count_in_col3
  
# printing count of NaN values
print("\nNan in col3:", cnt)


Output:

Dominic Rubhabha-Wardslaus
Dominic Rubhabha-Wardslaushttp://wardslaus.com
infosec,malicious & dos attacks generator, boot rom exploit philanthropist , wild hacker , game developer,
RELATED ARTICLES

Most Popular

Recent Comments