Count the number of paragraph tag using BeautifulSoup

26 July 2024

1

Sometimes, while extracting data from an HTML webpage, do you want to know how many paragraph tags are used in a given HTML document? Don’t worry we will discuss about this in this article.

Syntax:

print(len(soup.find_all("p")))

Approach:

Step 1: First, import the libraries, BeautifulSoup, and os.

from bs4 import BeautifulSoup as bs
import os

Step 2: Now, remove the last segment of the path by entering the name of the Python file in which you are currently working.

base=os.path.dirname(os.path.abspath(‘#Name of Python file in which you are currently working’))

Step 3: Then, open the HTML file from which you want to read the value.

html=open(os.path.join(base, ‘#Name of HTML file from which you wish to read value’))

Step 4: Moreover, parse the HTML file in BeautifulSoup.

soup=bs(html, 'html.parser')

Step 5: Next, print a certain line if you want to.

print("Number of paragraph tags:")

Step 6: Finally, calculate and print the number of paragraph tags in the HTML document.

print(len(soup.find_all("p")))

Implementation:

Example 1

Let us consider the simple HTML webpage, which has numerous paragraph tags.

HTML

<!DOCTYPE html> 
<html> 
    
 <head> 
   Geeks For Geeks 
 </head> 
    
 <body> 
     
 <div> 
       
<p>King</p> 
  
<p>Prince</p> 
  
<p>Queen</p> 
  
 </div> 
     
 <p id="vinayak">Princess</p> 
  
  </body> 
    
</html>

For finding the number of paragraph tags in the above HTML webpage, implement the following code.

Python

# Python program to get number of paragraph tags 
# of a given HTML document in Beautifulsoup 
  
# Import the libraries beautifulsoup  
# and os 
from bs4 import BeautifulSoup as bs 
import os 
  
# Open the HTML file 
html = open('gfg.html') 
  
# Parse HTML file in Beautiful Soup 
soup = bs(html, 'html.parser') 
  
# Print a certain line 
print("Number of paragraph tags:") 
  
# Calculating and printing the 
# number of paragraph tags 
print(len(soup.find_all("p"))) 

Output:

Example 2

In the below program, we will find the number of paragraph tags on a particular website.

Python

# Python program to get number of paragraph tags 
# of a given Website in Beautifulsoup 
  
# Import the libraries beautifulsoup  
# and os 
from bs4 import BeautifulSoup as bs 
import os 
import requests 
  
# Assign URL 
URL = 'https://www.geeksforgeeks.org/'
  
# Page content from Website URL 
page = requests.get(URL) 
  
# Parse HTML file in Beautiful Soup 
soup = bs(page.content, 'html.parser') 
  
# Print a certain line 
print("Number of paragraph tags:") 
  
# Calculating and printing the 
# number of paragraph tags 
print(len(soup.find_all("p"))) 

Output:

Count the number of paragraph tag using BeautifulSoup

Syntax:

Approach:

Implementation:

HTML

Python

Python

Java Program for Longest Common Subsequence

Maximum height of Tree when any Node can be considered as Root

Print Fibonacci sequence using 2 variables

LEAVE A REPLY Cancel reply

Most Popular

5 Best VPNs for New Jersey in 2025: Tested & Confirmed by Gjurgjica Panova

5 Best VPNs for Boston in 2025: Fast, Secure & Reliable by Todd Faulk

Amazon’s secret software is already replacing the Android-based Fire OS

Android 16 beta 4 quietly removes a small but useful Pixel feature

Recent Comments

EDITOR PICKS

5 Best VPNs for New Jersey in 2025: Tested & Confirmed by Gjurgjica Panova

5 Best VPNs for Boston in 2025: Fast, Secure & Reliable by Todd Faulk

Amazon’s secret software is already replacing the Android-based Fire OS

POPULAR POSTS

5 Best VPNs for New Jersey in 2025: Tested & Confirmed by Gjurgjica Panova

5 Best VPNs for Boston in 2025: Fast, Secure & Reliable by Todd Faulk

Amazon’s secret software is already replacing the Android-based Fire OS

POPULAR CATEGORY

ABOUT US

FOLLOW US