Extract CSS tag from a given HTML using Python

By Dominic Rubhabha-Wardslaus

25 July 2024

0

2

Prerequisite: Implementing Web Scraping in Python with BeautifulSoup

In this article, we are going to see how to extract CSS from an HTML document or URL using python.

Module Needed:

bs4: Beautiful Soup(bs4) is a Python library for pulling data out of HTML and XML files. This module does not come built-in with Python. To install this type the below command in the terminal.

pip install bs4

requests: Requests allows you to send HTTP/1.1 requests extremely easily. This module also does not comes built-in with Python. To install this type the below command in the terminal.

pip install requests

Approach:

Import module
Create an HTML document and specify the CSS tag into the code
Pass the HTML document into the Beautifulsoup() function
Now traverse the tag with the select() method.

Implementation:

Python3

# import module 
from bs4 import BeautifulSoup 
  
# Html doc 
html_doc = """ 
<html> 
<head> 
<title>Geeks</title> 
</head> 
<body> 
<h2>paragraphs</h2> 
  
<p>Welcome Lazyroar.</p> 
  
  
<p>Hello Lazyroar.</p> 
  
<a class="example" href="www.neveropen.com" id="dsx_23">java</a> 
<a class="example" href="www.neveropen.com/python"  id="sdcsdsdf">python</a> 
</body> 
</html> 
"""
soup = BeautifulSoup(html_doc, "lxml") 
  
# traverse CSS from soup 
print("display by CSS class:") 
print(soup.select(".example")) 

Output:

display by CSS class:
[<a class="example" href="www.neveropen.com" id="dsx_23">java</a>, 
<a class="example" href="www.neveropen.com/python" id="sdcsdsdf">python</a>]

Now let’s get the CSS tag with URL:

Python3

# import module 
from bs4 import BeautifulSoup 
import requests 
  
# link for extract html data 
# Making a GET request  
      
def getdata(url): 
    r=requests.get(url) 
    return r.text 
html_doc = getdata('https://www.geeksforgeeks.org/') 
soup = BeautifulSoup(html_doc,"lxml") 
  
# traverse CSS from soup 
  
print("\nTags by CSS class:") 
print(soup.select(".header-main__wrapper"))

Output:

Extract CSS tag from a given HTML using Python

Python3

Python3

Java Program for Longest Common Subsequence

Maximum height of Tree when any Node can be considered as Root

Print Fibonacci sequence using 2 variables

LEAVE A REPLY Cancel reply

Most Popular

Verizon will basically pay you to buy the new, awesome Barbie phone

8 Best VPNs for Apple TV in 2024: Fast & Secure by Penka Hristovska

Samsung offers free screen replacements for users still suffering green line issues

7 Best Free Antiviruses for Mac in 2024: Are They Any Good? by Katarina Glamoslija

Recent Comments

EDITOR PICKS

Verizon will basically pay you to buy the new, awesome Barbie phone

8 Best VPNs for Apple TV in 2024: Fast & Secure by Penka Hristovska

Samsung offers free screen replacements for users still suffering green line issues

POPULAR POSTS

Verizon will basically pay you to buy the new, awesome Barbie phone

8 Best VPNs for Apple TV in 2024: Fast & Secure by Penka Hristovska

Samsung offers free screen replacements for users still suffering green line issues

POPULAR CATEGORY

ABOUT US

FOLLOW US