Wednesday, December 25, 2024
Google search engine
HomeLanguagesHow to delete child element in BeautifulSoup?

How to delete child element in BeautifulSoup?

Beautifulsoup is a Python library used for web scraping. This powerful python tool can also be used to modify html webpages. This article depicts how beautifulsoup can be employed to delete child element. For this, various methods of the module is used.

Methods Used:

  • clear(): Tag.clear() deletes the tag from the tree of a given HTML document.
  • decompose(): Tag.decompose() removes a tag from the tree of a given HTML document, then completely destroys it and its contents.
  • replace(): Tag.replace() replaces a particular tag with a new tag.

Approach:

  • Import module.
  • Scrap data from webpage.
  • Parse the string scraped to html.
  • Find the tag whose child element to be deleted.
  • Use any of the methods: clear(), decompose() or replace().
  • Print replaced content.

Example 1:

Python3




# importing module
from bs4 import BeautifulSoup
  
markup = """
  
<!DOCTYPE>
<html>
  <head><title>Example</title></head>
  <body>
  <div id="parent">
    
<p>
    This is child of div with id = "parent".
    <span>Child of "P"</span>
  </p>
  
  <div>
  Another Child of div with id = "parent".
  </div>
  </div>
  
    
<p>
  Piyush
  </p>
  
                     
  </body>
</html>
"""
  
# parsering string to HTML
soup = BeautifulSoup(markup, 'html.parser')
  
# finding tag whose child to be deleted
div_bs4 = soup.find('div')
  
# delete the child element
div_bs4.clear()
  
print(div_bs4)


Output:

<div id="parent"></div>

Example 2:

Python3




# importing module
from bs4 import BeautifulSoup
  
markup = """
  
<!DOCTYPE>
<html>
  <head><title>Example</title></head>
  <body>
  <div id="parent">
    
<p>
    This is child of div with id = "parent".
    <span>Child of "P"</span>
  </p>
  
  <div>
  Another Child of div with id = "parent".
  </div>
  </div>
  
    
<p>
  Piyush
  </p>
  
                     
  </body>
</html>
"""
  
# parsering string to HTML
soup = BeautifulSoup(markup, 'html.parser')
  
# finding tag whose child to be deleted
div_bs4 = soup.find('div')
  
# delete the child element
div_bs4.decompose()
  
print(div_bs4)


Output:

<None></None>

Example 3:

Python3




# importing module
from bs4 import BeautifulSoup
  
markup = """
  
<!DOCTYPE>
<html>
  <head><title>Example</title></head>
  <body>
  <div id="parent">
    
<p>
    This is child of div with id = "parent".
    <span>Child of "P"</span>
  </p>
  
  <div>
  Another Child of div with id = "parent".
  </div>
  </div>
  
    
<p>
  Piyush
  </p>
  
                     
  </body>
</html>
"""
  
# parsering string to HTML
soup = BeautifulSoup(markup, 'html.parser')
  
# finding tag whose child to be deleted
div_bs4 = soup.find('div')
  
# delete the child element
div_bs4.replaceWith('')
  
print(div_bs4)


Output:

<div id="parent">
<p>
    This is child of div with id = "parent".
    <span>Child of "P"</span>
</p>
<div>
  Another Child of div with id = "parent".
  </div>
</div>

RELATED ARTICLES

Most Popular

Recent Comments