Prerequisite : Pattern Matching with Python Regex
Given the URL of a text file, the task is to extract all the email IDs from that file and print them. The urllib.request
library can be used to handle all the URL-related work.
Example :
Input :
Hello
This is GeeksforLazyroar
review-team@geeksforgeeks.org
review-team@geeksforgeeks.org
GfG is a portal for Lazyroar
feedback@geeksforgeeks.org
careers@geeksforgeeks.org

Output :
[]
[]
['review-team@geeksforgeeks.org']
['review-team@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']
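The text file at the URL can be opened with urllib.request. To extract the emails with regular expressions, the re
library can be used. Each line of the file is decoded and searched separately, so the program prints one list of matches per line. For more details on regular expressions, see the prerequisite article, Pattern Matching with Python Regex.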
    # Library that handles the URL work
    import urllib.request

    # Module required for regular expressions
    import re

    # Placeholder: the address of the text file is not shown in the original
    url = "<URL of the text file>"

    # Open the URL; the response can be iterated like a file object
    fhand = urllib.request.urlopen(url)

    for line in fhand:
        # Read the text file content line by line
        s = line.decode().strip()

        # Regex for extracting all email IDs from the line
        reg = re.findall(r"[A-Za-z0-9._%+-]+"
                         r"@[A-Za-z0-9.-]+"
                         r"\.[A-Za-z]{2,4}", s)

        # Print the list of matches for this line
        print(reg)
Output :
[]
[]
['review-team@geeksforgeeks.org']
['review-team@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']
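To try the pattern without a network connection, the same regular expression can be applied to lines held in memory. The following is a minimal sketch, not part of the original program: the helper name extract_emails and the sample list are assumptions made for illustration, with the sample loosely modelled on the example input.

```python
import re

# Same pattern as in the program above, compiled once for reuse.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,4}")


def extract_emails(lines):
    """Return one list of matches per input line (hypothetical helper)."""
    return [EMAIL_RE.findall(line.strip()) for line in lines]


# Illustrative sample lines; they stand in for the downloaded text file.
sample = [
    "Hello",
    "review-team@geeksforgeeks.org",
    "GfG is a portal",
    "feedback@geeksforgeeks.org careers@geeksforgeeks.org",
]

for found in extract_emails(sample):
    print(found)
```

A line with several addresses yields a list with several entries, and a line with none yields an empty list. Because the final part of the pattern is {2,4}, an address whose top-level domain is longer than four letters is matched only up to its fourth letter.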