Sometimes, while working with Python list, we can have a problem in which we can have a list of tuples and we wish to remove them basis of first element of tuple to avoid it’s consecutive duplication. Let’s discuss certain way in which this problem can be solved.
Method : Using groupby() + itemgetter() + next() This task can be performed using combination of these functions. In this, we convert the list to iterator for faster access using next(), itemgetter() is used to get the index of tuple on which we need to perform the deletion(in this case first) and groupby() performs the final grouping of elements.
Python3
# Python3 code to demonstrate working of # Remove Consecutive tuple according to key # using itemgetter() + next() + groupby() from operator import itemgetter from itertools import groupby # initialize list test_list = [( 4 , 5 ), ( 4 , 6 ), ( 7 , 8 ), ( 7 , 1 ), ( 7 , 0 ), ( 8 , 1 )] # printing original list print ("The original list is : " + str (test_list)) # Remove Consecutive tuple according to key # using itemgetter() + next() + groupby() res = [ next (group) for key, group in groupby(test_list, key = itemgetter( 0 ))] # printing result print (" List after Consecutive tuple removal : " + str (res)) |
The original list is : [(4, 5), (4, 6), (7, 8), (7, 1), (7, 0), (8, 1)] List after Consecutive tuple removal : [(4, 5), (7, 8), (8, 1)]
Time Complexity: O(n*n), where n is the length of the list test_list
Auxiliary Space: O(n) additional space of size n is created where n is the number of elements in the res list
Method#2 : Using loop()
In this code, we loop over each tuple in the original list and check if its first element is different than the previous tuple’s first element (or if it’s the first tuple). If it is, we add it to the filtered list. If it’s not, we skip it. This way, we only keep the first tuple with each unique first element in the original list. Finally, we print the filtered list.
Python3
# Define the original list original_list = [( 4 , 5 ), ( 4 , 6 ), ( 7 , 8 ), ( 7 , 1 ), ( 7 , 0 ), ( 8 , 1 )] # Initialize an empty list to hold the filtered items filtered_list = [] # Loop over each tuple in the original list for i in range ( len (original_list)): # Check if this tuple has a different first element than the previous one (or if it's the first one) if i = = 0 or original_list[i][ 0 ] ! = original_list[i - 1 ][ 0 ]: filtered_list.append(original_list[i]) # Print the filtered list print (filtered_list) |
[(4, 5), (7, 8), (8, 1)]
Time complexity: O(n)
Auxiliary Space: O(k)
Method#3 : Using numpy:
Algorithm:
- Create the original list of tuples: original_list = [(4, 5), (4, 6), (7, 8), (7, 1), (7, 0), (8, 1)]
- Convert the original list into a numpy array: array = np.array(original_list)
- Use array[:, 0] to get the first column of the numpy array, which contains the unique values we want to filter by.
- Use np.unique with return_index=True to get the indices of the unique elements in the first column: indices = np.unique(array[:, 0], return_index=True)[1]
- Use these indices to extract the unique rows from the original array: unique_rows = array[indices]
- Convert the unique rows back into tuples using a list comprehension: filtered_list = [tuple(row) for row in unique_rows]
- Print the resulting filtered list: print(filtered_list)
Python3
import numpy as np # Define the original list original_list = [( 4 , 5 ), ( 4 , 6 ), ( 7 , 8 ), ( 7 , 1 ), ( 7 , 0 ), ( 8 , 1 )] # printing original list print ( "The original list is : " + str (original_list)) # Convert the list into a numpy array array = np.array(original_list) # Get the indices of the unique elements based on the first column indices = np.unique(array[:, 0 ], return_index = True )[ 1 ] # Get the unique elements based on the indices res = [ tuple (row) for row in array[indices]] # printing result print ( "List after Consecutive tuple removal : " + str (res)) #This code is contributed by jyothi pinjala. |
Output:
The original list is : [(4, 5), (4, 6), (7, 8), (7, 1), (7, 0), (8, 1)] List after Consecutive tuple removal : [(4, 5), (7, 8), (8, 1)]
Time complexity:
Converting the original list to a numpy array takes O(n) time, where n is the number of elements in the list.
Finding the unique elements based on the first column using np.unique() takes O(nlogn) time.
Extracting the unique elements based on the indices takes O(k), where k is the number of unique elements.
Converting the extracted elements back to tuples takes O(k) time.
Therefore, the overall time complexity is O(nlogn), dominated by the np.unique() function.
Space complexity:
Converting the original list to a numpy array takes O(n) space.
Finding the unique elements based on the first column using np.unique() takes O(n) space for the intermediate arrays created.
Extracting the unique elements based on the indices takes O(k) space for the output array.
Converting the extracted elements back to tuples takes O(k) space for the output list.
Therefore, the overall space complexity is O(n).