Wednesday, December 25, 2024
Google search engine
HomeLanguagesPython – Extract elements with Frequency greater than K

Python – Extract elements with Frequency greater than K

Given a List, extract all elements whose frequency is greater than K.

Input : test_list = [4, 6, 4, 3, 3, 4, 3, 4, 3, 8], K = 3 
Output : [4, 3] 
Explanation : Both elements occur 4 times. 

Input : test_list = [4, 6, 4, 3, 3, 4, 3, 4, 6, 6], K = 2 
Output : [4, 3, 6] 
Explanation : Occur 4, 3, and 3 times respectively.

Method #1 : Using count() + loop

In this, we use count() to get the frequency, and a loop is used to iterate for each of the elements of the List. 

Python3




# Python3 code to demonstrate working of
# Extract elements with Frequency greater than K
# Using count() + loop
 
# initializing list
test_list = [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
 
# printing string
print("The original list : " + str(test_list))
 
# initializing K
K = 2
 
res = []
for i in test_list:
     
    # using count() to get count of elements
    freq = test_list.count(i)
     
    # checking if not already entered in results
    if freq > K and i not in res:
        res.append(i)
 
# printing results
print("The required elements : " + str(res))


Output

The original list : [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
The required elements : [4, 3]

Time Complexity: O(n)
Auxiliary Space: O(n)

Method #2 : Using list comprehension + Counter()

In this, we perform the task of counting using Counter() and the iteration part is done inside list comprehension.

Python3




# Python3 code to demonstrate working of
# Extract elements with Frequency greater than K
# Using list comprehension + Counter()
from collections import Counter
 
# initializing list
test_list = [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
 
# printing string
print("The original list : " + str(test_list))
 
# initializing K
K = 2
 
# using list comprehension to bind result
res = [ele for ele, cnt in Counter(test_list).items() if cnt > K]
 
# printing results
print("The required elements : " + str(res))


Output

The original list : [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
The required elements : [4, 3]

Time complexity: O(n), where n is the length of the test_list. The list comprehension + Counter() takes O(n) time
Auxiliary Space: O(n), extra space of size n is required

Method #3: Using a dictionary

In this we keep count of the elements, if the count of the element id K + 1, we add that element to the output.

Python3




test_list = [4, 6, 4, 3, 3, 4, 3, 4, 6, 6]
k = 2
unique_elems = []
freq_dict = {}
output = []
 
# printing string
print("The original list : " + str(test_list))
 
for i in test_list:
    # Append in the unique element list
    if i not in unique_elems:
        unique_elems.append(i)
        freq_dict[i] = 1
    else:
        # increment the counter if element is duplicate
        freq_dict[i] += 1
 
    # Add in the output list only once
    if freq_dict[i] == k + 1:
        output.append(i)
 
print('The required elements : ', str(output))


Output

The original list : [4, 6, 4, 3, 3, 4, 3, 4, 6, 6]
The required elements :  [4, 3, 6]

Time Complexity: O(n), where n is the length test_list. 
Auxiliary Space: O(n), where n is number of elements in output list.

Method #4: Using operator.countOf() method

Python3




# Python3 code to demonstrate working of
# Extract elements with Frequency greater than K
import operator as op
 
# initializing list
test_list = [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
 
# printing string
print("The original list : " + str(test_list))
 
# initializing K
K = 2
unique_elements = set(test_list)
res = []
for i in unique_elements:
 
    # using operatorcountOf() to get count of elements
    if op.countOf(test_list, i) > K:
        res.append(i)
 
# printing results
print("The required elements : " + str(res))


Output

The original list : [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
The required elements : [3, 4]

Time Complexity:O(N)
Auxiliary Space:O(N)

Method#5: using NumPy module

Python3




import numpy as np
 
test_list = [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
K = 2
 
# use numpy unique to extract unique elements and their frequency
unique_elements, counts = np.unique(test_list, return_counts=True)
 
# extract elements with frequency greater than K
res = unique_elements[counts > K].tolist()
 
# printing results
print("The required elements : " + str(res))
#this code is contributed by Asif_Shaik


Output:

The required elements : [3, 4]

Time Complexity:O(N)
Auxiliary Space:O(N)

Method #6: Using filter() and lambda()

Algorithm:

  1. Import the Counter class from the collections module.
  2. Define the input list test_list and the threshold value K.
  3. Use the Counter class to count the frequency of each element in the test_list.
  4. Use the filter() function with a lambda function to keep only the elements whose frequency is greater than K.
  5. Convert the result into a list and print it

Python3




from collections import Counter
 
test_list = [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
K = 2
freq_dict = Counter(test_list)
res = list(filter(lambda ele: freq_dict[ele] > K, freq_dict))
print("The required elements : " + str(res))
#This code is contributed By Vinay Pinjala.


Output

The required elements : [4, 3]

Time Complexity:

The time complexity of this code is O(n), where n is the length of the input list. This is because the Counter() function from the collections module has a time complexity of O(n), where n is the number of elements in the list. The filter() function and the conversion to a list take linear time, which is also O(n).

Auxiliary Space:

The space complexity of this code is also O(n), where n is the length of the input list. This is because the Counter() function creates a dictionary that stores the frequency count of each element, and the size of this dictionary is proportional to the number of elements in the input list. The output list created by filter() can also be at most n elements long, so the space complexity of the final result is also O(n).

Method #7: Using  defaultdict module from the collections library:

Algorithm:

  1. Import the defaultdict module from the collections library.
  2. Initialize an empty dictionary, freq_dict, using defaultdict(int) to store the frequency count of each element in the test_list.
  3. Set the value of K.
  4. Loop through each key-value pair in freq_dict.
  5. If the frequency of an element is greater than K, append that element to the res list.
  6. Print the final result.
     

Python3




from collections import defaultdict
 
# initializing list
test_list = [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
  
# printing string
print("The original list : " + str(test_list))
  
freq_dict = defaultdict(int)
for num in test_list:
    freq_dict[num] += 1
K = 2
res = []
for num, freq in freq_dict.items():
    if freq > K:
        res.append(num)
 
# printing results
print("The required elements : " + str(res))
#This code is contributed by Jyothi Pinjala.


Output

The original list : [4, 6, 4, 3, 3, 4, 3, 7, 8, 8]
The required elements : [4, 3]

Time Complexity: O(n), where n is the length of the input list. This is because we iterate through the list once to build the frequency dictionary and once again to find elements with frequency greater than K.

Auxiliary Space: O(n), where n is the length of the input list. This is because we store the frequency counts of each element in the test_list in the freq_dict dictionary. The space taken by the res list is proportional to the number of elements with frequency greater than K. However, the worst-case scenario is when all elements have a unique frequency count, resulting in a dictionary with n key-value pairs.
 

RELATED ARTICLES

Most Popular

Recent Comments