2022-09-17
Numpy And Pandas Deduplication

numpy remove duplicates from array

1
2
print(np.unique(ar, axis=1))

dupandas: remove duplicates with custom rules like levenshtein distance, spelling differences and phonetics (fuzzy maching) for english (most likely?)

1
2
pip install dupandas

pandas drop_duplicates

1
2
df.drop_duplicates(subset=['brand', 'style'], keep='last')

Read More