turicreate.SFrame.drop_duplicates
SFrame.drop_duplicates(subset)
Returns an SFrame with duplicate rows removed.
Parameters:
- subset : column label or sequence of labels
  Use only these columns for identifying duplicates.
Examples
>>> import turicreate as tc
>>> sf = tc.SFrame({'A': ['a', 'b', 'a', 'C'],
...                 'B': ['b', 'a', 'b', 'D'],
...                 'C': [1, 2, 1, 8]})
>>> sf.drop_duplicates(subset=["A","B"])
Columns:
    A   str
    B   str
    C   int

Rows: 3

Data:
+---+---+---+
| A | B | C |
+---+---+---+
| b | a | 2 |
| C | D | 8 |
| a | b | 1 |
+---+---+---+
[3 rows x 3 columns]
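
To make the role of subset concrete, the following is a minimal pure-Python sketch of subset-based de-duplication over a list of dicts. It is not Turi Create's implementation: the helper name drop_duplicates_by_subset is hypothetical, and keeping the first occurrence of each key while preserving input order is an assumption made for illustration. The SFrame output above shows that row order in the real result may differ.

```python
def drop_duplicates_by_subset(rows, subset):
    """Hypothetical helper: keep one row per distinct combination of `subset` values.

    Assumption: the first row seen for each key is kept and input order is preserved.
    """
    # Accept a single column label as well as a sequence of labels.
    if isinstance(subset, str):
        subset = [subset]

    seen = set()
    unique_rows = []
    for row in rows:
        # Only the subset columns take part in the duplicate check.
        key = tuple(row[col] for col in subset)
        if key not in seen:
            seen.add(key)
            unique_rows.append(row)
    return unique_rows


# Same data as the SFrame example, expressed row by row.
rows = [
    {'A': 'a', 'B': 'b', 'C': 1},
    {'A': 'b', 'B': 'a', 'C': 2},
    {'A': 'a', 'B': 'b', 'C': 1},
    {'A': 'C', 'B': 'D', 'C': 8},
]

result = drop_duplicates_by_subset(rows, subset=["A", "B"])
print(len(result))  # 3
```

The third input row shares its ("A", "B") values with the first, so only three rows remain, matching the row count in the SFrame example.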