turicreate.SFrame.drop_duplicates

SFrame.drop_duplicates(subset)

Returns an SFrame with duplicate rows removed.

Parameters:
subset : column label or sequence of labels

Use only these columns for identifying duplicates.

Examples

>>> import turicreate as tc
>>> sf = tc.SFrame({'A': ['a', 'b', 'a', 'C'], 'B': ['b', 'a', 'b', 'D'], 'C': [1, 2, 1, 8]})
>>> sf.drop_duplicates(subset=["A","B"])
Columns:
        A       str
        B       str
        C       int
Rows: 3
Data:
+---+---+---+
| A | B | C |
+---+---+---+
| b | a | 2 |
| C | D | 8 |
| a | b | 1 |
+---+---+---+
[3 rows x 3 columns]