Definition: This function removes duplicate values. When duplicates are found, the first occurrence is kept and the later ones are removed. You can de-duplicate on a single column or on multiple columns, given as a comma-separated list. If you specify multiple columns, the function looks for records whose values repeat across all of those columns together and removes the repeats.

Example: De-duplicating on two columns, locality and student_age. Each record is checked for a repeated combination of locality and student_age, and records that repeat an earlier combination are removed.
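The same behaviour can be sketched outside Ask On Data with pandas. This is only an illustration of the logic described above, not how Ask On Data implements it: the sample data, the name column, and the variable names are assumptions made up for the example. Only the column names locality and student_age come from the text.

```python
import pandas as pd

# Hypothetical sample data; only locality and student_age are named in the text.
students = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Dev", "Eli"],
    "locality": ["North", "North", "South", "North", "South"],
    "student_age": [12, 12, 13, 14, 13],
})

# Treat rows as duplicates when both locality and student_age match,
# and keep the first occurrence of each combination.
deduplicated = students.drop_duplicates(
    subset=["locality", "student_age"], keep="first"
)

print(deduplicated)
```

Here Ben and Eli are dropped because their locality and student_age repeat those of Asha and Chen, while Dev is kept because the combination (North, 14) appears only once.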


Note 1

If you have entered a command in Ask On Data and it asks for variables or other input for that command, but you want to cancel the operation, simply type "Abort".

Note 2

We keep retraining the Ask On Data ML algorithm, so its accuracy will keep improving and it will understand more ways of phrasing commands in simple language.

Reach out with any questions or if you need help.