Generally, when you think of “Removing Duplicates”, what comes to the mind is removing duplicate entries, that is, when all columns data of that row and same with any other row.
Thus, the rows which have all data same, gets removed and only 1 entry remains.
Excel has a beautiful feature here that enables you to check only certain columns (based on your choice) for same data, and then remove duplicate data accordingly.
For eg.: –
- You have 5 columns in your database, namely, Name, DOB, Department, Region, Branch.
- You want to remove the duplicate entries not by checking all the columns, but by checking only the Department, Region and Branch columns.
- Thus after removing the duplicates what you get is only 1 record (whichever is on the top in the database) with a unique combination of Department, Region and Branch.
To access the Remove Duplicates option in Excel,
- Go to the Data tab on the top, then to Remove Duplicates (under the Data Tools section).
- In the dialog box that opens, you get a list of all column headers in your database. Select the columns that you want to check for duplicates (in this case, see the image as shown below), and press OK.
All duplicate records will be removed and only the first record prevails.