It will take you about 10 minutes to read this article and in the next 5 minutes (or even faster if you choose the 2nd solution described in the article) you will easily compare two Excel columns for duplicates and remove or highlight the found dupes. Okay, the countdown is started!
Excel is a very powerful and really cool application for creating and processing large arrays of data. Now that you have lots of workbooks with a pool of data, or maybe just one huge table, you may want to compare 2 columns for duplicates and then do something with found entries, for example delete duplicate rows, color dupes or clear the contents of duplicated cells. These two columns may be located in one table, contiguously or non-contiguously, or they may reside in 2 different worksheets or even workbooks.
Say, you have 2 columns with people names - 5 names in column A and 3 names in column B, and you want to compare data between these two columns to find duplicates. As you understand, this is bogus data just for a quick example; in real worksheets you usually have thousands and tens of thousands of entries.
Variant A: Both columns are located on one sheet, in a single table: Column А and Column B
Variant B: Two columns are located on different sheets: Column A in Sheet2 and Column A in Sheet3
The built-in Remove Duplicate tool available in Excel 2016, Excel 2013 and 2010 cannot handle this scenario because it cannot compare data between 2 columns. Furthermore, it can only remove dupes, no other choice such as highlighting or coloring is available, alas :-(.
Further on, I am going to describe 2 possible ways of comparing two Excel columns that let you find and remove duplicate entries:
=IF(ISERROR(MATCH(A1,$B$1:$B$10000,0)),"Unique","Duplicate")
In our formula, A1 is the first cell of the first column that we want to use for comparison. $B$1 and $B$10000 are the addresses of the first and the last cell of the 2nd column that you want to compare against.Pay attention to the absolute cell reference - dollar signs ($) preceding the column letters and row numbers. I use the absolute reference on purpose, in order for the cell addresses to remain unchanged when copying the formula.
If you want to find dupes in Column B, swap the column names so that the formula looks like this:
=IF(ISERROR(MATCH(B1,$A$1:$A$10000,0)),"Unique","Duplicate")
Instead of "Unique"/"Duplicate" you can write your own labels, e.g. "Not found"/"Found", or leave only "Duplicate" and type "" instead of "Unique". In the latter case, you will have empty cells next to cells for which duplicates were not found, I believe such presentation is more convenient for data analysis.
Click the left mouse button and holding it down drag the border downward selecting all cells where you want to copy the formula. When all needed cells are selected, release the left mouse button:
=IF(ISERROR(MATCH(A1,Sheet3!$A$1:$A$10000,0)),"","Duplicate")
Where Sheet3 is the name of the sheet on which the 2nd column is located, and $A$1:$A$10000 are the addresses of the first and last cells of that 2nd column.
Click to download the worksheet with the above examples and the formula to compare 2 columns to find duplicates.
Perfect, we have found the entries in the first column (Column A) that also exist in the second column (Column B). Now we need to do something with them :)
It would be rather ineffective and would take too much time to look through the entire table and review the duplicate entries manually. There are much better ways.
If your columns do not have headers, you need to add them. To do this, put the cursor on the number indicating the 1st row and it will change to a black arrow as shown in the screenshot:
Right click the selected row and choose "Insert" from the context menu:
Give names to your columns, e.g. "Name" and "Duplicate?". Then switch to the Data tab and click Filter:
After that click a tiny grey arrow next to "Duplicate?" to open a drop down list, uncheck all items other than Duplicate in that list, and click OK:
That's it, now you see only those cells of Column A that have duplicated values in Colum B. There are only three such cells in our test worksheet, as you understand in real sheets there are likely to be more, far more of them:
In order to display all rows of Column A again, click the filter symbol in Column B that now looks like a funnel with a tiny arrow and check "Select all". Alternatively, you can do the same via Data tab -> Select & Filter -> Clear, as shown in the screenshot:
If the "Duplicate" flag does not suffice for your purposes and you want to mark duplicated cells by font color or fill color or in some other way…
Then filter the duplicates as explained above, select all filtered cells and press Ctrl + F1 to open the Format Cells dialog box. As an example, let's change the background color of duplicated rows to bright yellow. Of course, you can change the background color of cells using the Fill color option on the Home tab, but the advantage of the Format Cells dialog box is that it lets you make all formatting changes at a time:
Now you definitely won't miss a single duplicated cell:
Filter your table so that only cells with duplicated values show up, and select all those cells.
If 2 columns you are comparing are located on different worksheets, i.e. in separate tables, right-click the selected range and choose "Delete Row" from the context menu:
Click OK when Excel will ask you to confirm that you really want to "Delete entire sheet row" and then clear the filter. As you can see, only the rows with unique values are left:
If 2 columns are located on one worksheet, next to each other (adjacent) or not touching each other (nonadjacent), the removing duplicates is a bit more complex. We cannot delete entire rows that contain duplicate values because this would delete corresponding cells in the 2nd column too. So, in order to leave only unique entries in Column A, you do the following:
As you see, it's not so difficult to remove duplicates between two Excel columns using formulas. Though it's very time-consuming and boring process to write and copy the formula, apply and clear the filter every time you need to compare 2 columns in your worksheets. The other solution I am going to bring to your attention is much simpler and will take just a fraction of time we've spent on the first method. I believe you will find more pleasant things to spend the saved time on ;)
And now let me show you how to compare two columns for duplicates by using our Dedupe tools for Excel.
Remember the previous solution and feel the difference :) It is fast and easy indeed to dedupe your worksheets with Compare Two Tables. In fact, it will take you less time than you've spent on reading this article.
Currently, Compare Tables is part of our Ultimate Suite for Excel, a collection of 60+ professional tools that covert over 300 use cases. The clock is ticking, so hurry up and download it right now!
If you have questions or something is left unclear, please drop me a comment and I'll happily elaborate further. Thank you for reading!
68 responses to "How to compare two columns in Excel and delete duplicates (highlight, color, move)"
Awesome, very helpful.
Thanks for posting.
Ur post is superb. Thanks for doing it and educating us.
Thanks for sharing it. It is a great help to me in fulfill a project which is based on excel sheet. Thanks once again.
Thanks this puts me in safe hands of the excel
Big Thankyou for the clear and immensly helpful tips. Also thanks to Google for searching your link for me :)
Thank you so much... saved me hours of work.
Malene
Thanku so much , Really save lot of work and efforts..
This is a great well written guide. Thanks for the help.
Really this article is very useful one.
Thanks for sharing. Its very useful :)
Thanks for sharing.
To the point information, very nice, thx for sharing this information.
Thanks for the help. This worked like a charm!!!
I can't believe you taught me how to use Excel. Usually it seems so opaque that I give up before trying.
Thanks!
Alex
Awesome, just as you said took few min to read the article and my job was completed in less than a min.
You just saved me a ton of stress.
I really really appreciate you for this.
From Nigeria, Regards!
It didn't work for me either. But this did
=IF(ISERROR(MATCH(B2|$A$2:$A$10000|0))|"Unique"|"Duplicate")
First of all, thanks for the article, but...
This formula doesn't work for me:
=IF(ISERROR(MATCH(B1,$A$1:$A$10000,0)),"Unique","Duplicate")
Hi, I completely understand your question. This was the same problem with me in excel.
And then I got a YouTube channel ( Learn With Lokesh Lalwani) I always learn everything about the excel this channel. I would like to suggest that you can also learn from this channel.
Thanks
It highlights the B1,$A$1 part.
oh thank god for this article. Can't believe how hard it is to find info online on comparing two columns and deduping.
Ok, now this is the working one:
=IF(ISERROR(MATCH(B1;$A$1:$A$10000;0));”Unique”;”Duplicate”)
Merely wanted to mention I am lucky that i stumbled upon your site.
How can i find same values in same coloumn i inserted. (i.e)if i create one column for customer phone number, in tha same column if enter one number again how can i find that?
I am sorry for a delay in response. Please have a look at the variant described in this blog post, it may help:
https://www.ablebits.com/office-addins-blog/2013/10/20/prevent-duplicates-in-excel-column/
If you want to quickly find the previous occurrence, the easiest way is apply Autofilter to your table and filter according to the necessary value when it is needed. This variant is more convenient then using Find All in the column.
How do i find duplicate value of mixed up alphabets say
abc ( Dup)
cba (Dup )
def
ghi
bcd
bca (DUp )
You need a special macro to fulfill your task. You can ask for it in the VBA sections on http://www.mrexcel.com or http://www.excelforum.com
Hi Alex,
Thank you so much for sharing this. It really helped me so much.
Thanks
Hello Alex,
Thank you for this guide. Using the formulas, is it possible to after comparing the two columns and identifying that the value is unique, instead of returning a text value of "unique" or "duplicate" to return the unique value itself?
Hi,
I want compare two sheets data(ALL Columns) into another sheet. I don't know how it is possible please can anyone solve it.
Sheet1:
ID SALARY CLEAVE SLEAVE
12 1000 8 5
11 2000 5 6
10 1500 4 7
Sheet2:
ID SALARY CLEAVE SLEAVE
12 1000 8 5
11 2000 9 6
10 2500 4 7
I want the result like this given below,If both of the sheet have same data that will not display into the result sheet otherwise if there any difference.
Sheet 3:RESUT
RESULT
11 2000 5 6
11 2000 9 6
I want to the result like above.
Dear All,
Could you please help me below scenario :
I need to compare data from first sheet & second sheet and get data column A from first sheet .. for example :
First Sheet :
Original Search key & Code Id
Second Sheet :
Old Search key & Code Id
I need to compare First & Second sheet Code Id's and return Original Search key in second sheet column "C".
I tried VLOOPUP .. I am getting "#NA" value
Regards,
SSRAO
I am sorry, it is difficult to recommend you something without seeing your data.
For us to be able to assist you better, please post a small sample workbook with your data and formula on our forums.
We will look into the issue and try to help.
Great...Really helped
Thank you millions. Helped me a lot
Great! Helped me :)
Great Help.... Thanks a Lot
Thanks a lot buddy. Cheers
how delete even numbers in raw in excel such as:
A B C
1
2
3
4
1001
1002
.
.
these formulas are very helpful, but does anyone have a way to find duplicates in two columns when the data isn't an exact match? I'm comparing ISBN's, and column a contains multiple values in each cell. Column b contains one value (ISBN) per cell. So, I really need to find cells in column a that partially match the value of cells in column b. Make sense?
hey what is the process for the same issue in excel 2007 ?
Thanks for explanation
Hi,
I have two columns as below. Would like to know how I can get the unique data from these two columns in a third column. I have provided below the sample data for merging and the expected data as well. Sample data is in Column A & B while unique data should be in Column C.
Below is the data
Column A
Apple
Orange
Banana
Strawberry
Grapes
Column B
Plums
Apple
Banana
Grapes
Blueberry
Iam looking at the below data in COlumn C
Column C
Apple
Orange
Banana
Strawberry
Grapes
Plums
Blueberry
Would be good if you could share me the excel formula for this.
So incredibly helpful! Thank you!
You are amazing! I've been looking for this information for a while now, so a huge thank you!!! :-)
Thanks.. it served my purpose.
Thank you so much for posting this info.
Thank you for posting! Great formula and thinking!
How do I flag out duplicate data in multiple columns instead of just two columns?
Thank you!!!You are a Champ!
Just what I needed. Thanks
Exactly what I needed. I tried so many other formulas and wasted so much time.
I simply copied and pasted and it worked!
Thank you!
When I try to find the question mark character (?) and hidden space in an Excel sheet, Excel cannot locate the cells containing them. How can I search for ? in Excel?
Really helpful, thank you so much
How To remove both duplicate ?
Doesnt work. Identifies 3 Duplicates and all the rest it claims are unique.
This is awesome - thank you so much!!!
I have 2 columns with some duplicate data within one column and need to eliminate the duplicated portion of the second column. Can anyone please tell me how to do this without manually doing so? See below example; the S.A. Comunale Co. needs to be removed from the second column
First Column Second Column
S.A. Comunale Co. Mr. Art Pecca Comunale Co.
United Way Of St. Paul Mr. Sam Gild United Way Of St. Paul
Basley's Point Sea-Doo Mr. Paul Innoce Beesley's Point Sea-Doo
Please help! I have data in 2 columns on same worksheet. There is a portion within the second column that is duplicate of the first and I need to eliminate the portion that is duplicate of the first. Is there a way to do this without having to go through the excel spreadsheet manually and do?
See example below:
Column 1 Column 2
S.A. Comunale Co. Mr. Art Stuart Comunale Co.
United Way Of St. Paul Mr. Klye Gilt United Way Of St. Paul
Bosley's Point Sea-Doo Mr. Pete Intake Bosley's Point Sea-Doo
Hello, Denise,
Please specify which values belong to which columns exactly. It's not entirely clear from the example you provided where new column starts in rows 2-3.
Also, do you want to find fully duplicated values or partial duplicates? Please give me more details on the comparison criteria.
HI Alex, I am not sure if I am missing or not understanding a step but I am trying to compare addresses. I have set up my addresses on two different sheets in Excel and I see it populate the duplicated lists but when I compare my original list to the list that shows duplicates, I see that not all of the addresses are accurately marked. Do you have any additional pointers?
Hello, Katie,
For me to understand the problem better, please send me a small sample workbook with your source data and the result you expect to get to support@ablebits.com. Please don't worry if you have confidential information there, we never disclose the data we get from our customers and delete it as soon as the problem is resolved.
Please also don't forget to include the link to this comment into your email.
I'll look into your task and try to help.
Brilliant. Thank you so much, this is exactly what I needed and works much better than the conditional formatting solution which has the downside of finding duplicates in the same column.
Thanks for this. Couldn't find a solution to this question anywhere else - this gives easily filterable results.
hi, Im trying to compare 2 columns with system numbers in 2 sheets in the same excel file.
The 1st sheet (Sheet 2)has been filtered because only a subset of numbers is relevant for the match with the other sheet (Sheet 3). The 1st number to be compared is to be found in row224 (column A). I use the following formula but it seems not to be working since the formula gives nu result after entered (the formula is being shown in the field)
'=IF(ISERROR(MATCH(A224,Sheet3!$A$2:$A$82,0)),"missing","OK")
Could you tell me what's gone wrong? I typed the formula in the field above the sheet and I'm using excel 2016.
best regards, Douwe
Hello Douwe!
I've checked your formula and haven't got any errors, it works correctly and returns all the matching values. However, I replaced the quotation marks from ” to ". Maybe it'll work for you too.
If however, there are still difficulties with the formula, please specify what error do you get exactly (#NAME?, #N/A?, etc.) and I'll do my best to help you.
Thank you for this great formula, it works nicely!
Is it possible to use this command in LibreOffice Calc?
Thank you so much Alexander. You saved a big project where we were racking our heads on removing duplicates from a huge volume data. Is your first method useful in selecting multiple columns? If you happen to see my message, please reply.
And also, the tool provided by Ablebits, is it open source or paid? Please let me know. Thank you.
Hello Julie,
Yes, the first method can be used to compare 2 lists / tables by multiple columns, but it will require extra effort.
First, you need to create a helper column in each list. In the helper column, combine the data from the columns you want to compare by.
Assuming your first table includes columns A, B, C and your second table - columns K, L, M; row 1 is column headers, and data starts in row 2. You want to compare columns A + B with columns K + L.
In cell D2, enter the formula = A2&B2 and copy it down to the end of the table.
In cell N2, enter the formula = K2&L2 and copy it through the entire table.
For more information, please see this comprehensive tutorial on concatenating cells.
Then, in cell E2, enter the formula discussed in the first method:
=IF(ISERROR(MATCH(D2,$N$2:$N$10000,0)),"","Duplicate")
Compare Tables is part of Ultimate Suite for Excel, which is a paid product. It includes over 60 tools to handle more than 300 different tasks. And it has taken quite us a lot of time and resources to develop it :)
You can read more about Ultimate Suite here:
https://www.ablebits.com/excel-suite/index-2020.php
On the above page, you can also download a 14-day fully functional trial version to decide whether it is worth buying a license.