Find duplicates
Last updated
Last updated
JabRef can look for duplicated entries inside a library.
This feature is accessible directly through Quality → Find duplicates. It is also used when importing new entries from a supported reference format or directly from the Internet.
Detection of potential duplicates is done by an edit distance algorithm. Extra weighting is put on the fields author, editor, title. and journal.
The differences between the two entries can be configured through the toolbox located at the top of the window. From the toolbox, you can choose to show or hide differences, choose how to display differences (Unified or Split) and you can also choose how to compare entries (by words or characters).
Plain Text — This option hides the differences.
Show Differences — This option shows the differences.
Unified View — In this mode, differences are shown on the right side.
Split View — In this mode, differences are shown on both sides, with deletions on the left side and additions and updates on the right side.
Highlight words — This option compares entries values in terms of words.
Highlight characters — This option compares entries values in terms of characters. It divides both entry values into characters before comparing each character individually. This is perfect for comparing values with small differences (1 or 2 different characters).
From the toolbox's top-left corner, you also can choose to select all the left entry values by clicking Left
or selecting all the right entry values by clicking Right
. Be aware that selecting all entry values will select a value even when it is empty.
![Screenshot of the buttons to choose which entry to keep](../.gitbook/assets/button bar.png)
You are offered to:
Automatically remove exact duplicates. This button shows up if there are exact duplicates. Click it to stop showing other exact duplicates and have them removed automatically.
Keep left — Keeps the left entry and removes the right entry.
Keep right — Keeps the right entry and removes the left entry.
Keep both — Keeps both entries. This usually means that you don't consider the entries to be duplicates.
Keep merged — Keeps the merged entry only and removes the previous entries.
Cancel — Closes the dialog and stops showing other duplicates.