Deduplication is a commonly used feature in GoldFynch, but you may find duplicate attachment files in your case that were not marked as "DUPE". This happens because GoldFynch deduplicates at the root family level: it is not typically desired (or allowed) to exclude or remove duplicate attachments that belong to non-duplicate parent files. As a result, GoldFynch will not mark attachment files as "DUPE", even if their file hashes are the same, unless the parent files are also duplicates.
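To make the family-level rule concrete, here is a minimal Python sketch. It is only an illustration of the behavior described above, not GoldFynch's actual implementation: the Doc class, the mark_family_level_dupes function, and the sample hashes are all hypothetical, and it assumes a parent counts as a duplicate when its hash matches an earlier parent's hash.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    # Hypothetical data model: a file name, its MD5 hash, and any attachments.
    name: str
    md5: str
    attachments: list = field(default_factory=list)


def mark_family_level_dupes(parents):
    """Return the names that would be flagged "DUPE" under a family-level rule."""
    seen_parent_hashes = set()
    dupes = set()
    for parent in parents:
        if parent.md5 in seen_parent_hashes:
            # The parent is a duplicate, so the whole family is flagged.
            dupes.add(parent.name)
            dupes.update(a.name for a in parent.attachments)
        else:
            # First time this parent hash is seen: nothing in the family is
            # flagged, even if an attachment hash already appeared elsewhere.
            seen_parent_hashes.add(parent.md5)
    return dupes


emails = [
    Doc("email_a.eml", "aaa", [Doc("email_a/report.pdf", "fff")]),
    Doc("email_b.eml", "bbb", [Doc("email_b/report.pdf", "fff")]),
    Doc("email_c.eml", "aaa", [Doc("email_c/report.pdf", "fff")]),
]
flagged = mark_family_level_dupes(emails)
```

In this sample, all three attachments share the hash "fff", but only the attachment under email_c.eml is flagged, because email_c.eml is the only parent that duplicates an earlier parent.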


That said, if you do want to exclude these duplicate attachments of non-duplicate parent files from functions like Advanced Search and Productions, here's what you can do:


If these duplicate attachments do, in fact, have the same MD5 file hash across the non-duplicate parent email files, you can tag all but one of these MD5 file hash duplicates and exclude the tagged files from Advanced Searches and Productions.
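If you have local copies of two attachments and want to confirm outside GoldFynch that they share an MD5 hash, a short Python sketch like the one below can help. It uses only the standard hashlib module; the function names md5_of_file and same_md5 are illustrative, and the file paths you pass in are your own.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large attachments are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_md5(path_a, path_b):
    """Return True when both files have identical MD5 digests."""
    return md5_of_file(path_a) == md5_of_file(path_b)
```

Two files with the same content produce the same MD5 value regardless of their file names, which is why attachments under different parent emails can match.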


Step 1: View one of the attachments in GoldFynch, and then click the "Search Dupes by File Hash (MD5)" button under the "Found Duplicates" pane in the right-side column of the Document Viewer. This will run a search in GoldFynch for this specific file hash value.


Image illustrating where the "Search Dupes by File Hash" button is.


Step 2: Once the search has run, bulk-select all of the resulting files, then un-select the one file you want to keep as the "primary" file, which will not be tagged as a duplicate. Every result except that one should now be selected.


Screenshot showing how to bulk-select all of your results, then un-check a primary file, then tag all but one of the results.


Step 3: Click the "+Tag" button along the right side of the screen and create a new tag for these files. In our example, we named this tag "Attached Duplicates". Be sure to apply the tag to the "item only" so that other items in the file family (such as parent email files) are not tagged.


Creating a new tag and showing the "New" label.


Step 4: Now that these duplicate attachments have been tagged, you can choose to exclude these results from features like Advanced Search and Productions. Here are instructions on how to do so:


4a: To exclude these files from an Advanced Search query, add the parameter tags IS-NOT "Attached Duplicates" (selecting your new tag name) to your search query. The next time you run the search, the tagged items will be excluded.



4b: To exclude these files from a production, select the tag in Step 2 of the Production Wizard and then check the "Invert" checkbox next to it. This will exclude all files with this tag, even if they qualify for the production based on other criteria.
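The logic behind Steps 2 through 4 can be pictured with a small Python sketch. This is a conceptual model only and does not call any GoldFynch API: the item dictionaries, the tag_all_but_primary and exclude_tagged functions, and the sample file names are all hypothetical.

```python
def tag_all_but_primary(items, md5, primary_name, tag):
    """Tag every item with the given hash except the chosen primary file."""
    for item in items:
        if item["md5"] == md5 and item["name"] != primary_name:
            item["tags"].add(tag)


def exclude_tagged(items, tag):
    """Return only the items that do not carry the tag (the IS-NOT / Invert idea)."""
    return [item for item in items if tag not in item["tags"]]


items = [
    {"name": "email_a/report.pdf", "md5": "fff", "tags": set()},
    {"name": "email_b/report.pdf", "md5": "fff", "tags": set()},
    {"name": "email_c/notes.docx", "md5": "123", "tags": set()},
]

# Steps 2 and 3: keep email_a/report.pdf as the primary and tag the other match.
tag_all_but_primary(items, "fff", "email_a/report.pdf", "Attached Duplicates")

# Step 4: filter out anything carrying the tag.
remaining = [item["name"] for item in exclude_tagged(items, "Attached Duplicates")]
```

Here, remaining holds the primary copy of the report and the unrelated document, while the second copy of the report is filtered out.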



Learn more about excluding files from productions here