Handling duplicates with Gemini, DupeGuru and NeoFinder

No prizes for guessing that I have dozens of duplicate files. It's actually more like hundreds. Since I consolidated my external drives, I see that I have the same file in multiple places. This came from running out of space on one drive, and copying the information to another. Problem is, I never went back and deleted the original.

Or I download a series of tutorials, then moved them, couldn't find them, then downloaded the series again. Now I have both.

And the usual problem of making a backup to a drive, then writing that information to BluRay, and forgetting to remove the other backup. Again, now I have two.

Regardless of how I got here, I realize one duplicate finder is not sufficient to untangle the situation. I have Gemini, DupeGuru and NeoFinder working to find where everything went.

Gemini is great for finding duplicates. If it says the file is the same, it's the same. It's removed over 300GB so far. Problem is, it doesn't handle fuzzy logic so well. So if you download a file at 480p resolution, then the same file at 720p resolution, those are not the same and aren't detected.

That's where DupeGuru comes in. It can handle looking for duplicates based on similar file names. It detects the slight differences and groups the files together so you can delete the lower res version.

Finally, there is NeoFinder. It catalogs all my volumes so I know what I actually have. Where is that tutorial on JMeter? Where is that shell scripting ebook? Where are all my mp3 files? But, it also adds the ability to find duplicates.

Once you have catalogued all your volumes, you can perform a duplicate search across them. You can now see where you have two, or more, of a file with the same name. You can then jump into Finder, or actually delete the file from NeoFinder. It will move the file to the trash and remove the entry from the catalog.

Using these 3 tools, I get what I feel is complete coverage for finding duplicate. I get an exact match with Gemini, a little more fuzzy logic with DupeGuru, and finally, visual confirmation using an indexed search through NeoFinder.

Other articles of interest:

Leave a Reply

Your email address will not be published. Required fields are marked *

Recent Comments
  • Round and Round with the For..Next in Katalon Studio (1)
    • Sweet Ophaline Labador: Hello can you help me how to loop this scenario. Checking the elements is just the same. I want to check that the following function is available in www.siacargo.com: Track Shipment – clicking on this can...
  • How To Disable the Quicken Registration Prompt (30)
    • David: A BIG Thank you! I regularly reinstall Windows OS’s using different hardware configs. In short, Quicken 2011 (it’s March of 2020 as I write this) has been a program I’ve been very happy with and continue to use....
    • Karen L: Today I rang Reckon Australia again, got a different guy and he talked me through the process of getting me a new product key to enter since I already had the licence and it had been extended. Reckon have only extended it by 1...
    • Karen L: I have Australian Quicken 2008 on Windows 8.1. It keeps asking me to renew my licence which Reckon did for me however the activation won’t work. I tried this to disable the registration prompt however it did not stop the...
  • Parsing Strings in Katalon – Split, Substring and Readlines (6)
    • Don Pedro: String tempText=”Date Month January. Revenue $1,355,721.00″ println(tempText.split(“ \$”)[1]) You need to escape the $ with \$ The [1] on the split captures characters on the right of the split [0]...
    • Prasad: Hi, I need to get the substring from the below string ‘Date Month January. Revenue $1,355,721.00’ I want to extract text after $ value. Please help Thank you
  • Working with Dates and Date Formatting in Katalon Studio (15)
    • Sangeethaa: How to get previous day’s date(Yesterday’s date)? Actually I was using today.previous() before,it was working fine till today morning.Now its not retrieving right date. Can anyone please guide me on this.
    • Don Pedro: That could be a little tricky. First, is the text/format always going to be the same? You will need to do some parsing. Separate the first but of text at 2019. After that, split the text again to get rid of Central Standard...
    • Don Pedro: It should be of the same Date type as today. You could then do some calendar trickery with Calendar /*Calendar c = Calendar.getInstance(); //c.set(Calendar.MONTH, 1) //Set the month, 0=January, 11=December println...
    • josh: I’m testing a page with a string like “Updated November 21, 2019 16:25:32 PM Central Standard Time.” How might I capture the date and time and compare it to current time to verify that the update time was within...
    • zakir hussain: today = new Date() lastWeek = (today – 1.week) can u please help me in understanding what is the return type of lastweek? and I wanted to set nextyear date how can i do it
  • Output status messages and test information by writing to the Log File Viewer in Katalon Studio (6)
    • Saranya: Good One. Could you plz share link to access all your katalon related blogs. ThankQ
    • Rodrigo Calabretta: I’m using the KeywordUtil.markFailed or KeywordUtil.markError and my test stops is being shown as Error and If I use in the @afterTestCase the testCaseContext.getTestCaseSta tus() to show the status test case...
  • Setting up Environment Profiles in Katalon Studio (2)
    • Don Pedro: Not quite sure I understand. The term “Execution Profile” is normally used now, but they are still GlobalVariables and are created in the same way for version 7 as previous. You make a new execution profile, then...
  • Here is my drive cluster (2)
    • Don Pedro: While I still have those drives, the drive cluster looks nothing like that now. Almost all of my external drives have had the cases removed and they have been installed into a series of MediaSonic 8 Bay Drive Enclosures. I...