Based on EfficientIR.
A tool to find duplicate image pairs or the most similar images of target image in your file system.
Like a hamster, I love browsing sites like Pixiv, X, etc., liking and saving the pictures that give me aesthetic pleasure. The problem is that some pictures are uploaded multiple times, and I only want to keep one copy.
There are also multiple softwares that already meet my needs like EfficientIR. As a front-end engineer, I want to re-implement it using modern front-end technologies while improving my native coding skills.
That's why I write this software, hope it can save your problems! Pull requests are always welcomed!
Windows
Download and install the tool.
Indexespage: add index paths that contain images.Indexespage: update index to generate the eigenvalues of images.- Start a search progress using generated eigenvalues.
Searchpage: search duplicate image pairs.Search Targetpage: search the most similar images of target image.
Duplicate Images Finder now supports indexing with multiple work processes, default to 4.
| Image Processing Model | CPU Model | Image Size | Time Consuming | Software Version |
|---|---|---|---|---|
EfficientNet-B2 |
Inter i5-12600KF |
approximately 50,000 images (≈ 170GB) | 90min (with 1 work process) | dev |
EfficientNet-B2 |
AMD Ryzen 5 9600X |
approximately 100,000 images (≈ 380GB) | 55min (with 4 work processes) | v1.3.0 |
Python environment is required. Binary is built successfully on python==3.12.4.
cd EfficientIR
git submodule update --init
pip install -r requirements.txt
pyinstaller build_nogui.specIf you need to run index or search actions, prepare backend binary first.
yarnyarn devyarn lint
# fix resolvable lint errors
yarn lint --fixyarn build