Skip to content

LolipopJ/dupimg-finder

Repository files navigation

Duplicate Images Finder / 重复图像查找工具

search-duplicate-images

Based on EfficientIR.

A tool to find duplicate image pairs or the most similar images of target image in your file system.

Why Write This?

Like a hamster, I love browsing sites like Pixiv, X, etc., liking and saving the pictures that give me aesthetic pleasure. The problem is that some pictures are uploaded multiple times, and I only want to keep one copy.

There are also multiple softwares that already meet my needs like EfficientIR. As a front-end engineer, I want to re-implement it using modern front-end technologies while improving my native coding skills.

That's why I write this software, hope it can save your problems! Pull requests are always welcomed!

Support Platforms

  • Windows

Usage

Download and install the tool.

  1. Indexes page: add index paths that contain images.
  2. Indexes page: update index to generate the eigenvalues of images.
  3. Start a search progress using generated eigenvalues.
    • Search page: search duplicate image pairs.
    • Search Target page: search the most similar images of target image.

Performance

Duplicate Images Finder now supports indexing with multiple work processes, default to 4.

Image Processing Model CPU Model Image Size Time Consuming Software Version
EfficientNet-B2 Inter i5-12600KF approximately 50,000 images (≈ 170GB) 90min (with 1 work process) dev
EfficientNet-B2 AMD Ryzen 5 9600X approximately 100,000 images (≈ 380GB) 55min (with 4 work processes) v1.3.0

Develop

Backend

Python environment is required. Binary is built successfully on python==3.12.4.

cd EfficientIR
git submodule update --init
pip install -r requirements.txt
pyinstaller build_nogui.spec

Frontend

If you need to run index or search actions, prepare backend binary first.

Install Dependencies

yarn

Development

yarn dev

Lint

yarn lint

# fix resolvable lint errors
yarn lint --fix

Production

yarn build

About

A tool to find duplicate image pairs or the most similar images of target image in your file system.

Topics

Resources

License

Stars

Watchers

Forks

Contributors