|Acrimon df6cf8a280||4 months ago|
|walk-image-fingerprint||4 months ago|
|LICENSE.txt||4 months ago|
|README.md||4 months ago|
|find-duplicate.py||4 months ago|
Tools I use for filtering and deduplicating images. These are licensed under GPL-3.0-only.
A copy of the license can be found in the
LICENSE.txt file in this repository.
This tool is meant for fingerprinting images for deduplication purposes. Useful for finding duplicates of images across image formats and resolutions.
This program is written in Rust and needs Rust installed in order to be compiled.
Once Rust is installed the program can be compiled by running the
cargo build --release command inside of the
The output executable is located in the
The output csv has two columns. The first column contains the file path and the second column contains the fingerprint. The program takes two cli arguments. The first argument is the folder to scan for images and the second argument is the name of the output file.
./walk-image-fingerprint /home/brandon/Pictures metadata.csv
This example invocation will scan the
/home/brandon/Pictures folder and write the results to a file named
metadata.csv in the current directory.
After fingerprinting is done duplicates can be detected by running the
python script and supplying the previous output file as the first argument.