Image Deduplicator – find duplicate images


Image Deduplicator is a useful Python package that finds duplicates using a variety of different algorithms, and plots duplicates found for a specific image file.

The software supports a fairly wide range of image formats with JPG, PNG, BMP, SVG, MPO, PPM, TIFF, GIF, PGM and PGM covered. Other popular image formats such as WEBP and RAW are not currently supported. We’d also like to see a simple command-line interface added.

Image Deduplicator has attracted more than 3K GitHub stars.

Support: Documentation
Developer: Tanuj Jain, Christopher Lennan, Dat Tran
License: Apache License 2.0

Image Deduplicator is written in Python. Learn Python with our recommended free books and free tutorials.

Pages in this article:
Page 1 – Introduction / Installation
Page 2 – In Operation
Page 3 – Summary

Notify of

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Inline Feedbacks
View all comments