smash – find duplicate files super fast

smash is a command-line interface tool to smash through to find duplicate files efficiently by slicing a file (or blob) into multiple segments and computing a hash using a fast non-cryptographic algorithm such as xxhash or murmur3.

smash does not support pruning of duplicates or empty files natively and it’s encouraged you vet the output report before pruning via automated tools.

This is free and open source software.

Key Features

  • Super fast analysis of large files thanks to slicing.
  • Suited for finding duplicates on bandwidth constrained networks, devices or very large files but plenty capable on smaller ones!
  • Supports a variety of non-cryptographic algorithms.
    • xxhash.
    • murmur3.
    • SHA-256.
    • SHA-512.
    • MD5.
    • FNV128.
    • FNV128a
  • Read-only view of the underlying filesystem when analyzing
  • Reports on duplicate files & empty (0 byte) files
  • Outputs a report in JSON. You can use tools like jq to operate on.
  • Used to dedupe multi-TB of astrophysics datasets, images and video content and run regularly to report duplicates.
  • Cross-platform support – runs under Linux, FreeBSD, macOS, and Windows.

Website: github.com/thushan/smash
Support:
Developer: Thushan Fernando
License: Apache License 2.0

Options for smash CLI tool
Click image for full size

smash is written in Go. Learn Go with our recommended free books and free tutorials.


Related Software

Find and Delete Duplicate Files with these CLI Tools
CzkawkaFind duplicate files, big files, empty files, similar images, and much more
fdupesGreat CLI tool that's written in C
fclonesEfficient duplicate file finder and remover
rmlintFast tool to remove duplicates and other lint
jdupesPowerful CLI duplicate file finder and 'enhanced' fork of fdupes
smashFind duplicate files super fast
rdfindCLI redundant data find tool written in C++
duffCommand-line utility for finding duplicate files
rmdupesOption to use a reference directory
PeriscopeOrganize storage and safely remove redundant files
Go Find DuplicatesScans directories for duplicate files and directories
samanlainenDelete duplicate files with SHA512 hashing
FSlintPython based CLI and GUI tool
sdupesFast duplicate file detection utility.
dupefiDuplicate file finder designed with Linux philosophy
DupsterDuplicate file finder
dupleFind and remove duplicate files
ddhDirectory Differential hTool
backdownSafely and ergonomically remove duplicate files

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments