Selection of utilities

hist-rs – high-throughput CLI to count unique lines

hist-rs is a Rust command-line utility built for high-throughput line-oriented text processing.

It provides a fast standalone alternative to traditional shell pipelines for building histograms of repeated lines, and it also includes extra controls for deduplication, filtering, substitutions, and output shaping that make it useful for logs, datasets, and other text streams.

This is free and open source software.

Key Features

  • Counts the abundance of unique lines from files or standard input.
  • Can output only unique lines to deduplicate an input stream.
  • Supports include and exclude pattern matching while processing input.
  • Filters results using minimum or maximum abundance thresholds.
  • Can sort output by key or abundance in ascending or descending order.
  • Supports regex-based pattern substitution on incoming lines.
  • Can show the last k entries together with a summary count of the remaining data.
  • Includes an option to generate a histogram without sorting by value.

Website: github.com/noamteyssier/hist-rs
Support:
Developer: Noam Teyssier
License: MIT License

hist help

hist-rs is written in Rust. Learn Rust with our recommended free books and free tutorials.


Related Software

Alternatives to uniq
runiqAn efficient way to filter duplicate lines from input, à la uniq
huniqRemoves duplicates from stdin using a hash table
nauniqNon-adjacent uniq
semiuniqRemoves nearby repeated lines in a file
uqRemoves duplicate lines from the output, regardless of the order
zetPerform set operations on files considered as sets of lines
anewAdds new lines to files, skipping duplicates
anewerAppends lines from stdin to a file if they don't already exist in the file

Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments