Duff is a Unix command-line utility for quickly finding duplicates in a given set of files. Duff is written in C and should compile on most modern Unices.
a DVI to PDF translator. Its features include TeX specials that approximate the functionality of the PostScript pdfmarks used by Adobe Acrobat Distiller, the ability to include PDF files and JPEG files as embedded images, support for both Type1 and PK fonts, support for arbitrary linear graphics transformations, a color stack accessible via specials, partial font embedding and stream compression for reduced output file size, native, portable graphics via TPIC specials, and balanced page and destination trees for improved reader access on very large document files
dwdiff is a front-end for the diff program that operates at the word level instead of the line level. It is different from wdiff in that it allows the user to specify what should be considered whitespace, and in that it takes an optional list of characters that should be considered delimiters.
lets you place hyperlinks and shell/tcl/TeX/etc code inside plain text files
Elex generates a scanner (lexer) from a specification oriented around regular expressions.
eolfix is a command line utility for querying and correcting end-of-line (EOL) characters in ASCII text files. It can convert line endings between DOS, Unix, and Mac formats and handles "mixed" and binary formats. It converts only as needed and features a report-only mode.
a program for merging EPS (Encapsulated PostScript) files
a Perl program for splitting an EPS (Encapsulated PostScript) file into several smaller EPS files
a simple plain-text format which allows conversion to and from HTML. Instead of editing HTML directly, it provides an easy-to-edit, easy-to-read and intuitive way to write HTML
euc2html is a simple application that reads in EUC encoded double-byte characters and translates them to HTML 4.0 Unicode encoded entities.
EVP dirdiff recursively compares two directory trees using message digest (hash), e.g. MD5.
converts the given set of C files into HTML files with all the user defined function calls converted to hyper links so that the user can click the link to view that function definition
converts IE favorite files to a Netscape bookmark file
fccu-docprop is a command line utility that tries to print properties of MS OLE files. MS OLE files are mainly MS Office DOC and XLS files. This software uses the libgsf library to get this metadata. It can be used for forensic purposes.
digs tags out of a filename and allows you to use those tags within the execution of a program
crlf converts files from/to DOS and UNIX text file formats, tolower converts filename(s) case to lower/upper case, untab converts TABs in files to spaces, and time_t returns values for time handling
fk_html is a simple Perl script to convert HTML mail to plain text. It converts your mail while you are downloading it, running as a fake POP3 server that redirects your mail client connections.
fsplit and fmerge
fsplit and fmerge are utilities to split a large binary file into smaller pieces and merge them together on another machine.
gClipColl provides a drag-and-drop repository for text snippets. Any text dragged to it is stored in a list for dragging to another application.
a tool to extract information from files. The default settings (and the shorthand options) are useful for extracting information such as the title or meta tags from HTML files, but it can also be used for other kinds of documents
Generic Colouriser acts as a filter, i.e. taking standard input, colourising it and writing to standard output.
a command-line parser generator. Creates a C or C++ file containing command line parsing routines for your program based on a simple configuration file
a CHM file viewer for Gnome2. It uses PyCHM, a set of Python wrappers around the C library libchm
GNU Talk Filters
The GNU Talk Filters are filter programs that convert ordinary English text into text that mimics a stereotyped or otherwise humorous dialect. These filters have been in the public domain for many years, but now for the first time they are provided as a single integrated package. The filters include austro, b1ff, brooklyn, chef, cockney, drawl, dubya, fudd, funetak, jethro, jive, kraut, pansy, pirate, postmodern, redneck, valspeak, and warez. Each program reads from standard input and writes to standard output. This version of the package also provides the filters as a C library, so they can be easily embedded in other programs.
Gnutran is a simple, Emacs-based front-end to a number of machine translation engines available on the web.
an optical character recognition program. It converts PGM files into ASC files
gozer is a command-line text rendering utility for creating images from arbitrary text in antialiased TrueType fonts using optional font styles, word wrapping and layout control.
searches one or more input files for lines containing a match to a specified pattern. By default, grep prints the matching lines
a plain text to HTML converter. It successfully converts subtle text markup for lists, bold, italics, tables and headings to the corresponding HTML tags, without requiring unreadable source text files
a tool for automatically creating high-quality HTML markup from Project Gutenberg etexts. In combination with freely-available HTML-to-Postscript conversion tools, GutenMark can convert Project Gutenberg etexts into publication-quality Postscript, for print-on-demand applications
hd2u is a filter used to convert plain text from DOS (CR/LF) format to UNIX format (LF) and vice versa.
help2info is a bash script that generates a simple info page from the output of the --help argument of the specified program.
a Perl script that converts the --help and --version output from a program into a simple manual page
Tools to manipulate hierarchical text outlines (i.e. text trees), including a generator and a spiffy pager.
Highlight is a universal source code converter for Linux and Windows, which transforms code to HTML, XHTML, RTF, LaTeX or TeX files with syntax highlighting.
highlights strings using ANSI terminal escape codes
HistView takes an ASCII changelog as input and outputs a formatted HTML page, optionally containing links to download releases.
Html Code Convert
Html Code Convert helps speed up the conversion of HTML code into different formats, including JavaScript, JavaServer Pages, Microsoft ASP, PHP, Perl, and the UNIX shell. It is particularly useful in CGI scripting.
HTML to LaTeX
HTML to LaTeX converts a web site to a LaTeX document which can be used to generate PostScript, PDF, and other formats.
HTML2DB is a tool to assist with the task of converting well-behaved HTML into DocBook SGML.
a converter from HTML to xsl:fo. The HTML code can be written with StarOffice or other WYSIWYM editors and need not be 100% valid HTML
a small Perl script designed to convert a properly formatted HTML file into a properly formatted LaTeX file
a simple console based utility for converting HTML text streams (or any ASCII based text stream for that matter) into a series of perl print statements for inclusion in a Perl script
html2text converts HTML documents into plain text.
html2text reads HTML documents from standard input or a (local or remote) URI, and formats them into a stream of plain text characters that is written to standard output or into an output-file. The program is able to preserve the original positions of table fields, allows you to set the screen width, and accepts also syntactically incorrect input. The rendering is largely customizable through an RC file.
html_parse is a tool for stripping HTML tags from a document. It is also capable of adding the resulting plain text to a database driven by MySQL.
converts HTML files to PDF or PostScript, generates a table-of-contents for books and generates indexed HTML files
htmlrecode recodes the HTML file using a new character set, while losing no characters at all.
replaces key tags read from a template file with the data read from a data file and generates an output file
features are links to other info files, the ability to read compressed files and prettier layout
creates indexed pdf documents from text files. Designed to aid creating an electronic distribution method for legacy system reports, since many mainframe type print spools are plain text