Links:
ALTSE ALTSE is an alternative search engine technology. It can index up to a couple million Web pages. ASPSeek ASPSeek is a full-featured medium-to-large scale SQL-based Internet search engine. It consists of indexing robot, search daemon and search frontend (CGI program). DataparkSearch DataparkSearch is a full-featured web search engine released under the GNU General Public License. DataparkSearch consists of two parts. The first part is indexing mechanism (indexer). Indexer walks over html hypertext references and stores found words and new references into database. The second part is web CGI front-end to provide search using data collected by indexer. ddc-concordance ddc-concordance is a search engine developed specially to meet the needs of linguistic researchers. Douglas Thrift's Search Engine Douglas Thrift's Search Engine is an indexing search engine for use on small websites such as personal or small business sites. It is designed to be very similar to Google for end users and its output is customizable. For indexing, it supports both the Robots Exclusion Protocol and the Robots META Tag. DreamCloak DreamCloak allows you to create unlimited cloaked entry pages for each actual page across unlimited sites. DuckDuckGo DuckDuckGo is a search engine focused on relevant results and respecting user privacy. It is a mash-up of several other sites like Wikipedia, About, Bing, and Yahoo. Estraier Estraier is a full-text search system for personal use. Full-text search means functions to search lots of documents for some documents including specified words. filofant filofant is an indexing server for e-mails, attachments and other documents stored on various locations in your company. The indexed documents are accessible by a customizable web frontend like an internet search engine. Find What I Mean Find What I Mean aims to provide a searching library that tolerates errors in queries. FM SiteSearch Pro (commercial) FM SiteSearch Pro adds a search capability to a web site. It comes with a relevance engine, control panel, large web site support, mysql support (optional), search/keyword statistics, advanced searches, specialized searches, fully customizable and many more. focuseek searchbox focuseek searchbox can spider sites and power the search function of a web site or portal, or it can index information from any source and enable search in your business processes. FtpLocate FtpLocate is a fast FTP search engine written with Perl. gonzui gonzui is a source code search engine for accelerating open source software development. In the open source software development, programmers frequently refer to source codes written by others. Our goal is to help programmers develop programs effectively by creating a source code search engine that covers vast quantities of open source codes available on the Internet. Goose Search Goose Search allows you to search Google's index of the Internet from the command line. You run Goose, giving it your list of search terms, and it presents a list of search results using an easy to navigate Curses display in your terminal. You can then select a search result to open in your web browser. Harvest Harvest is a full featured web based search system for any kind of documents. HarvestMan a web-crawler written in the python programming language. HarvestMan is a HarvestMan is full-featured, multithreaded web-crawler written in Python. HarvestMan supports as much as 40 customization options as of the current stable version. Heritrix Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. htdoogle htdoogle is a Web interface for the HTDIG search engine. It is fast and intuitive. HtSearch HtSearch is a PHP interface to htsearch, a frontend to ht://Dig. Hyper Estraier Hyper Estraier is a full-text search system. It can be used as a Web search engine, mailbox searching, etc. It features high performance searching, high scalability of target documents, a perfect recall ratio by the N-gram method, phrase searching, attribute searching, and similarity searching. Multilingualism is supported with Unicode. imgSeekWeb imgSeekWeb is based on the imgSeek project. The final goal is a distributed server side content-based image search engine. IntelliSearch Intellisearch is a concept thought up to combine Dasher, an on screen predictive keyboard, with Yahoo!'s predictive search functionality, in an accessible manner. isbnsearch isbnsearch is a distributed search portal of common sources of ISBN numbers, with permanent caching of results. To provide a open-source free interface for ISBN retrieval using HTML, SQL or XML to be independent of any toolkits or software. locust locust is a full featured Internet search engine specifically designed for knowledge area or corporate search. It can index 2.5 million documents per 24 hours on a single Dell server. It consists of clean C++/STL code written from scratch. LuMriX LuMriX is a search engine that exploits XML and XML Topic Maps. In contrast to other retrieval methods, it does not relate single items to resources, but combines given items into meaningful associations (concepts), which are in turn linked to resources. mguesser mguesser is a standalong part of libudmsearch (a core of mnogo search engine http://mnogosearch.org) which allows to guess text's charset and language. mnoGoSearch extension for PHP nmGoSearch extension for PHP is a complete PHP binding for the mnoGoSearch API. mnoGoSearch-php mnoGoSearch-php is a full-featured web search engine software for intranet and internet servers. Montezuma Montezuma is a full-text indexing/search engine library written entirely in Common Lisp. Montezuma is a Common Lisp port of Ferret. Ferret is a Ruby port of Lucene. mygosuMenu mygosuMenu is a simple, lightweight, fast, free, search engine friendly DHTML menu, compatible with most browsers. Namazu Namazu is a full-text search system intended for easy use. Not only it works as a CGI program for a small or medium scale Web search engine, but also works as a personal use search system for your pile of emails. NVBase NVBase is an information retrieval system that makes any data within an enterprise available. Any source of information including emails, RDBMS, file systems, and Web pages can be indexed and searched. Open Search Server Open Search Server is a stable, high-performance search engine and a suite of high-powered full text search algorithms Pagecast Pagecast is a program that makes it easy to send lists of URLs to popular internet search engine services. Perlfect Search Perlfect Search is a sophisticated, powerful, versatile, customizable and effective site indexing/searching suite available under an open source license (GPL). It comes as a pair of disctinct scripts. The indexer, that automatically scans and indexes a web site, and the search engine, a cgi script that serves search queries for keywords over the index, and displays results pages in html, in a standard format including title, description and relevance ranking for each matching document. Personal Search Engine Personal Search Engine is a tool that allows a webmaster or developer relatively easy add a local search engine to an existing web site. PhpDig PhpDig is a web spider and search engine written in PHP, using a MySQL database and flat file support. PhpDig builds a glossary with words found in indexed pages. On a search query, it displays a result page containing the search keys, ranked by occurrence. phpLinks phpLinks is an open source project written in PHP for use with MySQL, allowing one to run an extremely efficient Link Farm with full search capabilities. A "simulated" search engine in many ways. phpSERA phpSERA is a PHP/MySQL-based tool for Search Engine Ranking Analysis (SERA). The rankings are based on parsing output of search engines, using simple regular expressions. POPsearch POPsearch is a personal search engine that is designed to help you easily organize and find information on your computer. POPsearch lets you index your entire collection of email messages and files. This collection can then be searched from any web browser. pro-search pro-search is a crawler for FTP servers, SMB shares, HTTP servers, and DC++ networks. It has a powerful, Web-based search and navigation interface. pseudo-cron pseudo-cron allows users to use cron jobs on a Website without shell access. Whenever any user requests a page which uses pseudo-cron, it checks if any cron jobs should have been run since the previous request. PyLucene PyLucene is a GCJ-compiled version of Java Lucene integrated with Python via SWIG. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python. It is designed to be API compatible with the latest version of Java Lucene. Pyndex Pyndex is a simple full text indexer written in Python. It uses Metakit as its storage manager, so you need to have Metakit installed. Quick Submit Quick Submit is an automatic search-engine URL submitter. It is a perl CGI which allows you to submit your website to search engines in a matter of minutes. RestPose RestPose is a search engine. It is designed to take a set of documents and then, when given a query, to return ranked lists of documents which are a good match for that query. Satellite2 Satellite2 is a website indexing/search fascillity, written in Perl with (planned) : support for a large range of file formats (txt, html, doc, xsl, pdf, chm), Unlimited number of different indexes possible with one Satellite installation, Unlimited number of resulpage templates possible, and a web based administrator which allows easy adding, modifying and deleting of indexes. Sherlock Holmes Sherlock Holmes is a universal search engine - a system for gathering and indexing of textual data (text files, web pages, ...), both locally and over the network. sitemap_gen sitemap_gen is a platform-independent site map generator. It crawls a Web site starting from a given URL, and outputs an XML sitemap file that you can use for Google or other search engines.
Next 50