Links:
sitemap_gen sitemap_gen is a platform-independent site map generator. It crawls a Web site starting from a given URL, and outputs an XML sitemap file that you can use for Google or other search engines. spindle spindle is a web indexing/search tool built on top of the Lucene toolkit. It includes a HTTP spider that is used to build the index, and a search class that is used to search the index. spinn3r-client spinn3r-client implements client bindings to access the Spinn3r Web service. Spinn3r is a Web service for indexing weblogs. It provides raw access to every post being published in real time. Strigi Strigi is a desktop and indexer independent desktop search engine. Its main features include very fast crawling, a very small memory footprint, no hammering of the system, and pluggable backends (currently clucene and hyperestraier was provided). Communication between the daemon and search program is done over an abstract interface which is desktop independent. SWISH++ SWISH++ is a Unix-based file indexing and searching engine (typically used to index and search files on web sites). The Data Mine The Data Mine is a search engine designed to give users an unusually powerful interface. It is designed around human-computer intelligent interaction (making the computer a tool so humans can use their intelligence). It divides the screen into two halves: one lets you find all the instances of your query's keywords, and the other lets you look through a highlighted version of the results you choose. The Search Engine Project The Search Engine Project is a simple yet extremely powerful and fast search engine in PHP with MySQL database. The TSEP is built to index and search a site of 250 pages within seconds. It also supports boolean search. Web Searcher Web Searcher is an applet for GNOME which allows you to search a string in a selected search engine, with little work. It launches a browser with the Web address of a search engine ready to search. Webglimpse Webglimpse is a fast, flexible search engine for finding information in a related web of pages. WebSuck WebSuck goes through the web-pages you specify and checks for links and data files. The links are followed, and the data files are output in the format of your choice. xinabse xinabse is a search engine for small to medium sized sites. It consists of a HTTP spider written in Perl and a templatable frontend in PHP. Keywords and sites are stored in a MySQL database. Xion FTP spider Xion FTP spider is a Perl/Mod_Perl based ftp spider/search system. YaCY YaCY is a p2p-based distributed Web Search Engine. Yahoo BOSS Yahoo BOSS (Build your Own Search Service) is a search API. This PHP 5.3 package can retrieve the results of Web, news, and image searches, and also cache them.
Prev 50