LinuxLinks.com
Newbies What Next ? News Forums Calendar

Search





News Sections
Home
General News (3973/0)
Reviews (628/0)
Press Releases (464/0)
Distributions (187/0)
Software (883/2)
Hardware (522/0)
Security (192/0)
Tutorials (345/0)
Off Topic (181/1)


User Functions
Username:

Password:

Don't have an account yet? Sign up as a New User


Events
There are no upcoming events




Xapian

Xapian

Xapian is an open source probabilistic information retrieval library. It is a full text search engine library for programmers.

The Xapian search engine library is a highly adaptable toolkit which allows developers to easily add advanced indexing and search facilities to their own applications. It implements the probabilistic model of information retrieval, and provides facilities for performing ranked free-text searches, relevance feedback, phrase searching, boolean searching, stemming, and simultaneous update and searching. It is highly scalable, and is capable of working with collections containing hundreds of millions of documents.

It supports a rich set of boolean query operators.

Xapian is written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, Ruby, Lua and Erlang.

 Xapian 1.2.18

Price
Free to download

Size
3.0MB
License

GNU GPL v2

Developer
Olly Betts, Hein Ragas, James Aylett, Sam Liddicott, Richard Boulton, Dan Colish

Website
xapian.org

System Requirements

Support:
Documentation, Mailing Lists, Twitter, SourceForge Project Page

Selected Reviews:

Features include:

  • Supports database files > 2GB - essential for scaling to large document collections
  • Transactions: if database update fails in the middle of a transaction, the database is guaranteed to remain in a consistent state
  • Simultaneous search and update, with new documents being immediately visible
  • Support for large databases: Xapian has been proven to be scalable to hundreds of millions of documents
  • Accurate probabilistic ranking: more relevant documents are listed first
  • Phrase and proximity searching - users can search for words occurring in an exact phrase or within a specified number of words, either in a specified order, or in any order
  • Relevance feedback, which improves ranking and can expand a query, find related documents, categorise documents etc
  • Structured Boolean queries, e.g. "race AND condition NOT horse"
  • Wildcard search, e.g. "wiki*"
  • Spelling correction
  • Synonyms
  • Omega, a packaged solution for adding a search engine to a web site or intranet. Omega can easily be extended and adapted to fit changing requirements
  • Faceted search - dynamically generate complete lists of category values which feature in matching documents
  • Supports Unicode (including codepoints beyond the BMP), and stores indexed data in UTF-8
  • Highly portable

Return to Search Engines for Big Data Home Page

Bookmark and Share


Last Updated Sunday, August 31 2014 @ 05:15 AM EDT


We have written a range of guides highlighting excellent free books for popular programming languages. Check out the following guides: C, C++, C#, Java, JavaScript, CoffeeScript, HTML, Python, Ruby, Perl, Haskell, PHP, Lisp, R, Prolog, Scala, Scheme, Forth, SQL, Node.js (new), Fortran (new), Erlang (new), Pascal (new), and Ada (new).


Group Tests
All Group Tests

Top Free Software
5 Office Suites
7 Document Processors
6 Lean Desktops
6 Desktop Search
9 Project Management
9 Groupware Apps
14 File Managers
10 Databases
21 Backup Tools
21 DVD Tools
21 Window Managers
21 Productivity Tools
9 Terminal Emulators
21 Financial Tools
21 Text Editors
21 Video Emulators
21 Home Emulators
42 Graphics Apps
6 CAD Apps
42 Scientific Apps
10 Web Browsers
42 Email Apps
12 Instant Messaging
10 IRC Clients
7 Twitter Clients
12 News Aggregators
11 VoIP Apps
11 Remote Display Apps
42 Best Games
42 More Games
21 More Games
21 Notable Games (1)
21 Notable Games (2)
21 Notable Games (3)
8 ASCII Games
9 Educational Games
42 Audio Apps
42 Video Apps
6 Screencasting Apps
80 Security Apps
9 System Monitoring
6 Family History Apps
11 PDF Tools
6 Music Servers
6 Collection Managers
7 Calculator Apps
8 Geometry Apps
Free Console Apps
14 Multimedia
Programming
8 Compilers
9 IDEs
9 Debuggers
7 Revision Control Apps
6 Doc Generators
'Free' Proprietary
21 Closed-Source Apps
Top Commercial Apps
42 Games
Free Web Software
21 Web CMS
14 Wiki Engines
8 Blog Apps
6 eCommerce Apps
5 Human Resource Apps
10 ERP
10 CRM
6 Data Warehouse Apps
8 Business Intelligence
6 Point-of-Sale

All Group Tests

Other Articles
Migrating from Windows
Back up your data
Distribution Guide
Distro Portal Pages
20 Free Linux Books
Running Linux Under Windows


Older Stories
Sunday 09/07
  • Free Office Suites that Cut The Mustard (0)
  • Low-Spec Hardware? Try these Desktop Environments (0)

  • Monday 08/25
  • Powerful Command-line File Transfer Programs (0)

  • Saturday 08/16
  • Great Apps to Take Notes (0)

  • Sunday 08/10
  • Exciting New Terminal Emulators (0)

  • Saturday 08/02
  • The Best Lightweight Graphical Email Clients (0)

  • Sunday 07/20
  • Excellent Free Distraction-Free Tools for Writers (0)
  • Pick of the Bunch: Console Internet Applications (0)
  • Mobile Phone Management Made Easy (0)

  • Saturday 06/28
  • Easy File Comparisons With These Great Free Diff Tools (0)


  • Vote

    What do you find MOST attractive about Open Source software?

    Amount of customization
    Security
    Freedom provided
    Speed of development
    Quality
    Multiple versions
    Cost
    Potential to contribute
    Ability to modify code
    Results
    4873 votes | 2 comments

    Built with GeekLog and phpBB
    Comments to the webmaster are welcome
    Copyright 2009 LinuxLinks.com - All rights reserved