Apache Lucene is a freely available information retrieval software library that works with fields of text within document files. This evolving venture is also called the Apache Lucene Project. Apache is a server that is distributed under an open source license.
Next Steps
-
Lucid Imagination adds support for Apache search tools
Lucid Imagination offers support and enhanced s...
(SearchEnterpriseLinux.com) -
Using Lucene with OJB
A look at Apache Lucene, a text search engine ...
(TheServerSide.com)
The Lucene application program interface (API) stays the same regardless of the format of the file to be indexed. Provided that the text information can be recovered and extracted, Lucene can index practically any type of text-containing document. Lucene has become popular for use in Internet search engines as well as for single-site search operations.
The Apache Lucene Project comprises four main components:
- Lucene Core: Indexing, searching, spell checking, hit highlighting, and tokenization.
- PyLucene: Python port for Lucene Core.
- Solr: Extensible Markup Language (XML), Hypertext Transfer Protocol (HTTP), and APIs for Javascript Object Notation (JSON), Python, and Ruby, as well as hit highlighting, faceted search, caching, replication, and an interface for Web site administrators.
- Open Relevance Project: Free distribution of materials for performance testing and relevance evaluation.
Tech TalkComment
Share
Comments
Results
Contribute to the conversation