Open Source Search Engines in Python

3 projects

advas is a python module which provides algorithms for advanced search. These methods are mainly used in information retrieval and linguistics. This package contains: * statistical algorithms: term frequency (tf), term frequency with stop list, inverse document frequency (idf), retrieval status value (rsv), language detection, k-nearest neighbour algorithm...

Details

PyLucene is a GCJ-compiled version of Java Lucene integrated with Python. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python. It is designed to be API compatible with the latest version of Java Lucene.

Details

Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.

Details