Python Source
A
Ajax
Aspect-Oriented
 
B
Bloggers
Build Systems
 
C
CMS
Charting & Reporting
Code Analyzers
Connection Pools
 
D
Databases
 
E
ERP & CRM
ESB
 
F
Forum Soft
 
G
General Purpose
 
H
HTML Parsers
 
I
IDEs
Installers
Issue Tracking
 
L
Logging Tools
 
N
Network Clients
Network Servers
 
P
PDF Libraries
Parser Generators
Persistence
Project Management
 
R
RSS & RDF Tools
 
S
Search Engines
Security
Source Control
 
T
Template Engines
Testing Tools
 
W
Web Frameworks
Web Mail
Web Servers
Web Services
Web Testing
Wiki Engines
Workflow Engines
 
X
XML Parsers
 

Open Source Search Engines in Python

advas

advas is a python module which provides algorithms for advanced search. These methods are mainly used in information retrieval and linguistics. This package contains:

 * statistical algorithms: term frequency (tf), term frequency with stop list, inverse document frequency (idf), retrieval status value (rsv), language detection, k-nearest neighbour algorithm (kNN).
 * linguistic algorithms: stemming algorithms, synonym detection with the use of the OpenThesaurus (plain text version).
 * sound-like methods: soundex, metaphone, NYSIIS algorithm, caverphone algorithm (version 2.0).
 * ranking methods a simple descriptor-based ranking algorithm.
 * text search algorithms: Knuth-Morris-Pratt.

Go To advas

PyLucence

PyLucene is a GCJ-compiled version of Java Lucene integrated with Python. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python. It is designed to be API compatible with the latest version of Java Lucene.

Go To PyLucence

Whoosh

Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.

Go To Whoosh