PHP Classes

Fuzzy Index: Index text for performing fuzzy search

Ratings: Not enough user ratings
Unique user downloads: Total: 742, this week: 1
Download rankings: All time: 4,517, this week: 571 (up)
Version: fuzzy-index 1.0
License: GNU Lesser Genera...
PHP version: 5.3
Categories: PHP 5, Databases, Searching, Text pro...
Description 

This class can index text for performing fuzzy search.

It can process a list of text strings and build a database that indexes snippets of those strings and the locations where they appear.

The class can also search for given keywords and return the locations in the indexed strings where the best matching text appears.

It uses SQLite to store the indexed text database, but the class can be extended to use a different database type.
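
As a rough illustration of the index-and-search idea described above, here is a minimal sketch. It is not the FuzzyIndex API: the function names (trigrams, buildIndex, search), the table layout, and the choice of character trigrams as snippets are all assumptions made for this example. It also assumes the pdo_sqlite extension, ASCII text, and a current PHP version rather than 5.3.

```php
<?php
// Hypothetical sketch, NOT the FuzzyIndex class API. All names are illustrative.

/** Split a string into unique, overlapping, lowercase character trigrams (ASCII only). */
function trigrams(string $text): array
{
    $text = strtolower(trim($text));
    $grams = [];
    for ($i = 0; $i + 3 <= strlen($text); $i++) {
        $grams[] = substr($text, $i, 3);
    }
    // array_values() re-indexes so the list can be bound as positional parameters.
    return array_values(array_unique($grams));
}

/** Build an in-memory SQLite table mapping each snippet to the string it came from. */
function buildIndex(array $strings): PDO
{
    $db = new PDO('sqlite::memory:');
    $db->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
    $db->exec('CREATE TABLE snippets (gram TEXT, location INTEGER)');
    $db->exec('CREATE INDEX idx_gram ON snippets (gram)');
    $insert = $db->prepare('INSERT INTO snippets (gram, location) VALUES (?, ?)');
    foreach ($strings as $location => $text) {
        foreach (trigrams($text) as $gram) {
            $insert->execute([$gram, $location]);
        }
    }
    return $db;
}

/** Return the locations that share trigrams with the keyword, best match first. */
function search(PDO $db, string $keyword): array
{
    $grams = trigrams($keyword);
    if ($grams === []) {
        return [];
    }
    $marks = implode(',', array_fill(0, count($grams), '?'));
    $stmt = $db->prepare(
        "SELECT location, COUNT(*) AS hits FROM snippets
         WHERE gram IN ($marks)
         GROUP BY location ORDER BY hits DESC, location ASC"
    );
    $stmt->execute($grams);
    return array_map('intval', $stmt->fetchAll(PDO::FETCH_COLUMN, 0));
}

$db = buildIndex(['fuzzy text search', 'database indexing', 'exact matching']);
$results = search($db, 'indexng'); // misspelled query still shares "ind", "nde", "dex"
```

Here the misspelled keyword still finds the second string (location 1), because three of its trigrams survive the typo. Ranking by shared snippet count is only one possible scoring rule.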

It uses certain heuristics to extract the snippets from the indexed text. These heuristics are implemented as separate classes that can be used interchangeably.
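
One way to picture interchangeable heuristics is a shared interface with several implementations. The sketch below is an assumption about the general pattern only: SnippetHeuristic, NGramHeuristic, WordHeuristic and extractSnippets are hypothetical names, and the actual heuristic classes in FuzzyIndex.php may work differently.

```php
<?php
// Hypothetical names throughout; the real heuristic classes may differ.

interface SnippetHeuristic
{
    /** @return string[] snippets extracted from $text */
    public function extract(string $text): array;
}

/** Overlapping character n-grams; tolerant of small typos (ASCII only). */
final class NGramHeuristic implements SnippetHeuristic
{
    private $size;

    public function __construct(int $size = 3)
    {
        $this->size = $size;
    }

    public function extract(string $text): array
    {
        $text = strtolower($text);
        $out = [];
        for ($i = 0; $i + $this->size <= strlen($text); $i++) {
            $out[] = substr($text, $i, $this->size);
        }
        return array_values(array_unique($out));
    }
}

/** Whole lowercase words, split on any non-alphanumeric character. */
final class WordHeuristic implements SnippetHeuristic
{
    public function extract(string $text): array
    {
        $words = preg_split('/[^a-z0-9]+/', strtolower($text), -1, PREG_SPLIT_NO_EMPTY);
        return array_values(array_unique($words));
    }
}

/** The caller depends only on the interface, so heuristics can be swapped freely. */
function extractSnippets(SnippetHeuristic $heuristic, string $text): array
{
    return $heuristic->extract($text);
}

$byWord = extractSnippets(new WordHeuristic(), 'Fuzzy search, fuzzy index');
$byGram = extractSnippets(new NGramHeuristic(4), 'fuzzy');
```

Because the indexing code only sees the interface, a different heuristic (for example one suited to another language) could be passed in without touching the storage or search logic.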

Innovation Award
PHP Programming Innovation award winner, June 2012
Prize: One copy of the Zend Studio

Searching for text in large documents is not a trivial task.

To be useful, a search needs to be fast and to take into account that search words may be misspelled and may not appear contiguously in the document being searched.

This class addresses the challenges of searching large text documents. It builds a database that indexes the documents in a way that is fast to search and locate the text snippets that contain the words that the user is looking for.

Manuel Lemos

Author
Name: Philipp Strazny (available for paid consulting)
Classes: 5 packages
Country: United States
Age: 57
All time rank: 1380201 in United States
Week rank: 420 (up), 49 in United States (up)
Innovation award
Nominee: 3x
Winner: 1x

Files

File                     Role      Description
FuzzyIndex.php           Class     FuzzyIndex class and utility classes
FuzzyIndexTest.php       Test      Tests for FuzzyIndex and heuristics
fuzzyindex_readme.html   Doc.      Explanation
demo_multilingual.php    Example   Usage example
