HTML::Index::Store

HTML::Index::Store is a subclass'able module for storing inverted index files for the HTML::Index modules.
Download

HTML::Index::Store Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Perl Artistic License
  • Price:
  • FREE
  • Publisher Name:
  • Ave Wrigley
  • Publisher web site:
  • http://search.cpan.org/~awrigley/

HTML::Index::Store Tags


HTML::Index::Store Description

HTML::Index::Store is a subclass'able module for storing inverted index files for the HTML::Index modules. HTML::Index::Store is a subclass'able module for storing inverted index files for the HTML::Index modules.SYNOPSIS my $store = HTML::Index::Store->new( MODE => 'r', COMPRESS => 1, DB => $db, STOP_WORD_FILE => $path_to_stop_word_file, );The HTML::Index::Store module is generic interface to provide storage for the inverted indexes used by the HTML::Index modules. The reference implementation uses in memory storage, so is not suitable for persistent applications (where the search / index functionality is seperated).There are two subclasses of this module provided with this distribution; HTML::Index::Store::BerkeleyDB and HTML::Index::Store::DataDumperCONSTRUCTOR OPTIONSConstructor options allow the HTML::Index::Store to provide a token to identify the database that is being used (this might be a directory path of a Berkeley DB implementation, or a database descriptor for a DBI implementation). It also allows options to be set. Some of these options are then stored in an options table in the database, and are therefore "sticky" - so that the search interface can automatically use the same options setting used at creating time.DBDatabase identifier. Available to subclassed modules using the DB method call. Not sticky.MODEEither 'r' or 'rw' depending on whether the HTML::Index::Store module is created in read only or read/write mode. Not sticky.STOP_WORD_FILEThe path to a stopword file. If set, the same stopword file is available for both creation and searching of the index (i.e. sticky).COMPRESSIf true, use Compress::Zlib compression on the inverted index file. The same compression is used for searching and indexing (i.e. sticky).STEMAn option, if set, causes the indexer to use the Lingua::Stem module to stem words before they are indexed, and the searcher to use the same stemming on the search terms (i.e. sticky). Takes a locale as an argument.SOUNDEXAn option, if set, causes the searcher to use the Text::Soundex to expand a query term on search if an exact match isn't found. To work, this option needs to be set at indexing, so that entries for soundex terms can be added to the index (i.e. sticky). If this has been done, then a SOUNDEX option can be passed to the search function to ennable soundex matching for a particular query.VERBOSEAn option which causes the indexer / searcher to print out some debugging information to STDERR.NOPACKAn option which prevents the storer from packing data into binary format. Mainly used for debugging (sticky). Requirements: · Perl


HTML::Index::Store Related Software