Review and how it works
DataparkSearch Engine is a fully open-source, web-based search engine released under the GNU General Public License. It is designed to organize search within a web site, a group of web sites, an intranet or a local system.
DataparkSearch consists of two parts. The first is the indexing mechanism (indexer). The indexer follows HTML hypertext references and stores the words it finds, along with newly discovered references, in a database. The second is a web CGI front-end that provides search over the data collected by the indexer.
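The two-part design described above can be illustrated with a minimal sketch. This is not DataparkSearch's actual code: the names `build_index` and `search`, the in-memory `pages` dictionary standing in for fetched documents, and the dictionary standing in for the database are all assumptions made for illustration.

```python
from collections import defaultdict, deque
from html.parser import HTMLParser
import re


class LinkAndTextParser(HTMLParser):
    """Collects href targets and text content from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text_parts.append(data)


def build_index(pages, start_url):
    """Indexer side: walk references breadth-first, return word -> set of URLs.

    `pages` is a hypothetical url -> html mapping used instead of real fetching.
    """
    index = defaultdict(set)
    queue = deque([start_url])
    seen = {start_url}
    while queue:
        url = queue.popleft()
        html = pages.get(url)
        if html is None:
            continue  # reference to a page that is not available
        parser = LinkAndTextParser()
        parser.feed(html)
        # Store every word found on the page.
        for word in re.findall(r"\w+", " ".join(parser.text_parts).lower()):
            index[word].add(url)
        # Queue newly discovered references for later visits.
        for link in parser.links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return index


def search(index, query):
    """Front-end side: return the URLs that contain every query word."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


pages = {
    "/index.html": '<p>Welcome to the search demo</p><a href="/about.html">About</a>',
    "/about.html": '<p>About the search engine</p><a href="/index.html">Home</a>',
}
index = build_index(pages, "/index.html")
print(sorted(search(index, "search engine")))  # prints ['/about.html']
```

The real indexer stores its data in a database and fetches documents over the supported URL schemes; the sketch only shows the separation between collecting data and querying it.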
Brief details
Support for http, https, ftp, nntp and news URL schemes;
Htdb virtual URL scheme support for indexing SQL databases;
Built-in support for the text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types;
External parser support for other document types;
Ability to index multilingual sites that use content negotiation;
Search across all word forms using ispell affixes and dictionaries;
Fuzzy search based on acronyms and abbreviations;
Stopwords and synonyms lists;
Boolean query language support;
Sorting of results by relevance, popularity rank, last modified time and importance (the product of relevance and popularity rank);
Multiple character sets support;
Accent insensitive search;
Phrase segmentation for Chinese, Japanese, Korean and Thai languages;
mod_dpsearch, a search module for the Apache web server;
Internationalized Domain Names support;
The Summary Extraction Algorithm.
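Two of the features listed above, sorting by importance and accent-insensitive search, can be sketched in a few lines. This is only an illustration under stated assumptions: the `Hit` record, its score values, and the `fold_accents` helper are hypothetical and do not reflect how DataparkSearch computes or stores these values internally.

```python
import unicodedata
from dataclasses import dataclass


@dataclass
class Hit:
    url: str
    relevance: float   # hypothetical score for how well the page matches the query
    popularity: float  # hypothetical popularity rank of the page


def importance(hit):
    """Importance as described in the feature list: relevance times popularity rank."""
    return hit.relevance * hit.popularity


def fold_accents(text):
    """Strip combining accent marks so that accented and plain forms compare equal."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch)).lower()


hits = [
    Hit("/a.html", relevance=0.9, popularity=0.2),
    Hit("/b.html", relevance=0.6, popularity=0.5),
    Hit("/c.html", relevance=0.3, popularity=0.4),
]
ranked = sorted(hits, key=importance, reverse=True)
print([h.url for h in ranked])  # prints ['/b.html', '/a.html', '/c.html']
print(fold_accents("Café") == fold_accents("cafe"))  # prints True
```

Note that the most relevant page (/a.html) does not come first here, because its low popularity rank pulls its importance below that of /b.html.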
How to install & uninstall DataparkSearch - system requirements
Nothing special is required to install, use or uninstall it.