The Java program for performing full-text searches is called
and is located in the root directory of this DVD. To launch the search engine, simply change
to the DVD directory and type
java -jar DocSearch.jar.
This DVD comes with the DocSearcher search engine, which was developed by John Brown.
The tool was released under the GPL and uses the Lucene and POI Apache Open Source libraries,
as well as PDF box. DocSearcher requires a Java 2 runtime (version 1.5 or higher). You
java -version to find out which version you have installed
(assuming java is in your search path).
For your convenience, we also include a pair of scripts for launching DocSearcher in
Linux or Windows. To start the search engine from the desktop, navigate to the top directory
of the DVD and launch
search.sh (for Linux) or
search.bat (for Windows).
DocSearcher is a search tool. It uses the Open Source Lucene and POI Apache APIs as well as the Open Source PDF Box API to provide searching capabilities for HTML, MS Word, MS Excel, RTF, PDF, OpenOffice (and Star Office) Documents, and text documents. Other file formats are currently not supported.
To perform a search you simply type in the keywords or phrases you are looking for
and click the search button.
The search results will be presented in a Table. You can click on files as you would in a standard web browser.
To get back to your search results you can click on the Search Results Icon ().
There are 4 types of searches you can perform with DocSearcher: Keyword, Phrase, Boolean, and Wild Card. The example queries below describe each of these types of searches:
|Examples Searches||What They Do|
-with the "keywords" radio button selected
|Finds documents with the keywords "summer" and "vacation"
This type of search is called a keyword search.
-with the "phrase" radio button selected
|Finds documents with the phrase "summer vacation"
This type of search is called a phrase search.
|"john smith" -"john brown"||Finds documents with the "john smith" but excludes document with "john brown"
This type of search is called a boolean search.
|virtu*||Finds documents with words that start with virtu... For example virtuous, virtue, virtuosity, etc...
This type of search is called a wild card search.