Comparison of Excalibur and PDF WebSearch Boolean Commands
Query: information retrieval
Excalibur: finds documents containing 'information' or 'retrieval'
PDF WebSearch: finds the phrase "information retrieval"
PDF WebSearch processes a set of words without operators as a phrase and will search for the phrase, whereas Excalibur assumes an "or" relationship between the words. The number of search hits in PDF WebSearch is equal to the number of times the complete phrase occurs in the document.
PDF WebSearch can be switched from boolean to natural language queries and will interpret a set of words as having an "or" relationship when the Natural Language option is checked.
Query: information or retrieval
Excalibur: finds documents containing 'information' or 'retrieval'
PDF WebSearch: finds documents containing 'information' or 'retrieval'
Both are the same.
Query: information and retrieval
Excalibur: finds documents containing both 'information' and 'retrieval'
PDF WebSearch: finds docs containing both 'information' and 'retrieval'
Both are the same.
Query: information not retrieval
Excalibur: finds documents containing 'information' but not 'retrieval'
PDF WebSearch: this is not a valid boolean syntax in PDF WebSearch. It will be treated as a phrase. The valid argument is:
information and not retrieval
The word "not" can be used at the beginning of a phrase without following the word "and". The valid argument is:
not retrieval
The word "not" can also be used with "within". The boolean expression for "within" is "w/9". For example, the expression
information and not w/3 retrieval
will return documents containing the word "information" but not if the word is within 3 words of the word "retrieval"
Query: (information not retrieval) and WAIS
Excalibur: finds documents containing 'WAIS', plus 'information' but not 'retrieval'
PDF WebSearch: the same, but expressed as:
(information and not retrieval) and WAIS
Query: web*
Excalibur: finds documents containing words starting with 'web'
PDF WebSearch: the same
PDF WebSearch: the same.
PDF WebSearch supports a complete set of special symbols, as follows:
?
|
Matches any single character. Example: dis? matches dish or disk
|
*
|
Matches any number of characters. Example: dis* matches dishes
|
~
|
Stemming. Example: apply~ matches apply, applies, applied.
|
%
|
Fuzzy search. Example: ba%nana matches banana, bananna.
|
#
|
Phonic search. Example: #smith matches smith, smythe.
|
&
|
Synonym search. Example: fast& matches quick.
|
~~
|
Numeric range. Example: 12~~24 matches 18.
|
:
|
Variable term weighting. Example: Treasure:4 w/5 Island:1
|
|