skip to page content
Office of Administrative Law Judges

Advanced Search Options

Search for this:
 Order by
Without this:
SEARCH HELP
Using logic operators
Phrase Matching
Wild Card Searches
Using the Thesaurus
Regular Expressions
Quantity Searching
Using Word Forms
Controlling Proximity
PROXIMITY
line
sentence
paragraph
page
WORD FORMS
Exact match
Plural & posessives
Any word forms
Custom
RANKING FACTORS
Rank Factor Importance
Word ordering
Word proximity
Database Frequency
Document Frequency
Position in text
Depth in site


Applying Search Logic

Texis and Metamorph use set logic for text queries. Set logic is easier to use and provides more abilities than boolean. The examples below make reference to single keywords, but keep in mind that each keyword can represent an entire list of things or any of the special pattern matchers.

Sets (or lists) of things are specified by placing the elements within parenthesis, separated by commas. Example: (bob,joe,sam,sue) . In the examples below, you could replace any of the keywords with a list like this.

The default behavior of the search is to locate an intersection (or 'AND') of every element within a query. This means that the query: "microsoft bob interface" is the equivalent to the boolean query: "microsoft AND bob AND interface" .

- (without)
The - (minus) is the most commonly used logic symbol. It means the answer should EXCLUDE references to that item.
+ (mandatory)
The + (plus) symbol in front of a search item means that the answer MUST INCLUDE that item. This is generally used in conjunction with the permutation operation.
@N (permute)
The @ followed by a number indicates how many intersections to locate of the terms in your query. This may be confusing at first, but it is very powerful.

 

Query Finds
bob sam joe Bob with Sam and Joe
bob sam -joe Bob with Sam without Joe
bob sam joe @1 Bob with Sam, or Bob with Joe, or Joe with Sam
A B C D @1 AB or AC or AD or BC or BD or CD
+A B C D @1 ABC or ABD or ACD
A B C -D @1 ( AB or AC or BC ) without D

Table 6.2: Search Logic Examples

The plus(+) and minus(-) operators must be attached to the term to which they apply. There must be a space between the operator and any preceding term.

Correct Incorrect
bob +sam -joe bob + sam - joe
bob+sam-joe

Go Back to Top of Page


Keywords Phrases and Wild-cards

To locate words, just type them in as you would in a word processor. Letter cases will be ignored.

The wild-card character * (asterisk) may be used to match just the prefix of a word or to ignore the middle of something.

If the item you wish to locate is more complicated than the simple * wild-card can accomplish, try using the regular expression matcher (http://www.thunderstone.com/texis/site/pages/regexp.html).

To locate a number of adjacent words in a specific order, surround them with " (double quotation) characters. Putting a - (hyphen) between words will also force order and one word proximity.

 

Query Locates
john john, John
"john public" John Public
web-browser Web browser, web-browser
John*Public John Q. Public, John Public
456*a*def 1-456-789-ABCDEF
activate activate, activation, activated, ... *

Go Back to Top of Page


Invoking Thesaurus Expansion

The Search Appliance has a vocabulary of over 250,000 word and phrase associations. Each entry is generally classifiable by either its meaning or part of speech.

Depending on the administrator's Synonyms setting for this profile, synonyms may already be included for each term in your query. If not, synonyms may be included for individual terms within your query by preceding them with a ~ (tilde) character.

Go Back to Top of Page


Using the Special Pattern Matchers

These pattern matchers are used to locate hard-to-find items within text:

If improperly used these pattern matchers can slow queries. Therefore they require other keyword(s) in the query and are disabled entirely under Page proximity. For more details see the Vortex manual on Query Protection (http://www.thunderstone.com/site/vortexman/link_qprot.html).

Go Back to Top of Page


Using Word Forms

  The Word forms options give you control over how many variations of your query terms will be sought in your search.

Exact match:
Only exact matches will be allowed. (the default)
Plurals & possessives:
Plural and possessive forms will be found. (s, es, 's)
Any word forms:
As many word forms as can be derived will be located.
Custom:
Uses the Custom Suffix List, Custom Suffix Default Removal, and Custom Suffix Min Length settings to create your own custom behavior.

 

Word president
EXACT president
PLURAL (above) + presidents president's
ANY (above) + presidential presidency preside presides presiding presided
Word tight
EXACT tight
PLURAL (above) + tights
ANY (above) + tightly tightening tightened tighter tightest
Word program
EXACT program
PLURAL (above) + programs program's
ANY (above) + programming programmatic programmed programmer programmable

Table 6.4: Word Form Examples

We call this morpheme processing, and it is generally smarter than a traditional "stemming" algorithm. It doesn't just rip the end off a word, it actually checks to see if it could be a valid form of the search term. More information is available at http://www.thunderstone.com/site/texisman/link_ling.html .

Notes: Thesaurus terms are also treated in the same manner. Words smaller than 4-5 characters will not be morpheme processed.

Go Back to Top of Page


Controlling Proximity

These options give you control over the region in which a match must be found.

line:
match terms must be located within the same line.
sentence:
all terms within the same sentence.
paragraph:
match terms must be located within the same paragraph.
page:
(default) all terms within the same document.

In all cases the best possible matches for your query are located and ordered by decreasing quality. A bar graph is produced to indicate the quality of each answer.

Go Back to Top of Page