06360215 is referenced by 346 patents.

A method and apparatus are provided for retrieving documents from a collection of documents based on information other than the contents of a desired document. The collection of documents, which may be a hypertext system or documents available via the World Wide Web, is indexed. In one embodiment, an indexing process of a search engine receives one or more specifications that identify documents, or document locations, and non-content information such as a tag word or code word. The indexing process searches the index to identify all documents in the index that match one or more of the specifications. If a match is found, the tag word is added to the index, and information about the matching document is stored in the index in association with the tag word. A search query is submitted to the search engine. The search query is automatically modified to add a reference to the tag word, such as a query term that will exclude any index entry for a document associated with the tag word. The search is executed against the index, and a set of search results is generated. Accordingly, the search results automatically exclude all documents associated with the tag word. These techniques may be used, for example, to implement a Web search service that produces more accurate search results or that prevents certain documents, such as pornographic materials, from appearing in search results.

Title
Method and apparatus for retrieving documents based on information other than document content
Application Number
9/186058
Publication Number
6360215 (B1)
Application Date
November 3, 1998
Publication Date
March 19, 2002
Inventor
J Eric Baldeschwieler
San Francisco
CA, US
Paul Gauthier
San Mateo
CA, US
Douglass R Judd
San Jose
CA, US
Agent
Hickman Palermo Truong & Becker
US
Agent
Edward A Becker
US
Assignee
Inktomi Corporation
CA, US
IPC
G06F 7/00
View Original Source