06161130 is referenced by 726 patents and cites 9 patents.

A technique, specifically a method and an apparatus that implements the method, which uses a probabilistic classifier (370) to detect, for a given recipient, electronic mail (e-mail) messages in an incoming message stream that the recipient is likely to consider "junk". Specifically, the invention discriminates message content for that recipient through a probabilistic classifier (e.g., a support vector machine) trained on prior content classifications. The classifier produces a quantitative probability measure, i.e., an output confidence level, for each message; this measure is compared against a predefined threshold, and the message is classified as either, e.g., spam or legitimate mail and then, e.g., stored in a corresponding folder (223, 227) for subsequent retrieval by and display to the recipient. Based on the probability measure, the message can alternatively be classified into one of a number of different folders, depicted in a predefined visually distinctive manner, or simply discarded in its entirety.
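
The flow described in the abstract (train on prior classifications, produce a probability for each message, compare it against a threshold, pick a folder) can be illustrated with a minimal sketch. This is not the patented implementation: it swaps in a small naive Bayes model with Laplace smoothing in place of the support vector machine the abstract gives as an example, and the names `train`, `junk_probability`, `route`, the folder labels, and the 0.5 threshold are all hypothetical choices made for illustration. It also assumes the training set contains at least one junk and one legitimate message.

```python
import math
from collections import Counter


def train(messages):
    """Build a toy model from (text, is_junk) pairs labelled by the recipient."""
    word_counts = {True: Counter(), False: Counter()}
    class_counts = Counter()
    for text, is_junk in messages:
        class_counts[is_junk] += 1
        word_counts[is_junk].update(text.lower().split())
    vocab = set(word_counts[True]) | set(word_counts[False])
    return word_counts, class_counts, vocab


def junk_probability(model, text):
    """Return the estimated probability that `text` is junk (naive Bayes stand-in)."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    log_score = {}
    for label in (True, False):
        n_words = sum(word_counts[label].values())
        # Class prior; assumes both classes appear in the training set.
        score = math.log(class_counts[label] / total)
        for word in text.lower().split():
            if word in vocab:  # words never seen in training are ignored
                # Laplace (add-one) smoothing avoids zero probabilities.
                score += math.log(
                    (word_counts[label][word] + 1) / (n_words + len(vocab))
                )
        log_score[label] = score
    # Normalise the two log scores into a probability for the junk class.
    peak = max(log_score.values())
    weights = {label: math.exp(s - peak) for label, s in log_score.items()}
    return weights[True] / (weights[True] + weights[False])


def route(probability, threshold=0.5):
    """Compare the confidence level against a threshold and pick a folder."""
    return "junk" if probability >= threshold else "legitimate"


# Hypothetical training data standing in for the recipient's prior classifications.
training_set = [
    ("free money now", True),
    ("win free prize", True),
    ("meeting agenda attached", False),
    ("lunch tomorrow at noon", False),
]
model = train(training_set)
for message in ("free money", "lunch meeting"):
    p = junk_probability(model, message)
    print(message, round(p, 3), route(p))
```

Raising the threshold in this sketch makes the filter more conservative about labelling mail as junk. Banding the probability into several ranges instead of using one cut-off would correspond to the multi-folder alternative mentioned at the end of the abstract.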

Title
Technique which utilizes a probabilistic classifier to detect "junk" e-mail by automatically updating a training set and re-training the classifier based on the updated training set
Application Number
9/102837
Publication Number
6161130
Application Date
June 23, 1998
Publication Date
December 12, 2000
Inventor
John C Platt
Bellevue
WA, US
Mehran Sahami
Stanford
CA, US
Susan T Dumais
Kirkland
WA, US
David E Heckerman
Bellevue
WA, US
Eric Horvitz
Kirkland
WA, US
Agent
Michaelson & Wallace
Agent
Peter L Michaelson
Assignee
Microsoft Corporation
WA, US
IPC
G06F 17/30
G06F 15/173
G06F 15/16