1
Rakesh Agrawal, Soumen Chakrabarti, Byron Edward Dom, Prabhakar Raghavan: Multilevel taxonomy based on features derived from training documents classification using Fisher values as discrimination values. International Business Machines Corporation, Gates & Cooper, May 15, 2001: US06233575 (370 worldwide citations)

A system, process, and article of manufacture for organizing a large text database into a hierarchy of topics and for maintaining this organization as documents are added and deleted and as the topic hierarchy changes. Given sample documents belonging to various nodes in the topic hierarchy, the tok ...


2
Soumen Chakrabarti, Byron Edward Dom, David Andrew Gibson, Prabhakar Raghavan, Sridhar Rajagopalan, Shanmugasundaram Ravikumar, Andrew Tomkins: Method for interactively creating an information database including preferred information elements, such as preferred-authority, World Wide Web pages. International Business Machines Corporation, John L Rogitz, March 12, 2002: US06356899 (268 worldwide citations)

A method for identifying, filtering, ranking and cataloging information elements; as for example, World Wide Web pages, of the Internet in whole, part, or in combination. The method is preferably implemented in computer software and features steps for enabling a user to interactively create an infor ...


3
Soumen Chakrabarti, Byron Edward Dom, Martin Henk van den Berg: System and method for focussed web crawling. International Business Machines Corporation, John L Rogitz, July 9, 2002: US06418433 (261 worldwide citations)

A focussed Web crawler learns to recognize Web pages that are relevant to the interest of one or more users, from a set of examples provided by the users. It then explores the Web starting from the example set, using the statistics collected from the examples and other analysis on the link graph of ...


4
Soumen Chakrabarti, Byron Edward Dom, Piotr Indyk: Enhanced hypertext categorization using hyperlinks. International Business Machines Corporation, Altera Law Group, May 14, 2002: US06389436 (194 worldwide citations)

A method, apparatus, and article of manufacture for a computer implemented hypertext classifier. A new document containing citations to and from other documents is classified. Initially, documents within a neighborhood of the new document are identified. For each document and each class, an initial ...


5
Soumen Chakrabarti, Byron Edward Dom, David Andrew Gibson, Prabhakar Raghavan, Sridhar Rajagopalan, Shanmugasundaram Ravikumar, Andrew Tomkins: Method for cataloging, filtering, and relevance ranking frame-based hierarchical information structures. International Business Machines Corporation, John L Rogitz, December 25, 2001: US06334131 (166 worldwide citations)

A method for cataloging, filtering and ranking information, as for example, World Wide Web pages of the Internet. The method is preferably implemented in computer software and features steps for enabling a user to interactively create an information database including preferred information elements ...


6
Byron Edward Dom, Dragutin Petkovic: Video story board user interface for selective downloading and displaying of desired portions of remote-stored video data objects. International Business Machines Corporation, James C Pintner Esq, McGinn & Gibb P C, December 26, 2000: US06166735 (152 worldwide citations)

A system and method are provided for supporting video browsing over a communication network such as the Internet/World Wide Web. A graphical user interface is provided through a client software tool such as a Web browser. A client/user selects a video data object stored at a remote server. A set of ...


7
Soumen Chakrabarti, Byron Edward Dom: Feature diffusion across hyperlinks. International Business Machines Corporation, Gray Cary Ware Freidenrich, September 26, 2000: US06125361 (94 worldwide citations)

A system and method for ranking wide area computer network (e.g., Web) pages by popularity in response to a query. Further, using a query and the response thereto from a search engine, the system and method finds additional key words that might be good extended search terms, essentially generating a ...


8
Soumen Chakrabarti, Byron Edward Dom, Sunita Sarawagi: System and method for mining surprising temporal patterns. International Business Machines Corporation, Khanh Q Tran Esq, February 13, 2001: US06189005 (69 worldwide citations)

A system and method for data mining is provided in which temporal patterns of itemsets in transactions having unexpected support values are identified. A surprising temporal pattern is an itemset whose support changes over time. The method may use a minimum description length formulation to discover ...


9
Soumen Chakrabarti, Byron Edward Dom, David Andrew Gibson, Jon Michael Kleinberg, Prabhakar Raghavan, Sridhar Rajagopalan: Method and system for filtering of information entities. International Business Machines Corporation, John L Rogitz, February 7, 2006: US06996572 (49 worldwide citations)

A system and method are provided for eliciting interesting structure from a collection of entities or resources with explicit and/or implicit, static and/or dynamic relations, called “affinities,” between them. Interesting structure includes (1) notions of quality, authority, or definitiveness of in ...


10
Soumen Chakrabarti, Byron Edward Dom, David Andrew Gibson, Prabhakar Raghavan, Sridhar Rajagopalan, Shanmugasundaram Ravikumar, Andrew Tomkins: Method for interactively creating an information database including preferred information elements, such as preferred-authority, World Wide Web pages. International Business Machines Corporation, John L Rogitz, January 1, 2002: US06336112 (46 worldwide citations)

A method for cataloging, filtering and ranking information, as for example, World Wide Web pages of the Internet. The method is preferably implemented in computer software and features steps for enabling a user to interactively create an information database including preferred information elements ...