06208990 is referenced by 132 patents and cites 8 patents.

A computer software architecture to automatically optimize the throughput of the data extraction/transformation/loading (ETL) process in data warehousing applications. This architecture has a componentized aspect and a pipeline-based aspect. The componentized aspect refers to the fact that every transformation used in this architecture is built up with transformation components selected from an extensible set of transformation components. Besides simplifying source code maintenance and adjustment for the data warehouse users, these transformation components also provide these users the building blocks to effectively construct pertinent and functionally sophisticated transformations in a pipelined manner. Within a pipeline, each transformation component automatically stages or streams its data to optimize ETL throughput. Furthermore, each transformation either pushes data to another transformation component, pulls data from another transformation component, or performs a push/pull operation on the data. Thereby, the pipelining; staging/streaming; and pushing/pulling features of the transformation components effectively optimizes the throughput of the ETL process.

Title
Method and architecture for automated optimization of ETL throughput in data warehousing applications
Application Number
9/116426
Publication Number
6208990 (B1)
Application Date
July 15, 1998
Publication Date
March 27, 2001
Inventor
Mohan Sankaran
Union City
CA, US
Frank Joseph DeRose
Fremont
CA, US
Girish Pancha
San Francisco
CA, US
Jyotindra Pramathnath Gautam
Fremont
CA, US
Sankaran Suresh
Santa Clara
CA, US
Agent
Wagner Murabito & Hao
US
Assignee
Informatica Corporation
CA, US
IPC
G06F 17/30
View Original Source