Data Warehouses and Data Mining
Data Warehouse
Data warehouse was first coined by Bill Inmon in 1990. According to the Inmon, a data warehouse is a subject-oriented, integrated, time-variant, and non-volatile collection of data. This data helps analysts to make informed decisions in an organization.
- A data warehouse is a database that is kept separate from the organization's operational database.
- Possesses consolidated historical data, which helps the organization to analyze its business.
- Help in the integration of diversity of application systems. Why a data warehouse is separated from operational databases?
- An operational database is constructed for well-known tasks and workloads such as searching particular records. Indexing etc.
- An operational database query allows reading and modifying operations.
- An operational database maintains current data.
Features:
- Data warehouse is subject oriented because it provides information around a subject.
- Data is integrated from heterogeneous sources.
- Data collected in a data warehouse is identified with a particular time period.
- Data is non volatile.
- A data warehouse does not require transaction processing, recovery, and concurrency controls.