Explain Data Warehouse Architecture.

 Data Warehouse Architecture

Data warehouses often adopt a three-tier architecture, as presented in Figure below:



1. Data Sources: A data warehouse system uses heterogeneous sources of data either from operational databases or from some external sources.

2. Bottom Tier: The bottom tier of the architecture is the data warehouse database server. It is the relational database system. Data is fed into the bottom tier by some back-end tools and utilities. The back end tools and utilities perform the following functions:

  • Data extraction: gathers data from multiple, heterogeneous and external sources.
  • Data cleaning: Detect errors in data and correct them when possible.
  • Data transformation: converts data from legacy or host format to warehouse format.
  • Load: which sorts, summarizes, checks integrity, and builds indices and partitions.
  • Refresh: This involves updating data sources to the warehouse.


3. Middle Tier: Middle tier is an OLAP server that can be implemented using either the relational OLAP (ROLAP) model or the multidimensional OLAP (MOLAP) model.

  • ROLAP is an extended relational database management system. The ROLAP maps the operations on multidimensional data to standard relational operations.
  • MOLAP directly implements multidimensional data and operations.


4. Top Tier: The top tier is a front-end client layer. The top tier layer holds the following tools:

  • Query and Reporting tools: Production reporting tool.
  • Analysis tools: Prepare charts based on analysis.
  • Data mining tools: Discover hidden knowledge, patterns.

Comments

Popular posts from this blog

What are different steps used in JDBC? Write down a small program showing all steps.

Discuss classification or taxonomy of virtualization at different levels.

Pure Versus Partial EC