EDUCBA

EDUCBA

MENUMENU
  • Free Tutorials
  • Free Courses
  • Certification Courses
  • 360+ Courses All in One Bundle
  • Login
Home Data Science Data Science Tutorials Data Warehouse Tutorial Data Warehouse Process
Secondary Sidebar
Data Warehouse Tutorial
  • Basic
    • What is Data Warehouse
    • Data Warehouse tools
    • Career in Data Warehousing
    • Benefits of Data Warehouse
    • Data Warehouse Architecture
    • Data Warehouse Design
    • Data Warehouse Implementation
    • Data Warehouse Features
    • Data Warehouse Modeling
    • Data Warehouse Software
    • Data Warehousing
    • Types of Data Warehouse
    • 10 Popular Data Warehouse Tools
    • Data Lake Architecture
    • Three Tier Data Warehouse Architecture
    • Data Warehouse Process
    • Database Parallelism
    • What is OLTP
    • What is OLAP
    • OLAP Tools
    • Types of OLAP
    • Operations in OLAP
    • MOLAP
    • HOLAP
    • Data Warehouse Schema
    • Data Warehouse Components
    • Snowflake Schema
    • Snowflake Architecture
    • What is Star Schema
    • Galaxy Schema
    • What is Fact Table
    • Kimball Methodology
    • Data Warehouse Testing
    • Operational Data Stores
  • ETL
    • What is Data Mart
    • What is Data Cube
    • What is a Data Lake
    • What is Data Integration
    • What is ETL
    • What is ETL Testing
    • ETL Testing Tools
    • ETL architecture
    • Dimension Table
    • Multidimensional Data Model
    • Fact Constellation Schema
    • ETL Process
  • Interview Questions
    • Data Warehouse Interview Questions
    • ETL Interview Questions
    • ETL Testing Interview Questions
    • Data Warehousing Interview Questions

Data Warehouse Process

By Priya PedamkarPriya Pedamkar

Data Warehouse Process

Introduction to Data Warehouse Process

Data Warehouse is a Data Compilation, Organization, and structural Management process which involves a series of activities performed over the given data. This process can be worked upon by a variety of data sources, which simply means that the data can be of heterogeneous nature. It can be defined as a method of altering the data collected from multiple sources into a similar structured form of readily usable facts/ figures, and for constructing it to be accessible to the business professionals for Analysis and Decision-making processes. A Data Warehouse also goes by the names ‘Decision Support System’, ‘Business Intelligence Solution’, ‘Analytic Application’, ‘Management Information System’, ‘Executive Information System’, etc.

Types of Data Warehouse Architecture

The Data Warehouse Architecture can be built based on two different process prototypes, such as the below:

Start Your Free Data Science Course

Hadoop, Data Science, Statistics & others

  • Centralized Architecture
  • Distributed Architecture

1. Centralized Architecture

As the name says, the Centralized Data Warehouse process Architecture is a solitary unit of a system that is dedicated to the Data Warehouse processing. It is the traditional method for constructing a Data Warehouse system, and this Architecture is the preferred model for organizations of small to medium size. The small to medium-sized organizations deal with a much lesser amount of data when compared to the larger-sized organizations.

This Architecture is simpler in structure, and so it has a lesser number of components involved. The main components in this model are the multiple data sources, the centralized Data Warehouse, and the client units which receive the processed data from the centralized Data Warehouse. This Data Warehouse can process a diverse range of data sources, which can contain any type or form of data in them, as it is a common property of any Data Warehouse system.

This model is considered to be an efficient type of Architecture for an organization with nominal storage space, lesser hardware devices, limited funding, fewer technical support professionals, etc. The complete Data warehouse process takes place in one physical location, which minimizes the communication delays that cannot be handled by a smaller or medium-sized organization.

The outcome from this type of Data Warehouse Architecture is used as a Business Intelligence source input. Business Intelligence process involves the Data Warehouse’s processed data as the input for creating Analytical Results, and Generation of Reports for the data fetched from the system. These results and reports will further be used by the Business Stack Holders for structuring the business flow and make meaningful decisions to run the business successfully.

2. Distributed Architecture

The Distributed Data Warehouse process Architecture consists of the same outline for the system implementation. The main difference for a Distributed Data Warehouse system against the Centralized Data Warehouse system is that the components of the warehouse are not located in the centralized form. Instead, it is in a distributed form, where the data sources can be in different locations or system units, the data processing can be carried out in a dispersed way, etc.

This type of Architecture can be applied to larger organizations as well, as the distributed nature can aid to handle a larger amount of data for Analysis and Report generation. This model can surmount the disadvantages of the centralized Data Warehouse process Architecture, and hence it is seen as an alternative option for the Centralized model.

In this type of Architecture, all the activities are assigned in different functional units. The distributed processing involves the activities like the data collection from heterogeneous data sources, processing of the collected data, organizing and placing the processed data into the data warehouse system, retrieving the information from the data warehouse, utilizing the results for analytical processing, and report creation, and finally employing the generated results for business decision making.

Any Distributed Data Warehouse Architecture can be managed into five different types of configuration, such as:

  • Client-Server Architecture
  • Three – Tier Architecture
  • N – Tier Architecture
  • Cluster Architecture
  • Peer – to – Peer Architecture
Client-Server Architecture

The Client-Server Architecture has two components, the client and the server, where the data collection, transformation, and loading is performed by the Client units, while the Server handles the data warehouse system development, processing the contents of the warehouse and the overall data management.

Three – Tier Architecture

The Three Tier Architecture contains the client as one tier, the server as one tier, and the rest of the connected systems into the third tier. The third tier unit can be used for enabling communication between the client system and the server system.

N – Tier Architecture

N – Tier is nothing but a multiple-tier Architecture, where the client-server architecture is connected with other intermediate units, such as the downstream applications, middleware structures, along with multiple client and server units.

Cluster Architecture

A Cluster system is where each node in the system is responsible for its own individual activity, and the nodes cannot function on their own without collaborating with other nodes. This allows the entities to be connected as a network and to process concurrently by utilizing the respective resources assigned to each node.

Peer – To – Peer Architecture

In the Peer – To – Peer Architecture, each node will be capable of achieving all the activities including the client, the server, the data processing, etc. The responsibilities can be shared amongst the nodes, hence each unit is called as the ‘Peer’ in the Data Warehouse system.

Advantages of Data Warehouse process

Below are the advantages of the Data Warehouse process Architecture,

  • A well-designed Data Warehouse system can led to a higher performance during the Decision –Making Process.
  • Highly efficient in terms of generating Business Intelligence solutions.
  • Controls the wastage of time due to the regulated data processing.
  • No Data Redundancy, enhanced Data Quality, Consistent and Reliable output Data.

Conclusion

The Data Warehouse process is an essential activity in Data Science Technology, as it plays a vital role in the Business Intelligence Model and the Business decision-making process. As the application of Data Warehouse is observed to be in business areas like Banking, Retail, Technology, Defence, etc, it is observed to be a growing technique for Data Management.

Recommended Articles

This is a guide to Data Warehouse Process. Here we also discuss the introduction and types of data warehouse architecture along with advantages. You may also have a look at the following articles to learn more –

  1. Data Warehouse Schema
  2. Data Warehouse Testing
  3. Benefits of Data Warehouse
  4. Data Warehouse vs Data Mart
Popular Course in this category
Free Data Science Course
4+ Hours | Lifetime Access | Verifiable Certificates
4.5
Price

View Course

Related Courses

Business Intelligence Training (12 Courses, 6+ Projects)4.9
All in One Data Science Bundle (360+ Courses, 50+ projects)4.8
Data Visualization Training (15 Courses, 5+ Projects)4.7
Primary Sidebar
Footer
About Us
  • Blog
  • Who is EDUCBA?
  • Sign Up
  • Live Classes
  • Corporate Training
  • Certificate from Top Institutions
  • Contact Us
  • Verifiable Certificate
  • Reviews
  • Terms and Conditions
  • Privacy Policy
  •  
Apps
  • iPhone & iPad
  • Android
Resources
  • Free Courses
  • Database Management
  • Machine Learning
  • All Tutorials
Certification Courses
  • All Courses
  • Data Science Course - All in One Bundle
  • Machine Learning Course
  • Hadoop Certification Training
  • Cloud Computing Training Course
  • R Programming Course
  • AWS Training Course
  • SAS Training Course

ISO 10004:2018 & ISO 9001:2015 Certified

© 2023 - EDUCBA. ALL RIGHTS RESERVED. THE CERTIFICATION NAMES ARE THE TRADEMARKS OF THEIR RESPECTIVE OWNERS.

EDUCBA
Free Data Science Course

Hadoop, Data Science, Statistics & others

By continuing above step, you agree to our Terms of Use and Privacy Policy.
*Please provide your correct email id. Login details for this Free course will be emailed to you
EDUCBA

*Please provide your correct email id. Login details for this Free course will be emailed to you

Let’s Get Started

By signing up, you agree to our Terms of Use and Privacy Policy.

EDUCBA

*Please provide your correct email id. Login details for this Free course will be emailed to you
EDUCBA

*Please provide your correct email id. Login details for this Free course will be emailed to you
EDUCBA Login

Forgot Password?

By signing up, you agree to our Terms of Use and Privacy Policy.

This website or its third-party tools use cookies, which are necessary to its functioning and required to achieve the purposes illustrated in the cookie policy. By closing this banner, scrolling this page, clicking a link or continuing to browse otherwise, you agree to our Privacy Policy

Loading . . .
Quiz
Question:

Answer:

Quiz Result
Total QuestionsCorrect AnswersWrong AnswersPercentage

Explore 1000+ varieties of Mock tests View more