What is Informatica Architecture?
Informatica Architecture is a comprised of multiple components or tools for ETL processing. The major ETL product release from the organization Informatica is known as the Informatica power center. It has four components to develop, configure, process and monitor the ETL flows or mappings. The components of Informatica architecture are Powercenter Designer, Workflow Manager, Workflow Monitor, Repository Manager. Apart from these data processing tools there are certain services associated with the Informatica architecture, the services know as Integration service, repository services, nodes and reporting services. The Node is the major component in which all the other services processed to extract, transform and load the data
The different editions of Informatica PowerCenter as available in the market are below
- Standard edition
- Advanced edition
- Premium edition
Below are few of the typical uses of Informatica:
- Any organization migrating from previously installed Legacy systems like mainframe to any new database. The data migration from the existing system to the current system can be performed.
- Operation setting up their own data warehouse will require an ETL tool for moving the data to the warehouse from the production system.
- Informatica supports the integration of data from different systems like multiple databases or multiple files which is extremely important while setting up a data warehouse.
- Informatica supports the cleansing of data as all the data from the production system not needed in a data warehouse.
- Among all the competitors offering ETL solutions, Informatica is the most efficient tool offering a wide range of editions for different products. This helps the client to choose any edition based on the specific requirements. Informatica has also been ranked among the topmost data integration product available in the market.
The Informatica Architecture
Informatica has a service-oriented architecture consisting of the below services and components.
- Repository service: The repository service is used to maintain the metadata along with providing access for the same to other services.
- Integration service: The integration service is used for the move of the data from source to target.
- Reporting service: The reporting service is used to facilitate the report generation.
- Nodes – Node is the computing platform on which the different services are executed.
- Informatica designer: The Informatica designer is used to create mappings showing the data flow between the source and target.
- Workflow manager: The Informatica workflow manager is used to create the workflow and other tasks and for executing the same.
- Workflow monitor: The Informatica workflow monitor is used for monitoring the execution of the workflows.
- Repository manager: The Informatica repository manager is used for managing the objects in the repository.
- Informatica Architecture is service-oriented architecture.
- The Informatica domain in the Informatica tool is the fundamental administrative unit.
- It consists of nodes and services which can further be divided into folders and subfolders based on the administration requirement and design architecture.
- Informatica administrator console web page creates a domain that can be created which looks like a folder. Under that folder, a node can be created along with the services.
- Inside the domain, a node acts as the logical representation of the machine. The different services and processes in the Informatica run in the node. There can be multiple nodes in a domain. A Gateway node also exists inside a domain. A Gateway node receives the request from different clients and routes them to respective services and nodes.
A domain consists of two types of services
Type 1: Service Manager which is responsible for managing operations like authorization, authentication, and log in. It is also responsible for running application services on different nodes along with managing users and groups.
Type 2: Application services consist of specific services like repository service, reporting service, and integration service. According to the configuration, the services run on respective nodes.
- The Informatica PowerCenter repository consists database which is maintained by repository service. The metadata is stored in database tables. The Informatica client tools that connect to repository are Informatica designer, Informatica workflow monitor, Informatica workflow manager. Repository service enables these clients to access the repository.
- Repository service is a service that Informatica offers for managing a repository. The repository service handles one request exclusively for one repository and could execute on multiple nodes for the increment of the performance. It also offers a version control mechanism with which we can maintain the different versions of the same object and avoid multiple users modifying the same object at the same time.
The domain is the most fundamental administrative control Informatica, which is the parent entity composed of different services like repository service, reporting service, integration service, and different nodes. The domain configuration is configured from the Informatica admin console which is a web page UI opened in a web browser. After logging in to the admin console, it shows different services like integration service, repository service, nodes, domain, etc. It also shows the status of the different services whether they are running or not.
Power Centre connectivity
- The different PowerCenter client tools like workflow manager, designer, repository manager, workflow manager are development tools that are installed on the client machine. The different mapping and objects that we create are saved in the Informatica server and therefore the clients must have the network connectivity with the server. Moreover, the clients connect to sources and targets for importing the structure definition along with metadata. Thus, it has to maintain connectivity with the source/ target.
- The PowerCenter uses TCP/IP to connect to integration service and repository service.
- PowerCenter uses ODBC drivers to connect between source and targets.
Repository service is responsible to maintain the connectivity between clients and the power Centre repository. It is a multithreaded process that keeps the metadata in the repository up to date.
Integration service is the service that is responsible for the execution of the tasks that are created in Informatica. It is a task inside the Informatica server that waits for a task to be assigned to it. As soon as a task is assigned to it, supposed to execute a workflow, the integration service fetches the details and starts the execution.
Conclusion – Informatica Architecture
Informatica is a service-oriented architecture consisting of components like application services, service Manager in the Informatica domain. It consists of application services like repository service, reporting service, and integration service. Thus, Informatica Architecture is formed on the accumulation of all these services.
This has been a guide to Informatica Architecture. Here we discuss what is Informatica? Informatica architecture, domain, and its two types. You can also go through our given articles to learn more-