Title: Informatica CDQ | How to Create & Execute a Data Profile + Metrics Explained
Day 5 - Informatica Cloud Data Quality (CDQ)
Agenda
- Secure agent Installation and configuration
- How to Start Secure agent
- Create connections
- Data profiling service
- Create Data profile and execute it
- Explain data profiling metrics
InventModel Technology Solution
91-98219-31210
support_at_inventmodel.com
Secure agent Installation and configuration
- Secure Agent: a lightweight program that facilitates secure data integration between cloud and on-premises environments, enabling secure communication across firewalls
- Log in to Administrator Service in IDMC
- Download Secure agent
- Install Secure agent
- Configure secure agent
- Enable services and connectors
- Create Flat file connection and test
How to Start Secure agent
- To start the Informatica Secure Agent in Informatica Cloud, follow these steps:
- 1. Verify Prerequisites
  - Ensure that the Secure Agent is installed on your machine.
  - Check that your system meets the hardware and software requirements.
- 2. Starting the Secure Agent (Windows)
  - Go to the Start Menu → Informatica Secure Agent → Start Secure Agent.
  - Alternatively, press Win + R, type services.msc, and hit Enter.
  - Locate Informatica Secure Agent in the list of services.
  - Right-click on it and select Start.
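How to Start Secure agent (continued)
- 2. Starting the Secure Agent (Linux)
  - Open a terminal window.
  - Navigate to the apps/agentcore directory under the Secure Agent installation directory (bash; the path below assumes the agent is installed under /opt/Informatica/secureagent):
      cd /opt/Informatica/secureagent/apps/agentcore
  - Start the agent with (bash):
      ./infaagent startup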
How to Start Secure agent (continued)
- 3. Check Agent Status
  - Windows: Open the Secure Agent application; the status shows as "Running".
  - Linux: From the apps/agentcore directory under the Secure Agent installation directory, run (bash):
      ./consoleAgentManager.sh getstatus
- 4. Verify in Informatica Cloud
  - Log in to Informatica Cloud.
  - Go to Administrator → Secure Agents.
  - The agent should show as "Online".
Create connections
- To create connections in Informatica Data Management Cloud (IDMC) as an administrator, follow these steps.
- Steps to Create Connections in IDMC
- Log in to IDMC
  - Go to Informatica Cloud and log in with your admin credentials.
- Navigate to the Administrator Console
  - In the top navigation bar, click on "Administrator".
  - Select "Connections" from the drop-down menu.
- Create a New Connection
  - Click on "New Connection".
  - Choose the connection type (e.g., Salesforce, Oracle, SQL Server, FTP, etc.).
Create connections (continued)
- Configure Connection Details
  - Name: Enter a descriptive name for the connection.
  - Description: Optional, but helpful for documentation.
  - Connection Properties: Fill in the required details, such as:
    - Host, Port, Username, Password (for databases)
    - API Keys, Tokens, or OAuth Details (for cloud apps)
    - Database Name, Schema, etc.
- Advanced Settings (Optional)
  - Configure additional settings like SSL certificates, timeout values, etc., if needed.
Create connections (continued)
- Test the Connection
  - Click "Test Connection" to ensure everything is set up correctly.
  - If successful, you'll see a confirmation message. If not, check the error details to troubleshoot.
- Save the Connection
  - Click "Save" to finalize the connection.
- Assign to Secure Agent (if needed)
  - If the connection requires on-premises data access, ensure the connection is assigned to the appropriate Secure Agent.
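When "Test Connection" fails for a database-style connection, one quick check is whether the host and port are reachable at all from the Secure Agent machine. The following is a minimal sketch using only the Python standard library. The helper name is_reachable is hypothetical and is not part of IDMC; it verifies TCP reachability only, not credentials or database settings.

```python
import socket


def is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the host name and attempts a TCP connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False
```

For example, calling is_reachable with your database host and listener port (values you supply) returns False when a firewall blocks the port, which narrows down the cause before you revisit the connection properties.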
Data profiling service
- Data Profiling Service in Informatica Data Management Cloud (IDMC) helps you understand the structure, quality, and content of your data. It provides insights into data patterns, inconsistencies, and anomalies, which are crucial for data governance, data quality, and analytics.
- Key Features of Data Profiling Service
- Column Profiling
  - Analyzes data at the column level to identify data types, null values, unique counts, patterns, and more.
- Data Quality Metrics
  - Measures data accuracy, completeness, consistency, and validity.
- Pattern Recognition
  - Detects patterns like phone numbers, email formats, ZIP codes, etc.
- Anomaly Detection
  - Identifies outliers or unusual data points.
- Data Relationship Discovery
  - Discovers relationships between tables or datasets.
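To make pattern recognition concrete, here is a minimal sketch of one common approach: replace each character with a class symbol and count how often each resulting pattern occurs. The function names and the symbols 9 (digit) and X (letter) are illustrative assumptions, not necessarily the notation or algorithm that IDMC uses.

```python
from collections import Counter


def to_pattern(value: str) -> str:
    """Map each character to a class symbol: 9 for digits, X for letters."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("X")
        else:
            out.append(ch)  # keep punctuation and spaces unchanged
    return "".join(out)


def pattern_frequencies(values):
    """Count how often each pattern occurs, skipping None values."""
    return Counter(to_pattern(v) for v in values if v is not None)


# Hypothetical sample column of phone numbers.
phones = ["555-123-4567", "555-987-6543", "5551234567", None, "n/a"]
freq = pattern_frequencies(phones)
print(freq.most_common())
```

A column where one pattern dominates and a few values fall into other patterns is a typical sign of inconsistent formatting.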
Data profiling service (continued)
- How to Use Data Profiling Service in IDMC
  - Log in to IDMC.
  - Go to "Data" → "Data Profiling".
  - Click "Create New Profile".
  - Select the connection (e.g., database, flat file, etc.) and the specific data objects to analyze.
  - Choose the profiling rules or configurations.
  - Click "Run" to start profiling.
  - Review the profiling reports for insights on data quality, patterns, and anomalies.
Data profiling service (continued)
- Common Use Cases
  - Data Quality Assessment: Identifying missing values, duplicates, or inconsistent data.
  - Data Migration Validation: Ensuring accuracy during system migrations.
  - Regulatory Compliance: Verifying data integrity for GDPR, HIPAA, etc.
  - ETL Optimization: Understanding data characteristics to improve data pipelines.
Create Data profile and execute it
- To create and execute a Data Profile in Informatica Data Management Cloud (IDMC), follow these steps.
- Step 1: Log In to IDMC
  - Go to Informatica Cloud and log in with your credentials.
- Step 2: Access Data Profiling
  - Click on the "Data" tab in the top navigation bar.
  - Select "Data Profiling" from the dropdown.
- Step 3: Create a New Data Profile
  - Click "New Profile" or "Create".
  - Provide a name for the data profile (e.g., "Customer Data Profile").
  - Add an optional description for future reference.
Create Data profile and execute it (continued)
- Step 4: Select Data Source
  - Choose the connection to your data source (e.g., SQL Server, Oracle, Flat File, etc.).
  - Select the specific table or file you want to profile.
- Step 5: Configure Profiling Settings
  - Choose the profiling rules you want to apply:
    - Column Profiling: Analyzes data types, null values, unique counts.
    - Data Quality Checks: Looks for duplicates, patterns, etc.
    - Relationship Discovery: Identifies data relationships.
  - You can customize the settings as needed.
- Step 6: Run the Data Profile
  - Click "Run" or "Execute" to start the profiling process.
  - The system will analyze the data and generate reports.
Create Data profile and execute it (continued)
- Step 7: Review the Results
  - Once the profile is complete, go to the "Results" tab.
  - Review metrics like:
    - Data distribution
    - Data quality scores
    - Anomalies or outliers detected
  - You can export the report if needed.
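When reviewing outliers in an exported report, it can help to know how a value might be flagged as unusual. The sketch below applies the interquartile range (IQR) rule, a widely used statistical heuristic. It is shown only as an illustration and is not claimed to be the method IDMC uses; the function name iqr_outliers and the sample data are hypothetical.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles."""
    # quantiles(n=4) returns the three cut points Q1, Q2 (median), Q3.
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical order quantities with one suspicious entry.
quantities = [10, 12, 11, 13, 12, 11, 95]
print(iqr_outliers(quantities))
```

The multiplier k = 1.5 is the conventional default; a larger value flags fewer points.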
Explain data profiling metrics
- Data Profiling Metrics are key indicators that help you assess the quality, structure, and content of your data. They provide insights into data consistency, completeness, uniqueness, and more, enabling informed decisions for data governance, analytics, and quality management.
- Here is a breakdown of the common data profiling metrics.
Explain data profiling metrics (continued)
- 1. Column-Level Metrics
- These metrics provide insights into individual columns within your dataset:
  - Data Type: Identifies the type of data (e.g., integer, string, date).
  - Distinct Count: Number of unique values in the column.
  - Null Count: Number of null or missing values.
  - Null Percentage: Percentage of nulls compared to the total number of rows.
  - Max/Min Values: Identifies the highest and lowest values.
  - Data Length: Length of text data or number of characters.
  - Data Patterns: Detects patterns like email formats, phone numbers, ZIP codes.
  - Value Distribution: Shows the frequency distribution of values.
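The column-level metrics listed above can be reproduced by hand on a small sample, which is a useful way to understand what a profile reports. The following is a minimal sketch in plain Python, not IDMC's implementation. The function name profile_column is hypothetical, and treating both None and empty strings as null is an assumption of this sketch.

```python
from collections import Counter


def profile_column(values):
    """Compute basic column-level metrics for a list of values."""
    total = len(values)
    # Assumption: None and "" both count as null.
    non_null = [v for v in values if v is not None and v != ""]
    null_count = total - len(non_null)
    lengths = [len(str(v)) for v in non_null]
    types = Counter(type(v).__name__ for v in non_null)
    return {
        "data_type": types.most_common(1)[0][0] if types else None,
        "distinct_count": len(set(non_null)),
        "null_count": null_count,
        "null_percentage": 100.0 * null_count / total if total else 0.0,
        "min_value": min(non_null) if non_null else None,
        "max_value": max(non_null) if non_null else None,
        "min_length": min(lengths) if lengths else None,
        "max_length": max(lengths) if lengths else None,
        "value_distribution": Counter(non_null),
    }


# Hypothetical "age" column with two missing values.
ages = [34, 27, None, 34, 51, None, 27, 34]
metrics = profile_column(ages)
for name, value in metrics.items():
    print(f"{name}: {value}")
```

Here Null Percentage is the null count divided by the total row count, so two nulls in eight rows give 25 percent.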
Explain data profiling metrics (continued)
- 2. Data Quality Metrics
- These metrics assess how well the data meets business requirements:
  - Completeness: Measures if all required data fields are populated.
  - Uniqueness: Identifies duplicate records or values.
  - Consistency: Checks if data values are consistent across different datasets.
  - Validity: Ensures data adheres to defined formats, rules, and standards.
  - Accuracy: Compares data to a trusted source to verify correctness.
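Three of these quality metrics (completeness, uniqueness, validity) can be expressed as simple percentages over a single dataset. The sketch below illustrates that idea under stated assumptions: the function names, the sample records, and the deliberately simple email rule are all hypothetical and are not IDMC's definitions or scoring. Consistency and accuracy are left out because they require a second dataset or a trusted reference source.

```python
import re

# Deliberately simple email rule for illustration; real rules are stricter.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def _populated(records, field):
    """Return the non-null, non-empty values of a field."""
    return [r[field] for r in records if r.get(field) not in (None, "")]


def completeness(records, field):
    """Percentage of records in which the field is populated."""
    if not records:
        return 0.0
    return 100.0 * len(_populated(records, field)) / len(records)


def uniqueness(records, field):
    """Percentage of populated values that are distinct."""
    vals = _populated(records, field)
    return 100.0 * len(set(vals)) / len(vals) if vals else 0.0


def validity(records, field, pattern):
    """Percentage of populated values that match the given regex."""
    vals = _populated(records, field)
    if not vals:
        return 0.0
    return 100.0 * sum(1 for v in vals if pattern.match(v)) / len(vals)


# Hypothetical customer records.
customers = [
    {"id": 1, "email": "ann@example.com"},
    {"id": 2, "email": "bob@example"},
    {"id": 3, "email": None},
    {"id": 4, "email": "ann@example.com"},
]
print(completeness(customers, "email"))
print(uniqueness(customers, "email"))
print(validity(customers, "email", EMAIL_RE))
```

In this sample, one missing email lowers completeness, the repeated address lowers uniqueness, and the address without a domain suffix lowers validity.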
Thank You!
References: https://informatica.com