Checklist For Best Practices in Powercenter

Hexaware Technologies Ltd
Best Practices in
Powercenter
A Checklist for Tuning Performance
Xitiz Pugalia
2014
Hexaware Technologies Ltd.,

Plot No. H5, SIPCOT IT PARK,
Navallur Post, Siruseri, Chennai 603103 (India)
Contents
Optimizing Flat File Targets........................................................................................................................... 2
Optimizing Flat File Sources .......................................................................................................................... 2
Optimizing the Source................................................................................................................................... 2
Optimizing Mappings Overview .................................................................................................................... 2
Optimizing Datatype Conversions................................................................................................................. 3
Optimizing Expressions ................................................................................................................................. 3
Optimizing Aggregator Transformations ...................................................................................................... 3
Optimizing Custom Transformations ............................................................................................................ 3
Optimizing Joiner Transformations............................................................................................................... 3
Optimizing Lookup Transformations............................................................................................................. 3
Optimizing Normalizer Transformation ........................................................................................................ 4
Optimizing Sequence Generator Transformations ....................................................................................... 4
Optimizing Sorter Transformations .............................................................................................................. 4
Optimizing Source Qualifier Transformations .............................................................................................. 4
Optimizing SQL Transformations .................................................................................................................. 4
Eliminating Transformation Errors................................................................................................................ 4
Optimize the Sessions ................................................................................................................................... 4
Optimize Grid Deployment ........................................................................................................................... 5
Optimizing the Powercenter Components ................................................................................................... 5
Optimize the Systems ................................................................................................................................... 5
Other Points to be Checked .......................................................................................................................... 5
Optimizing Flat File Targets

If you use a shared storage directory for flat file targets, ensure that the shared storage
directory is on a machine that is dedicated to storing and managing files, instead of performing other
tasks
Drop Indexes and Key Constraints before running the session
Use constraint-based loading only if necessary
Increasing Database Checkpoint Intervals
Using Bulk Loads for DB2, Sybase ASE, Oracle, or Microsoft SQL Server database
Using External Loaders for IBM DB2 EE or EEE, Oracle, Sybase IQ or Teradata
When bulk loading to Microsoft SQL Server or Oracle targets, define a large commit interval
Minimizing Deadlocks: increase the number of target connection groups the Integration
Service uses to write to the targets in a session
Increasing Database Network Packet Size
Optimizing Oracle Target Databases
Optimizing Flat File Sources

If each line in the source file is less than the default setting (1024 Bytes per line), you can
decrease the line sequential buffer length in the session properties
Configuring Single-Pass Reading: Consider using single-pass reading if you have multiple
sessions that use the same sources
Optimizing Filters: Use Filter Conditions as early in the mapping as possible
Optimizing the Source

Optimizing the Query: Single table select statements with an ORDER BY or GROUP BY clause
may benefit from optimization such as adding indexes
Using Conditional Filters: if multiple sessions read from the same source simultaneously, the
Powercenter conditional filter may improve performance
Increasing Database Network Packet Size in Oracle (listener.ora and tnsnames.ora)
Connecting to Oracle Database Sources: Use IPC protocol to connect to the Oracle database
Using Teradata FastExport (use FastExport reader instead of Relational reader)
In Sybase or Microsoft SQL Server Tables, use tempdb as an in-memory database to allocate
sufficient memory
Optimizing Mappings Overview

Reduce the number of transformations in the mapping
Delete unnecessary links between transformations
Use Expression Transformations to do the most amount of work possible
Optimizing Datatype Conversions

Use integer values in place of other datatypes when performing comparisons using Lookup
and Filter transformations
Convert the source dates to strings through port-to-port conversions to increase session
performance
Optimizing Expressions
When possible, isolate slow expressions and simplify them
Choose Numeric fields in place of strings as much as possible
Choose Decode instead of Lookup as much as possible
Use Operators instead of Functions
Optimizing Aggregator Transformations

Group by simple columns
Use sorted input
Use incremental aggregation
Filter data before you aggregate it
Limit port connections
Optimizing Custom Transformations

Decrease the number of function calls the Integration Service and procedure make
Increase the locality of memory access space for the data
Write procedure code to perform an algorithm on a block of data instead of each row of data
Optimizing Joiner Transformations

Designate the master source as the source with fewer duplicate key values
Designate the master source as the source with fewer rows
Perform joins in a database when possible
Join sorted data when possible
Optimizing Lookup Transformations

Use the optimal database driver
Cache lookup tables
Optimize the lookup condition
Filter lookup rows
Index the lookup table
Optimize multiple lookups
Create a pipeline Lookup transformation and configure partitions in the pipeline that builds
the lookup source
Optimizing Normalizer Transformation

Place the Normalizer Transformation as close to the target as possible
Optimizing Sequence Generator Transformations

Create a reusable Sequence Generator and using it in multiple mappings simultaneously
Consider configuring the Number of Cached Values to a value greater than 1,000
If you do not have to cache values, set the Number of Cache Values to 0
Optimizing Sorter Transformations

Allocate enough memory to sort the data.
Specify a different work directory for each partition in the Sorter transformation
Run the Powercenter Integration Service in ASCII mode
Optimizing Source Qualifier Transformations

Use the Select Distinct option for the Source Qualifier transformation if you want the
Integration Service to select unique values from a source
Optimizing SQL Transformations

SQL Transformations used in Query mode instead of Script Mode
Eliminating Transformation Errors

Set Error Threshold to an appropriate value
If there are multiple errors in a session, set tracing level to low (ex-Terse)
Check Session log to check Transformation Errors
Optimize the Sessions
Use a Grid to improve session / workflow performance

Use Pushdown Optimization
Run Concurrent Sessions and Workflows
Buffer Memory: Adjust DTM Buffer Size & Default Buffer Block Size
Caches: Analyze Transformation Caches & configure them accordingly
Target-Based Commit: Choose appropriately
Real-time Processing: See if Source Based Commit can be used
Remove Staging Areas as much as possible
Generate less or binary Log Files
Choose Appropriate Error Tracing

Post-Session Emails: Set flat file Logging for Post Session Emails
Optimize Grid Deployment

Add nodes to the grid
Increase storage capacity and bandwidth
Use shared file systems
Use a high-throughput network
Optimizing the Powercenter Components
Optimizing Powercenter Repository Performance:

Ensure the Powercenter repository is on the same machine as the Repository Service process
Order conditions in object queries
Use a single-node tablespace for the Powercenter repository if you install it on a DB2 db
Optimize the database schema for the Powercenter repository if you install it on a DB2 or
Microsoft SQL Server database
Optimizing Integration Service Performance:

Use native drivers instead of ODBC drivers for the Integration Service
Run Int Service in ASCII data movement mode if character data is 7-bit ASCII or EBCDIC
Cache Powercenter metadata for the Repository Service
Run Integration Service with high availability
Optimize the Systems

Improve network speed
Use multiple CPUs
Reduce paging
Use processor binding
Use Local Disk if possible: A local disk can move data 5 to 20 times faster than a network
Minimize the number of network hops between the source and target databases and the
Integration
Move Target to a Server System if possible
Ask Network Engineer to provide enough Bandwidth
Use Multiple CPUs
Reduce Paging
Use Processor Binding
Other Points to be Checked
Check if the mapping has multiple targets. If yes, could each target be loaded in parallel?
Check if it is possible to break up the Mapping logic?
Can the Mapping logic be placed in to Mapplet?
Are the target tables having constraints?
Does a parent need to be loaded first? Check for PK FK Relationship
Naming convention for mapping , workflow, session, transformations
SCD type 1 / 2 followed
Check if DTM buffer size is auto or manually set
Use Decode instead of nested IIF functions
Use of Variables or Parameters ($Source / $ DBConnection_Src) for Connection of Source /

Target Tables
Use of Unconnected Lookups wherever only 1 value is to be returned by Lookup Table
Using Dynamic Lookups for SCD implementations
Use of Shortcuts in case of multiple folders
Re-usable transformations
Memory Utilization - Source Bottleneck (can be checked only by execution of mapping)
Use of Re-usable sessions
Memory Utilization - Target Bottleneck (can be checked only by execution of mapping)
Memory Utilization - Transformation Bottleneck (can be checked only by execution of mapping)
Different Commit Type & Commit Interval (from Session Properties, default is Target Type ,
10000)
Column Precision Too high / Implicit Data type Conversion

Checklist For Best Practices in Powercenter

Diunggah oleh

Informasi Dokumen

Judul Asli

Hak Cipta

Format Tersedia

Bagikan dokumen Ini

Bagikan atau Tanam Dokumen

Opsi Berbagi

Apakah menurut Anda dokumen ini bermanfaat?

Apakah konten ini tidak pantas?

Hak Cipta:

Format Tersedia

Checklist For Best Practices in Powercenter

Diunggah oleh

Hak Cipta:

Format Tersedia

Hexaware Technologies Ltd

Hexaware Technologies Ltd.,

Optimizing Flat File Targets

Optimizing Flat File Sources

Optimizing the Source

Optimizing Mappings Overview

Optimizing Datatype Conversions

Optimizing Aggregator Transformations

Optimizing Custom Transformations

Optimizing Joiner Transformations

Optimizing Lookup Transformations

Optimizing Normalizer Transformation

Optimizing Sequence Generator Transformations

Optimizing Sorter Transformations

Optimizing Source Qualifier Transformations

Optimizing SQL Transformations

Eliminating Transformation Errors

Optimize the Sessions

Use a Grid to improve session / workflow performance

Choose Appropriate Error Tracing

Optimize Grid Deployment

Optimizing the Powercenter Components

Optimizing Powercenter Repository Performance:

Optimizing Integration Service Performance:

Optimize the Systems

Other Points to be Checked

Check if it is possible to break up the Mapping logic?

Can the Mapping logic be placed in to Mapplet?

Are the target tables having constraints?

Does a parent need to be loaded first? Check for PK FK Relationship

Naming convention for mapping , workflow, session, transformations

SCD type 1 / 2 followed

Check if DTM buffer size is auto or manually set

Use Decode instead of nested IIF functions

Use of Variables or Parameters ($Source / $ DBConnection_Src) for Connection of Source /

Use of Unconnected Lookups wherever only 1 value is to be returned by Lookup Table

Using Dynamic Lookups for SCD implementations

Use of Shortcuts in case of multiple folders

Memory Utilization - Source Bottleneck (can be checked only by execution of mapping)

Use of Re-usable sessions

Memory Utilization - Target Bottleneck (can be checked only by execution of mapping)

Memory Utilization - Transformation Bottleneck (can be checked only by execution of mapping)

Column Precision Too high / Implicit Data type Conversion

Anda mungkin juga menyukai