PowerCenter Partitioning Option
Delivering High Performance for Processing Massive Data Volumes
The Partitioning Option increases PowerCenter's performance through parallel data processing, and has been instrumental in establishing PowerCenter's industry performance leadership. This option provides a thread-based architecture and automatic data partitioning that optimizes parallel processing on multiprocessor and grid-based hardware environments.
Key Features
Data Smart Parallelism
- Automatically aligns PowerCenter partitions with database table partitions to improve performance
- Automatically guarantees data integrity by leveraging PowerCenter’s parallel engine that dynamically realigns data partitions for set-oriented transformations
Session Design Tools
- Creates user-defined partitioning schemes quickly and easily
- Provides a graphical partitioning map for determining the best partitioning points
- Gathers statistics on configurable session options, such as error handling, recovery strategy, memory allocation, and logging, to maximize performance
Integrated Monitoring Console
- Gathers session statistics, such as throughput, rows/second, error details, and performance optimizations, to identify potential bottlenecks and recognize trends
- Shows all session execution and dependency details
Multiple Partition Schemes
- Supports parallelization through multiple mechanisms including key range, hash algorithm-based, round robin, or file partitions
- Supports parallelization through concurrent processing of specified partitions along the data transformation pipeline to maximize data throughput
Benefits
Scales Cost-Effectively to Handle Large Data Volumes With the Partitioning Option you can execute optimal parallel sessions by dividing data processing into subsets that are run in parallel and spread among available CPUs in a multiprocessor system. When different processors share the computational load, large data volumes can be processed faster. When sourcing and targeting relational databases, the Partitioning Option enables PowerCenter to automatically align its partitions with database table partitions to improve performance. Unlike approaches that require manual data partitioning, data integrity is automatically guaranteed because PowerCenter’s parallel engine dynamically realigns data partitions for set-oriented transformations (e.g., aggregators or sorters).
Enhances Developer Productivity The Partitioning Option provides intuitive, GUI-based, session design tools that reduce the time spent on initial and ongoing configuration and performance tuning tasks. You can easily create user-defined partitioning schemes. A graphical partitioning map helps you determine the best points of partitioning. Configurable session options, such as error handling, recovery strategy, memory allocation, and logging, make it easier to gather statistics used to maximize performance.
Optimizes System Performance in Response to Changing Business Requirements The Partitioning Option lets you easily gather in-depth session statistics such as throughput, rows/second, error details, and performance optimizations. These statistics help you identify potential bottlenecks and recognize trends. An integrated monitoring console lets you view all session execution and dependency details. With PowerCenter’s metadata-driven architecture, data transformation logic is abstracted from the physical execution plan. Thus, rapid performance tuning is possible without compromising the logic and design of the original data mappings. You can continually and easily optimize system performance in the face of increasing data loads and changing business requirements.
 |
“... PowerCenter helps us achieve greater productivity, drive down costs, minimize operational risks and accelerate the time-to-market for new data integration-intensive enterprise applications.”
- Avram Kornberg Senior Vice President and CIO Natexis USA
read the full story
| |
Product Literature
Partitioning Option Datasheet PowerCenter Brochure
|
 |
|