Data Profiling Option

Assessing the Initial and Ongoing Quality of All Enterprise Data

The Data Profiling Option extends PowerCenter's ability to provide comprehensive, accurate information about the content, quality, and structure of data in virtually any operational system. Fully integrated into PowerCenter, these data profiling capabilities automatically leverage the platform's unified development environment, performance, metadata capabilities, and universal data access. With this option, developers can automatically assess the initial and ongoing structure and quality of data regardless of its location or type.

Key Features

Data Profiling

  • Reduces the time it takes for developers to assess the structure, content, and quality of data by providing a single interface for the entire profiling process
  • Instills confidence in data by automatically and accurately profiling any data accessible to PowerCenter
  • Provides the choice of scanning every single data record or using algorithmic sampling of source systems to deliver accurate assessments
  • Enables developers to create custom profiling rules that validate data and ensure, on an ongoing basis, that transformed data continues to meet the rule condition
  • Displays data profiling results immediately through an interactive profiling mode and provides ongoing data profiling and quality metrics through a batch mode
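
As a rough illustration of the full-scan versus sampling choice and of a custom profiling rule, the sketch below uses plain Python rather than PowerCenter itself. Everything in it is an assumption made for illustration: records are modelled as dicts, `reservoir_sample`, `rule_pass_rate`, and `valid_zip` are hypothetical names, and reservoir sampling stands in for whatever sampling algorithm the product actually uses.

```python
import random
from typing import Callable, Iterable

Row = dict  # assumption: a record is modelled as a plain dict


def reservoir_sample(rows: Iterable[Row], k: int, seed: int = 0) -> list[Row]:
    """Keep a uniform random sample of k rows from a stream of unknown length."""
    rng = random.Random(seed)
    sample: list[Row] = []
    for i, row in enumerate(rows):
        if i < k:
            sample.append(row)          # fill the reservoir first
        else:
            j = rng.randint(0, i)       # inclusive on both ends
            if j < k:
                sample[j] = row         # replace with probability k / (i + 1)
    return sample


def rule_pass_rate(rows: Iterable[Row], rule: Callable[[Row], bool]) -> float:
    """Fraction of rows that satisfy a custom rule (1.0 for an empty input)."""
    rows = list(rows)
    if not rows:
        return 1.0
    return sum(1 for r in rows if rule(r)) / len(rows)


def valid_zip(row: Row) -> bool:
    """Hypothetical custom rule: 'zip' must be a five-digit string."""
    z = row.get("zip")
    return isinstance(z, str) and len(z) == 5 and z.isdigit()


# Hypothetical source data: every tenth record carries an invalid zip value.
data = [{"zip": f"{i:05d}" if i % 10 else "n/a"} for i in range(1000)]

full_rate = rule_pass_rate(data, valid_zip)                            # every record
sample_rate = rule_pass_rate(reservoir_sample(data, 100), valid_zip)   # 100 sampled records
```

The full scan gives the exact pass rate, while the sample gives an estimate at a fraction of the read cost; re-running the same rule on a schedule is one way to picture the ongoing, batch-style measurement described above.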

Full Integration Across Entire Data Integration Platform

  • Leverages PowerCenter’s performance, scalability, and comprehensive data access
  • Automatically harnesses all of PowerCenter’s sophisticated parallelization and grid capabilities
  • Empowers developers to extend profiling capabilities by using PowerCenter mapplets to support more complex scenarios

Benefits

Reduce Initial and Ongoing Data Quality Assessment Time
With this option, developers can streamline data profiling efforts by using a wizard-driven, automated interface to conduct the initial profiling assessment of data content, structure, and quality. Users can easily re-run the resulting data profile mappings to continually measure quality improvement and analyze changes to source data over time. IT departments can be more productive because they use the same user interface for the entire process.

Lower Development Costs by Automating Data Discovery
PowerCenter is unique because powerful data profiling capabilities are embedded within the same environment used for developing data integration solutions, which reduces both developer ramp-up time and development costs. By automating the process of data discovery, IT teams can avoid the labor-intensive, manual work of creating one-off queries or scripts to assess source data structure or quality.

Accelerate Time to Delivery by Reducing Risk and Rework in Data Integration Projects
Organizations can automatically profile any data accessible to PowerCenter, eliminating upfront assumptions and guesswork by building detailed statistics on the state of the actual data. Data integration projects can be delivered to specification and on time because developers can scope out potential data quality issues faster and with much greater accuracy than with the traditional method of hand-coded queries.
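
To make "detailed statistics on the state of the actual data" concrete, here is a minimal sketch of the kind of per-column summary a profile might contain. It is not PowerCenter output or its API: `profile_column` is a hypothetical helper, the chosen statistics are an assumption, and the non-null values in a column are assumed to be mutually comparable.

```python
from collections import Counter


def profile_column(values):
    """Summarize one column: row count, nulls, distinct values, range, mode."""
    values = list(values)
    # Assumption: None and the empty string both count as missing.
    non_null = [v for v in values if v is not None and v != ""]
    counts = Counter(non_null)
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(counts),
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "most_common": counts.most_common(1)[0][0] if counts else None,
    }


# Hypothetical column of state codes with two missing entries.
stats = profile_column(["CA", "NY", None, "CA", ""])
```

A summary like this replaces the one-off queries a developer would otherwise write by hand for each column, and the same function can be re-run as the source data changes.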

Product Literature

Data Profiling Option Datasheet
PowerCenter Brochure
Data Quality White Paper