Informatica PowerCenter Big Data Edition is highly scalable, high-performance enterprise data integration software. Its visual development environment lets developers build ETL data flows that run natively on Hadoop — without having to learn Hadoop.
Up to 5x Productivity
Increase developer productivity on Hadoop up to five times over hand-coding through the visual Informatica development environment. Easily reuse data flows and collaborate with other developers and analysts with a common integrated development environment (IDE).
Universal Data Access
Your IT team can access all types of big transaction data, including RDBMS, OLTP, OLAP, ERP, CRM, mainframe, cloud, and others. You can also access all types of big interaction data, including social media data, log files, machine sensor data, Web sites, blogs, documents, emails, and other unstructured or multi-structured data.
High-Speed Data Ingestion and Extraction
You can access, load, transform, and extract Big Data between source and target systems or directly into Hadoop, HBase, or your data warehouse. High-performance connectivity through native APIs to source and target systems with parallel processing ensures high-speed data ingestion and extraction.
Your IT organization can process all types of data at any scale—from terabytes to petabytes—with no specialized coding on distributed computing platforms like Hadoop.
Optimized Performance for Lowest Cost
Based on data volumes, data type, latency requirements, and available hardware, PowerCenter Big Data Edition deploys Big Data processing on the highest performance and most cost-effective data processing platforms. You get the most out of your current investments and capacity whether you deploy data processing on SMP machines, traditional grid clusters, distributed computing platforms like Hadoop, or data warehouse appliances.
ETL on Hadoop
This edition provides an extensive library of prebuilt transformation capabilities on Hadoop, including data type conversions and string manipulations, high-performance cache-enabled lookups, filters, joiners, sorters, routers, aggregations, and many more. Your IT team can rapidly develop data flows on Hadoop using a codeless graphical development environment to increase productivity and promote reuse.
Profiling on Hadoop
Data on Hadoop can be profiled through the Informatica developer tool and a browser-based analyst tool. This makes it easy for developers, analysts, and data scientists to understand the data, identify data quality issues earlier, collaborate on data flow specifications, and validate mapping transformation and rules logic.
Design Once and Deploy Anywhere
ETL developers can focus on data and transformation logic without having to worry where the ETL process is deployed—on Hadoop or traditional data processing platforms. With Informatica Vibe, a virtual data machine, developers can design once, without any specialized knowledge of Hadoop concepts and languages, and easily deploy data flows on Hadoop or traditional systems.
Complex Data Parsing on Hadoop
This edition makes it easy to access and parse complex, multi-structured, unstructured, and industry standard data such as Web logs, JSON, XML, and machine device data. Optional parsers are offered for market data and industry standards like FIX, SWIFT, ACORD, HL7, HIPAA, and EDI.
Entity Extraction and Data Classification on Hadoop
Using natural language processing (NLP) and a list of keywords or phrases, entities related to your customers and products can be easily extracted and classified from unstructured data such as emails, social media data, and documents. You can enrich master data with insights into customer behavior or product information such as competitive pricing.
Your IT team can easily coordinate, schedule, monitor, and manage all interrelated processes and workflows across traditional and Hadoop environments to simplify operations and enable drilling down into individual Hadoop jobs while meeting SLAs.
This edition provides 24x7 high availability with seamless failover, flexible recovery, and connection resilience. When it comes time to develop new products and services using Big Data insights, you can rest assured that they are scalable and available 24x7 for mission-critical operations.
Key Benefits of Informatica PowerCenter Big Data Edition
Helps you bring innovative products and services to market faster and improve business operations
Lowers Big Data project costs while handling growing data volumes and complexity
Expands Hadoop adoption across the enterprise to realize performance and cost benefits
Represents proven data integration software, so the risk of adopting new technologies is minimized