Industry research and real-world experience both show that 80% of the work in a Big Data project involves data integration and data quality. Informatica software includes the broadest set of data integration and data quality capabilities available on Hadoop, delivering up to fivefold productivity gains so that you can turn more data into more accurate, insightful analysis in less time.

PowerCenter Big Data Edition

Informatica PowerCenter Big Data Edition delivers up to five times the productivity by letting your developers integrate almost any type of data at any scale, without having to learn Hadoop.

Key Features

  • A visual development environment dramatically increases productivity by eliminating hand coding
  • Hundreds of high-speed connectors and pre-built transformations integrate all types of data
  • More than 100,000 trained Informatica developers worldwide simplify Big Data project staffing
  • Informatica's “Map Once, Deploy Anywhere” Vibe™ technology ensures that new data types and technologies don't slow you down

HParser

HParser is an easy-to-use, codeless data parsing and transformation environment, optimized to process any file format natively on Hadoop efficiently and at scale.

Key Features

  • Pre-built parsers address a wide range of data sources, including logs, industry standards, documents, and binary or hierarchical data
  • A visual development environment for creating custom parsers eliminates cumbersome parsing logic development and testing

Data Quality Big Data Edition

Powered by the Vibe virtual data machine, Informatica Data Quality Big Data Edition delivers authoritative and trustworthy data of any type and volume using pre-built data quality rules processed natively on Hadoop.

Key Features

  • A visual development environment increases productivity up to five times over hand coding
  • Pre-built data quality and data matching rules cleanse and identify duplicate customer data
  • Vibe lets you design data quality rules once and deploy on both Hadoop and traditional platforms
  • Natural language processing allows entity extraction and data classification
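
As a rough illustration of what a duplicate-matching rule does, the sketch below flags customer records whose normalized name and city are nearly identical. It is a generic Python example, not Informatica's implementation or API: the field names, the 0.85 threshold, and the `find_duplicates` helper are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(record):
    """Lowercase and collapse whitespace so trivial differences do not block a match."""
    return " ".join(f"{record['name']} {record['city']}".lower().split())


def find_duplicates(records, threshold=0.85):
    """Return pairs of record ids whose normalized text is at least `threshold` similar.

    The threshold is an assumed value; a real rule would be tuned on sample data.
    """
    pairs = []
    for a, b in combinations(records, 2):
        score = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
        if score >= threshold:
            pairs.append((a["id"], b["id"]))
    return pairs


# Hypothetical sample data: records 1 and 2 describe the same customer.
customers = [
    {"id": 1, "name": "Jon Smith", "city": "Austin"},
    {"id": 2, "name": "John  Smith", "city": "austin"},
    {"id": 3, "name": "Maria Garcia", "city": "Denver"},
]
duplicate_pairs = find_duplicates(customers)
```

Comparing every pair is fine for a small sample but grows quadratically, which is one reason matching at Big Data volumes is usually pushed down to a distributed platform such as Hadoop.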

Vibe Data Stream for Machine Data

Based on Informatica’s established high-performance messaging technology, Informatica Vibe Data Stream provides highly available, reliable, real-time streaming data collection for Big Data analytics, operational intelligence, and traditional enterprise data warehousing.

Key Features

  • Real-time data collection works at high volume and high speed across a wide variety of data sources, over both local and wide area networks
  • Adaptable architecture enables one-to-one, one-to-many, many-to-one, and many-to-many connections
  • Vibe delivers directly to targets for either stream or batch processing

Data Masking

Informatica Data Masking delivers policy-based data security for applications running on Hadoop and other Big Data platforms. It minimizes the risk of exposing sensitive data as it is stored and analyzed in Big Data projects, and it ensures compliance with data privacy mandates and regulations.

Key Features

  • Dynamic Data Masking policies require no customization or coding to protect data within Hadoop and other Big Data platforms
  • Existing access control policies govern data masking rules for sensitive data elements
  • Authorization policies limit unmasked data access to privileged users
  • Persistent Data Masking rules protect sensitive data to reduce risk of data breaches in nonproduction environments
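
The idea behind dynamic masking with authorization policies can be sketched in a few lines: sensitive fields are masked at read time unless the caller holds a privileged role, and the stored data is never altered. This is a generic Python illustration, not Informatica's product API; the field names, role names, and helper functions are hypothetical.

```python
SENSITIVE_FIELDS = {"ssn", "card_number"}   # assumed policy: fields to mask
PRIVILEGED_ROLES = {"compliance_officer"}   # assumed roles allowed unmasked access


def mask_value(value, visible=4):
    """Replace all but the last `visible` characters with asterisks."""
    text = str(value)
    if len(text) <= visible:
        return "*" * len(text)
    return "*" * (len(text) - visible) + text[-visible:]


def read_record(record, role):
    """Return a copy of the record as the given role is allowed to see it."""
    if role in PRIVILEGED_ROLES:
        return dict(record)
    return {
        key: mask_value(val) if key in SENSITIVE_FIELDS else val
        for key, val in record.items()
    }


# Hypothetical record; the stored row itself is left unchanged by either read.
row = {"name": "Maria Garcia", "ssn": "123-45-6789"}
analyst_view = read_record(row, "analyst")
officer_view = read_record(row, "compliance_officer")
```

Persistent masking differs in that the masked values are written out once, for example when copying production data into a nonproduction environment, rather than being computed on every read.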

MDM

Informatica MDM enriches master data with Big Data details like social insight, mobile geo-location information, and real-time transaction signals. The resulting multi-domain view of customers, products, and relationships improves operations and deepens customer understanding.

Key Features

  • Flexible multi-domain data model adapts to your unique business requirements
  • Reusable business rules accelerate and streamline MDM, data integration, and data quality projects
  • Multi-domain approach expands beyond a single data domain, use case, or region to increase agility and accommodate both current and future business needs

Data Replication

Informatica Data Replication is highly scalable, reliable, database-agnostic transaction replication software that operates in real time without disrupting the performance of operational source systems. It gives your reporting and analytics systems continuous access to the freshest data by replicating entire schemas, subsets of schemas, or changed data at high speeds directly into Hadoop, traditional databases, and data warehouses.

Key Features

  • Data can be aggregated with Hadoop before it is loaded into the data warehouse
  • Relevant transactional data can be combined with unstructured data for analytics
  • Replication technology is simple to use, easy to configure, and quick to deploy for faster time to value
  • A single noninvasive technology captures and delivers data across heterogeneous data stores for lower costs

Data Archive

Informatica Data Archive is highly scalable, full-featured smart partitioning and data archiving software. It archives inactive data and legacy applications in a compressed but readily accessible form, significantly improving performance and lowering risk while cost-effectively managing data growth in a range of enterprise business applications.

Key Features

  • Smart partitioning to significantly improve application performance
  • Database and unstructured data archiving to improve maintenance efficiencies and lower costs
  • Fine-grain data restoration to production environments
  • Pre-built accelerators to speed implementations for popular packaged applications
  • Integrated development environment for custom or modified rules
  • Secure, highly compressed, immutable archive file to support compliance, retention, and access requirements