Big Data Edition

Safely, efficiently integrate any type of data on Hadoop without ever having to learn Hadoop.

Free trials

Key Features

Faster to value, faster to staff, faster to integrate, faster to trust, faster to innovate, faster to deploy.

Universal data access

Access all types of data including transactions, applications, databases, log files, social, machine, and sensor data

High-speed data ingestion and extraction

Move data between source systems, Hadoop, and target applications using high-performance connectivity

Real-time data collection and streaming

Collect log files and machine and sensor data in real time and reliably stream data at scale directly into Hadoop

Data integration on Hadoop

Access an extensive library of prebuilt transformation capabilities on Hadoop via a visual development environment

Data profiling on Hadoop

Profile data on Hadoop to understand the data, identify data quality issues, and collaborate on data pipelines

Data discovery on Hadoop

Automate the discovery of data domains and relationships on Hadoop such as sensitive data that needs to be protected

Data quality on Hadoop

Scrub, standardize, and enrich data on Hadoop with an extensive set of data quality rules including address validation

Natural language processing on Hadoop

Use natural language processing to identify and classify entities in social media and text files

Complex data parsing on Hadoop

Parse complex, multi-structured, unstructured, and industry standard data on Hadoop using pre-built parsers, or easily create your own

End-to-end data lineage

Provides complete transparency with end-to-end data lineage of all data movement from source data, through Hadoop, to target applications

Design once and deploy faster

Preserves transformation logic so you can build data pipelines once and speed deployments as Hadoop continues to change

2014 Gartner Magic Quadrant for Data Integration Tools

Download the report

2014 Gartner Magic Quadrant for Data Quality Tools

Download the report

Editions of Big Data Edition

Feature List

Standard

Governance

Data Integration Transforms on Hadoop

Data Quality Transforms on Hadoop

 

Data Profiling on Hadoop

Column, Rule, Join Validation, Mapping Generation from Profile, Midstream, Comparative Profiling and Scorecarding

Complex Data Parsing (Big Data Parser)

Restricted to logs, XML, JSON, custom/proprietary data formats

End-to-End Data Lineage

 

Restricted to supporting Big Data Edition

Business Glossary

 

Restricted to supporting Big Data Edition

Natural Language Processing (NLP) Transforms on Hadoop

Address Validation Transforms on Hadoop

 

Data Domain Discovery on Hadoop

 

Data Masking Transforms on Hadoop (Limited)

Real-Time Data Collection and Streaming (Vibe Data Stream)

Restricted to HDFS targets and 100 GB daily source data volume

Restricted to HDFS targets and 100 GB daily source data volume

High-Speed Data Ingestion

Database Connectivity

Hadoop Connectivity

HBase Connectivity

Social Media Connectivity

Unlimited Data Types

Unlimited Data Types

Ten (10) Informatica Data Analyst Named Users

Support (included with subscription license only)

8 x 5

24 x 7

Related Products & Solutions

Big Data Parser

Provides pre-built parsers on Hadoop for a variety of industry standards, documents, log files, and complex file formats.

Big Data Relationship Management

Discovers relationships among parties and groups them to create a 360-degree view.

PowerCenter

The industry's only fully integrated, end-to-end, agile data integration platform.

Vibe Data Stream

Built on fast brokerless messaging technology that helps you manage many small pieces of incoming streaming data.

Customer Success Stories

Western Union

Western Union built a data platform based on Hadoop and Informatica Big Data Edition

UPMC

UPMC used a collection of Informatica products to improve research outcomes in the quest to cure various diseases

BNY Mellon

BNY Mellon accelerated a successful merger using Informatica’s real-time integration products

Resources