Data Engineering Integration

Get fast, flexible, and repeatable big data integration and ingestion at scale.



Key Features

See how the industry’s most comprehensive cloud data engineering product helps you access, integrate, clean, catalog, and govern big data.

Universal Data Access

Access all types of data including transactions, applications, databases, log files, social, machine, and sensor data.

Advanced Spark Support

Leverage innovations in the latest Spark engine for advanced data management, including data ingestion, data quality, data masking, and stream processing.

Data Integration on Spark and Hadoop

Access an extensive library of advanced prebuilt data transformations, including Python transformations to operationalize data science projects.

Visual Design Interface

Employ the intuitive visual interface to build data engineering transformation logic used in creating the data pipeline.

High-Speed Mass Ingestion

Ingest data from source systems and applications into cloud and big data using high-performance connectivity, mass ingestion, and dynamic mappings.

Serverless Deployment

Automatically scale and deploy big data workloads in cloud environments, such as Amazon Web Services, Microsoft Azure, and Google Cloud Platform.

Intelligent Data Parsing

Parse complex multi-structured, hierarchical, and unstructured data automatically with Informatica Intelligent Structure Discovery. Easily handle schema drifts.

Flexible Deployment

Automate deployment and management of data processing compute clusters with Spark Serverless services, such as Databricks, Qubole, and Google Dataproc.

Big Data Profiling

Profile big data to better understand the data, identify data quality issues, and collaborate on data pipelines.

Operational Insights

Give DataOps a single view to monitor and plan big data resources. Achieve better compliance and performance management.

Data Engineering in the Cloud

Informatica Data Lake Management on AWS

Take advantage of the security and scalability of the managed Hadoop framework in AWS EMR to easily find, prepare, and govern big data to quickly drive business value.

Informatica Data Lake Management on Microsoft Azure

Leverage the flexibility of the managed Hadoop framework in Microsoft Azure HDInsight to easily find, prepare, and govern big data to quickly drive business value.

Intelligent Data Pipelines with Databricks

Accelerate data pipelines for AI and analytics with intelligent data integration and ingestion from Informatica and Databricks.


Informatica Operational Insights

Use machine learning to efficiently monitor and manage your data lake deployments across domains and locations.


Sign up now

Customer Success Stories

Western Union

Western Union built a data platform based on Hadoop and Informatica Big Data Edition

M.D. Anderson Cancer Center

Informatica empowered scientific and clinical collaboration at this renowned cancer center by turning data into knowledge and facilitating self-service business intelligence.

Tinkoff Bank

Tinkoff Bank acquires and retains more customers at a lower cost with Informatica Big Data Management.