Informatica for AWS Data Lakes

Get trusted and actionable business insights from your AWS data lake.

Accelerate data lakes as a service on AWS

As organizations increasingly rely on data to power digital transformation, the clamor for fast access to trusted data is growing. Simultaneously, companies look to deploy data management initiatives on AWS for cost-effective scalability and agility. The more complex and voluminous the data, the greater the need to ensure it is complete, consistent, accurate, and compliant. Informatica’s market-leading modular, artificial intelligence (AI)-driven approach to data lake management enables you to deploy your data lake solution on AWS and deliver trusted, timely, and relevant data.

Unleash the power and value of data

Informatica provides native data integration on Hadoop so business analysts can get all the data they need. Hundreds of pre-built high-performance connectors, data integration transformations, and parsers enable virtually any type of data to be ingested and processed on Amazon Elastic MapReduce (EMR) or an AWS-hosted Hadoop distribution. Informatica’s unique capabilities like dynamic mappings, dynamic schema support, and parameterization—combined with smart performance optimization—ensure maximum developer productivity, operational reusability, and data integration performance.

c09-big-data-building According to the 2016 Bain & Co. Building IT Capabilities survey, 59% of organizations believe they lack the capabilities to generate meaningful business insights from their data.

Find, prepare, and govern data in a uniquely collaborative, intelligent way

Informatica provides a simple excel-like data preparation solution that enables business analysts to get the right data at the right time. Unlike passive data preparation solutions, Informatica’s metadata-driven artificial intelligence engine, CLAIRE™, accelerates data delivery and business self-service by inferring the structure of and relationships among data sets to aid business analysts in their discovery. Collaboration capabilities enable multiple business analysts to curate datasets together using tags and project workspaces. Finally, business analysts can operationalize their work by pushing data preparation steps into reusable transformations for IT to run systematically.

Deploy on AWS Marketplace


Discover, classify and catalog data assets across the enterprise

Informatica provides an artificial intelligence-driven discovery engine to scan and understand data assets across the enterprise. Informatica’s metadata-driven artificial intelligence engine, the CLAIRE™ engine, accelerates metadata management and data stewardship by inferring the data domains, data structure, and relationships among data sets so that business analysts and data stewards can find all types of data across the enterprise, discover relationships among them, and enrich the data with business glossary terms and crowdsourced annotations for maximum usability and trackability.