Enterprise Data Lake

Discover raw data and prepare it to deliver high-quality insights.

Overview

Key Features

Excel-like interface

An easy-to-use interface allows business analysts to blend data without waiting for IT.

Enterprise collaboration

Manage data publications on a self-service basis, while organizing projects in workspaces.

Rapid blending of data sets

Prebuilt data integration transformations natively process all types of data at any scale.

Smart charts

Enable data exploration with easy-to-use data visualization.

Automated data discovery

Machine intelligence recommends hard-to-find data assets and identifies sensitive data for compliance.

Crowdsourced asset tagging and sharing

Empower analysts to collaborate in the data curation process by easily tagging and sharing data assets.

Self-service Sqoop data transfer

Bi-directional data ingestion and publication between Hadoop and external data sources through Apache Sqoop.

Automated workflow creation

Record data prep steps as reusable data pipeline mappings for quick execution using Blaze or other engines.

Automated data quality

Prebuilt data quality business rule transformations to ensure consistency and accuracy.

User administration and authorization

Centralized security administration through Apache Ranger and role-based access control through Apache Sentry.