Understand Big Data and Increase its Value to Support Your Agency's Mission
The U.S. government currently stores an average of 1.61 petabytes of data—a number that’s expected to grow to almost 3 petabytes by 2015. Federal agencies must not only collect, aggregate, analyze, and disseminate this information but also strictly adhere to data retention, security, and privacy laws.
These requirements are further complicated for the Department of Defense (DoD) and intelligence community since more than 30 percent of this data is unstructured. These organizations must be empowered to find the “needle in a haystack” in order to support the warfighter and protect against security threats. They face a plethora of issues in adopting a big data strategy, including bandwidth, storage capacity, retrieval, security, computational power, and trained personnel. The greatest challenge in defense and intelligence gathering lies in harnessing the power of data and turning it into actionable information.
The Informatica Solution for Big Data in the DoD and Intelligence Community
The DoD and intelligence community are betting on Big Data by investing $250 million annually to harness and utilize massive data in new ways. This investment also aims to improve the situational awareness of warfighters and analysts and to boost operational support.
With this solution, agencies can:
- Efficiently integrate, analyze, and disseminate massive volumes of data
- Work with Hadoop without committing to a single Hadoop platform
- Boost productivity by managing, ingesting, and correlating structured, unstructured, and semistructured data from any source, in any format, in any language
- Streamline decision making and operational intelligence and provide predictive capabilities to improve mission outcomes
- Innovate new business models and stakeholder services
- Improve operational efficiencies by replacing or supporting human decision making with automated algorithms
- Eliminate waste, fraud, and abuse and increase transparency by making data more readily available and referencing data and trends when formulating policy
- Enable more rapid response to security threats and potentially criminal activity by detecting obscure relationships and event sequences
Key Capabilities of the Informatica Solution for Big Data in the DoD and Intelligence Community
Based on a single, comprehensive platform, this solution addresses the challenges of big data and takes full advantage of Hadoop by providing the following capabilities:
- Universal access to all types of big transaction data: RDBMS, OLTP, OLAP, ERP, CRM; mainframe and cloud; big interaction data, including social media data; log files and machine sensor data; Web sites, blogs, documents, emails, and other unstructured or multi-structured data
- High-speed data ingestion, enabling access, loading, replication, transformation, and extraction of big data between source and target systems or directly into Hadoop or your data warehouse
- Data parsing and exchange of complex, multi-structured, unstructured, and industry standard data such as Web logs, JSON, XML, and machine device data; availability of prebuilt parsers for market data and industry standards such as FIX, SWIFT, ACORD, HL7, HIPAA, and EDI
- Metadata management, including data lineage, auditability, and standardization
- Data quality and data governance to profile, cleanse, and manage data, to increase understanding of data and trust in its quality, and to manage its growth effectively and securely