Syncsort, a software company specializing in high speed sorting products, data integration and backup software and services, has formed a technology partnership with Cloudera. The joint effort is aimed a providing an approach that customers can leverage to exploit critical mainframe data in Hadoop for big data analytics.
Cloudera provides deep Hadoop expertise, with a secure, reliable platform that makes Hadoop enterprise-ready. Syncsort's Hadoop ETL product, DMX-h allows access to mainframe data and helps reduce costs by replicating expensive mainframe batch processes in Hadoop. Combined with Cloudera’s Hadoop distribution, CDH, customers can improve efficiency and deploy big data analytics. In addition, once mainframe data is in Hadoop customers can combine it with a other sources.
“We view Cloudera as enabling a lot of the enterprise-level services that Hadoop needs,” said Jorge Lopez, director of product marketing, Syncsort, in an interview. “Together, we can really bridge the gap between the mainframe and Hadoop.”
The mainframe hosts mission-critical corporate data for many companies and is an essential component to be combined with unstructured data sources, such as web logs and clickstream data. As enterprises begin their big data initiatives, these organizations need to develop strategies to address the unique challenges of leveraging mainframe data in Hadoop as a means to leverage all their data.
The easiest and initial step to deal with this data in Hadoop is to copy the data from the mainframe into Hadoop so that it can combined and made securely available to the entire organization. The other option is actually moving the data from the mainframe into Hadoop to free up mainframe storage, the companies say.
For a test drive of DMX-h, including a Linux Virtual Machine with Cloudera CDH and DMX-h ETL Edition pre-installed, along with use case accelerators and sample data, click here.