Hadoop Articles
                
                        
                            
                            The past year was a blockbuster one for those working in the data space. Businesses have wrapped their fates around data analytics in an even tighter embrace as competition intensifies and the drive for greater innovation becomes a top priority. The year ahead promises to get even more interesting, especially for data managers and professionals. Leading experts in the field have witnessed a number of data trends emerge in 2016, and now see new developments coming into view for 2017.
                            Posted January 18,  2017
                            
                         
                    
                        
                            
                            With the rise of smartphones, laptops, sensors on machines, vehicles, and appliances, massive amounts of data are being generated, according to Balaji Thiagarajan, group vice president of big data at Oracle. For companies that can transform and manage it, he notes, data represents a huge opportunity as a source of competitive advantage and should be leveraged as such. Big data and cloud are two technologies driving dramatic transformations, and, says Thiagarajan, organizations must be ready to react and take advantage of important new trends and technologies to make sure that they come out ahead next year. Here, Thiagarajan shares 10 key predictions for big data in 2017.
                            Posted January 18,  2017
                            
                         
                    
                        
                            
                            IBM broke the U.S. patent record with 8,088 patents granted to its inventors in 2016. IBM's 2016 patent output covers inventions in artificial intelligence and cognitive computing, cognitive health, cloud, cybersecurity and other strategic growth areas for the company.
                            Posted January 16,  2017
                            
                         
                    
                        
                            
                            As the Hadoop ecosystem matures, the consensus in the industry has been that adoption of Hadoop technology is steady amid continuing disruptive innovation within the open-source framework. However, acquiring or developing skills is one of the most widely cited challenges of integrating Hadoop into the enterprise, as is solution complexity and system integration.
                            Posted January 16,  2017
                            
                         
                    
                        
                            
                            Talend is releasing the Winter '17 version of Talend Data Fabric, focusing on enhancing several different areas including data preparation, data cleansing, and self-service data stewardship. Talend's integrated platform now includes new data preparation features for big data that enable all employees to access, cleanse, and collaborate on the analysis of massive data sets. 
                            Posted January 12,  2017
                            
                         
                    
                        
                            
                            The Oracle Applications Users Group (OAUG) has announced Alyssa Johnson as its 2017 president. An active member of the OAUG since 2003, Johnson has served as a member of the organization's board of directors since 2011 and previously as the OAUG president in 2014.
                            Posted January 12,  2017
                            
                         
                    
                        
                            
                            Arcadia Data, a provider of visual analytics software, has added new native integration features for Arcadia Enterprise and Cloudera Enterprise to deliver a real-time, Hadoop-native analytics platform.
                            Posted January 11,  2017
                            
                         
                    
                        
                            
                            Tableau is partnering with the United States Department of Commerce and operational data management and intelligence provider Enigma to help the general public see and understand government data.
                            Posted January 11,  2017
                            
                         
                    
                        
                            
                            Hortonworks has forged an open source collaboration with Neustar, a provider of real-time information services, on security and identity management tools for IoT devices.
                            Posted January 11,  2017
                            
                         
                    
                        
                            
                            Dataguise, a provider of sensitive data governance solutions, is partnering with WHISHWORKS, a provider of IT services and systems integration. The partnership is focused on helping organizations to overcome difficulties in unlocking big data's potential because of the compliance requirements that will be imposed by the General Data Protection Regulation (GDPR) in May, 2018.
                            Posted January 10,  2017
                            
                         
                    
                        
                            
                            Xplenty has announced new $4 million in funding from Bain Capital Ventures, True Ventures, and Rembrandt Venture Partners, and with participation in the funding round from existing Xplenty investors Magma Venture Partners and Waarde Capital.
                            Posted January 04,  2017
                            
                         
                    
                        
                            
                            Rocana is releasing Rocana Ops 2.0, updating its event alerting and orchestration capabilities, offering unique visual experience for first responders, and providing wider cloud platform visibility. Now with Rocana Ops 2.0, IT teams can take real-time intelligent action on all of their event data. Rocana Reflex is a new event alerting and orchestration system that enables operations teams to provide smart, instant, and automated reactions based on what is happening in their environment.
                            Posted January 03,  2017
                            
                         
                    
                        
                            
                            Arcadia Data, provider of real-time modern business intelligence (BI) platforms, is releasing Arcadia Enterprise 3.3, making significant upgrades to uncover real-time insights. The new features - including Arcadia Smart Acceleration, advanced segmentation and cohort analytics, analytic extensions, and mobile and tablet support - unify real-time insights from streaming data with historical data discovery in a single view.
                            Posted December 20,  2016
                            
                         
                    
                        
                            
                            GridGain Systems, a provider of enterprise-grade in-memory computing solutions based on Apache Ignite, has announced the availability of GridGain Professional Edition 1.8, a fully supported version of Apache Ignite 1.8.
                            Posted December 14,  2016
                            
                         
                    
                        
                            
                            SnapLogic is receiving $40 million in Series F funding to hasten its reach around the globe and transform ways to integrate data, applications, and devices for digital business. The new round was led by European private equity firm Vitruvian Partners, with further investment from Andreessen Horowitz, Capital One, Ignition Partners, NextEquity Partners, and Triangle Peak Partners. This brings SnapLogic's funding to $136.3 million to date.
                            Posted December 07,  2016
                            
                         
                    
                        
                            
                            Dataguise - through its DgSecure platform - is now supporting sensitive data discovery on Amazon Redshift and Amazon RDS, as well as Amazon Simple Storage Service (S3). The platform will now scan for sensitive information stored on Amazon Redshift, RDS, and S3 and provide ongoing monitoring of sensitive data in S3 throughout its lifecycle.
                            Posted December 01,  2016
                            
                         
                    
                        
                            
                            What's ahead for 2017 in terms of big data and IoT? IT executives reflect on the impact that Spark, blockchain, data lakes, cognitive computing,AI and machine learning, and other cutting-edge approaches may have on data management and analytics over the year ahead.
                            Posted November 30,  2016
                            
                         
                    
                        
                            
                            SUSE is acquiring OpenStack IaaS and Cloud Foundry PaaS Talent and Technology Assets from HPE. The agreement aims to accelerate SUSE's entry into the growing Cloud Foundry Platform-as-a-Service (PaaS) market.
                            Posted November 30,  2016
                            
                         
                    
                        
                            
                            AtScale, which provides a self-service BI platform for big data, has announced an expansion of its services. With this announcement, the company says it is introducing a BI platform that enables businesses to work seamlessly across all of big data, on premise and in the cloud. In addition to Hadoop, AtScale has announced preview availability of support for data stored in Teradata, Google Dataproc and BigQuery, expanding on the company's existing support for Microsoft Azure and HDInsight.
                            Posted November 21,  2016
                            
                         
                    
                        
                            
                            Aerospike, a provider of NoSQL solutions, is releasing a new version of its Aerospike platform, transforming how organizations store, access, and analyze data. The new version of Aerospike includes features such as SortedMap, durable delete, IPv6, improved cluster management, and updated network naming.
                            Posted November 17,  2016
                            
                         
                    
                        
                            
                            Databricks has announced that, in collaboration with industry partners, it has broken the world record in the CloudSort Benchmark, a third-party industry benchmarking competition for processing large datasets. Databricks was founded by the team that created the Apache Spark project.
                            Posted November 16,  2016
                            
                         
                    
                        
                            
                            Expanding the capabilities for customers to take advantage of the elasticity of Apache Hadoop and Apache Spark in the cloud to power new workloads and analytic applications, Hortonworks, Inc. has announced the availability of Hortonworks Data Cloud on the AWS Cloud.
                            Posted November 15,  2016
                            
                         
                    
                        
                            
                            MicroStrategy Incorporated is launching MicroStrategy Desktop at no cost to users, allowing professionals to utilize popular data sources and build insightful visualizations. MicroStrategy Desktop, available for Mac and PC, is a data discovery tool that allows users to access data on their own and build dashboards.
                            Posted November 03,  2016
                            
                         
                    
                        
                            
                            New data sources such as sensors, social media, and telematics along with new forms of analytics such as text and graph analysis have necessitated a new data lake design pattern to augment traditional design patterns such as the data warehouse. Unlike the data warehouse - an approach based on structuring and packaging data for the sake of quality, consistency, reuse, ease of use, and performance - the data lake goes in the other direction by storing raw data that lowers data acquisition costs and provides a new form of analytical agility.
                            Posted November 03,  2016
                            
                         
                    
                        
                            
                            Trifacta, a provider of data wrangling solutions, is launching Wrangler Edge, a platform designed for analyst teams wrangling diverse data outside of big data environments. "We are packing the Trifacta product and adding enterprise features such as the ability to schedule jobs to handle larger data volumes to connect to diverse sources," said Will Davis, director of product marketing. "We also added collaboration and sharing features as well all without requiring organizations to manage a large Hadoop infrastructure."
                            Posted November 01,  2016
                            
                         
                    
                        
                            
                            SAP has completed its acquisition of Altiscale, which provides a high-performance, scalable Big Data-as-a-Service (BDaaS) solution that includes full operational services. The company announced this acquisition at Strata + Hadoop 2016 in New York City. With this acquisition Altiscale will operate as a focused and integrated BDaaS offering from SAP to help accelerate and operationalize Big Data deployment in the enterprise.
                            Posted October 26,  2016
                            
                         
                    
                        
                            
                            Paxata is releasing Paxata Connect to extend the Paxata Platform with a connectivity framework that creates a nexus to acquire, shape, and publish meaningful data for faster time to value. With Connect, information architects and developers can take advantage of out-of-the-box connectors, build their own repeatable data services and pipelines, and maintain transparency and oversight to ensure data provides a greater and faster return.
                            Posted October 19,  2016
                            
                         
                    
                        
                            
                            Talend, a provider of cloud and big data integration software, has formed a strategic partnership with T-Systems. T-Systems, a German subsidiary of Deutsche Telekom, is using Talend Big Data Integration software to streamline the collection and cleansing of data as part of T-Systems' Big Data platform services.
                            Posted October 11,  2016
                            
                         
                    
                        
                            
                            Vormetric, a Thales company, is releasing a new platform called Thales Orchestrator, to help reduce the cost of protecting data at rest across organizations. Thales Orchestrator's features include live data transformation, key management-as-a-service, Bring Your Own (encryption) Key management for AWS with Thales HSMs, vaultless tokenization, and Docker Encryption and Access Controls. Docker data-at-rest encryption and access controls that help to assure container images and file systems are controlled and secure.
                            Posted October 06,  2016
                            
                         
                    
                        
                            
                            Splice Machine, provider of an SQL RDBMS powered by Hadoop and Spark, now supports native PL/SQL on Splice Machine. Announced at Strata + Hadoop World in NYC, the new capabilities are available through the Splice Machine Enterprise Edition.
                            Posted October 05,  2016
                            
                         
                    
                        
                            
                            Pepperdata unveiled a new offering that enables customers of Amazon Elastic MapReduce (EMR) to gain granular visibility into their clusters' run time performance. Even after an Amazon EMR cluster has completed its work and is terminated, users will be able to access fine-grained monitoring data that allows customers to view a run and analyze it, as well as compare it with historical data to improve future performance.
                            Posted October 05,  2016
                            
                         
                    
                        
                            
                            Choosing when to leverage cloud infrastructure is a topic that should not be taken lightly. There are a few issues that should be considered when debating cloud as part of a business strategy.
                            Posted October 04,  2016
                            
                         
                    
                        
                            
                            Syncsort is incorporating new open metadata management capabilities in its DMX-h data integration software that, along with its seamless integration with Cloudera Navigator, aim to make big data governance easier.DMX-h provides organizations with a single interface for accessing and integrating all enterprise data, including IBM z mainframes, and the flexibility to use the metadata repository that best meets their needs, on premise and in the cloud.
                            Posted October 04,  2016
                            
                         
                    
                        
                            
                            NoSQL and Hadoop—two foundations of the emerging agile data architecture—have been on the scene for several years now, and, industry observers say, adoption continues to accelerate—especially within mainstream enterprises that weren't necessarily at the cutting edge of technology in the past.
                            Posted October 04,  2016
                            
                         
                    
                        
                            
                            Zaloni, the data lake company, unveiled new platform updates at Strata + Hadoop World 2016 including new enhancements to Bedrock Data Lake Management Platform and its Mica self-service data preparation solution. Bedrock helps businesses govern and manage data across the enterprise, and Bedrock 4.2 adds new capabilities around data privacy, security, and data lifecycle management.
                            Posted October 03,  2016
                            
                         
                    
                        
                            
                            MariaDB Corporation is updating its MaxScale platform, adding a data streaming integration with Kafka, enhanced security, and high availability capabilities. MariaDB MaxScale is a next-generation database proxy that manages administrative functions like security, scalability, data streaming and high availability, enabling the database to focus on core functionality to drive faster innovation.
                            Posted October 03,  2016
                            
                         
                    
                        
                            
                            At Strata + Hadoop World, Hortonworks showcased its technology solutions for streaming analytics, security, governance, and Apache Spark at scale.
                            Posted September 30,  2016
                            
                         
                    
                        
                            
                            Attendees of Strata + Hadoop saw their fair share of solutions that tout that they are "next big thing" to solve a multitude of big data problems. Eric Sammer, CTO and co-founder, and Bryce Hein, vice president of Marketing, at Rocana observed that the focus is more on how platforms can help issues rather than the infrastructure behind it.
                            Posted September 29,  2016
                            
                         
                    
                        
                            
                            Cloudera has added new technology enhancements to its data management and analytics platform to make it easier for companies to take advantage of elastic, on-demand cloud infrastructure for business value from all their data. The move to the cloud has become a top priority for CIOs, said Charles Zedlewski, vice president, of products at Cloudera, at Strata + Hadoop World 2016 in NYC.
                            Posted September 29,  2016
                            
                         
                    
                        
                            
                            MathWorks showcased the latest release of MATLAB, which is used in the development of analytics and algorithms to help solve engineering and scientific problems, at Strata + Hadoop World 2016 in New York.
                            Posted September 29,  2016
                            
                         
                    
                        
                            
                            Dataguise announced general availability of the Dataguise DgSecure Dashboard, a dashboard for visualization of all sensitive data throughout the enterprise—including government-protected PII, PHI, and PCI data—and demonstrated the capability at Strata + Hadoop World Conference in New York City.
                            Posted September 29,  2016
                            
                         
                    
                        
                            
                            At Strata + Hadoop World 2016, Kognitio announced a competition to find the best use case or application that has its newest offering, Kognitio-on-Hadoop, as part of the solution. According to Roger Gaskell, CTO of Kognitio, the company is looking for solutions that are innovative in terms of application or something more common place but is now being done at scale.
                            Posted September 29,  2016
                            
                         
                    
                        
                            
                            Attunity Ltd., a provider of big data management software solutions, is introducing a new platform for SAP, a data replication solution optimized to deliver SAP application data application data in real-time for big data analytics on-premises or in the cloud. Attunity Replicate for SAP aims to transform complex data structures into easily accessible data models across a wide variety of analytics platforms.
                            Posted September 28,  2016
                            
                         
                    
                        
                            
                            SAP is releasing a next generation data warehouse solution for running a real-time digital enterprise on-premise and in the cloud. The new solution, SAP BW/4HANA, will be available on Amazon Web Services (AWS) and SAP HANA Enterprise Cloud (HEC).
                            Posted September 28,  2016