Data Warehousing Articles
Rivery, the SaaS ELT, announced a new funding round of venture capital, receiving $30 million that will enable the company to expedite its growth across all teams in New York and Tel Aviv HQ including R&D, Product, and Sales, as well as expanding on EMEA where a London office has been launched to focus on the regional market.
Posted June 03, 2022
As we advance deeper into the digitally roaring 2020s, data executives and professionals are seeing change on a scale never seen before in their careers. A new generation of technologies that often build on previous solutions means new ways of working and ensuring performance for today's increasingly data-driven enterprises. We asked industry leaders for their views on what technology is enhancing enterprises' ability to compete on data.
Posted June 02, 2022
To serve the data analytics and applications for business growth and operational needs, data lakes are being widely adopted as the data infrastructure because of their scalability and flexibility. Data lakes are strong at parking petabytes of data and production delivery as a result of their "schema-on-read" structure. But every coin has two sides. Data lakes, as a semantically flexible data store and bypassed governance efforts, have been seen as muddy swamps and inefficient in data management.
Posted June 02, 2022
Data archiving is an important aspect of data governance and data management. Not only does archiving help to reduce hardware and storage costs, but it is also an important aspect of long-term data retention and a key participant in regulatory compliance efforts. When long-term data retention is imposed on your data—anything more than a couple of years—then archiving it can be the most optimal solution.
Posted June 02, 2022
Broadcom Inc., a global technology provider that designs, develops, and supplies semiconductor and infrastructure software solutions, announced it is acquiring VMware, Inc., an innovator in enterprise software. Broadcom will acquire all of the outstanding shares of VMware in a cash-and-stock transaction that values VMware at approximately $61 billion, based on the closing price of Broadcom common stock on May 25, 2022. In addition, Broadcom will assume $8 billion of VMware net debt.
Posted June 02, 2022
There are so many new buzzwords lately, including the data lakehouse, data mesh, and data fabric, just to name a few. But what do all these terms mean, and how do they compare to a data warehouse? This presentation covers all of them in detail and explains the pros and cons of each, with suggested use cases so attendees can see what approach will really work best for their big data needs.
Posted June 02, 2022
Microsoft has recently released a powerful new DMV specifically to help with memory issues, sys.dm_os_out_of_memory_events. It is currently available in Azure SQL Database and Azure SQL Managed Instances. This DMV consolidates and simplifies telemetry from SQL Server ring buffers, applies heuristics, and provides a result set. The DMV stores a record for each out-of-memory (OOM) event that occurs within the database, providing details about the OOM root cause, the memory consumption of database engine components at that point in time, potential sources of memory leaks, and more.
Posted June 02, 2022
PlanetScale, the serverless database provider powered by MySQL and Vitess, is offering a number of new innovations that accelerate delivery of "The Future Database," with new Insights providing granular performance visibility, Portals for multi-region deployment, and Connect enabling expansive analytics platform integrations.
Posted June 02, 2022
Data Intensity recently announced its deepened commitment to accelerate and transform Oracle-powered workloads to Oracle Cloud Infrastructure (OCI), combining a strategic partnership with its own migration and lifecycle management portfolio of expert technical and functional support services.
Posted June 01, 2022
The coming decade is going to require a modern data warehouse to meet demanding new requirements for machine learning, data variety, and real-time analytics—while still satisfying the more traditional need for analysis of structured data at scale.
Posted June 01, 2022
Deepnote, an early-stage startup backed by Accel and Index Ventures, is emerging from beta with version 1.0, opening up to the general availability of collaborative data science notebooks to data teams worldwide. Since the company's Series A announcement in Jan 2022, Deepnote has added many features going into the 1.0 launch. Most notably is the addition of Deepnote Workspaces, which empowers data teams to organize and surface data projects, notebooks, and apps in one place.
Posted May 31, 2022
SAP is introducing new innovations that deliver business value for customers in four critical areas: supply chain resilience, sustainability, business process transformation, and no-code application development. The innovations announced will help SAP customers accelerate their transformation journey with cloud-based solutions that provide the end-to-end business process support customers most need, according to the vendor.
Posted May 25, 2022
Push Technology, a provider of real-time data streaming and messaging solutions, is releasing Diffusion 6.8, adding new features that include the Diffusion Gateway Framework, expanded data wrangling calculations and conditionals, and journal logging.
Posted May 20, 2022
Data consumers need data for BI and analytics to make business decisions. But for most organizations, their current data infrastructure isn't keeping up with demand. In a presentation at Data Summit 2022, titled "Building the Open Data Lakehouse," Mark Lyons, senior director, product management, Dremio, explained why more organizations are moving their analytics and BI to an open data lakehouse and how you can build a successful lakehouse strategy.
Posted May 18, 2022
No other subject seems to capture the attention of IT leaders right now like database migrations. If there were an IT theme for 2022, it would be: Enterprises migrate from legacy data warehouses to the cloud. And it is no longer just the "early adopters" but the entire customer base that is looking to make the move to cloud-based systems. Let's examine the three most common problems that hamper the execution of migration projects and what can be done to avert migration disasters.
Posted May 18, 2022
Thomas Hazel, founder/CTO, ChaosSearch, examined the tools and technologies to get more value from data and how to determine which ones are right for your organization in a Data Summit 2022 keynote. By stripping away data engineering complexity and lowering total cost of infrastructure ownership and maintenance, more and more organizations are unlocking the value of analytics at scale.
Posted May 17, 2022
Data is often described as "the new oil"—a valuable fuel flowing through organizations. But it is time to stop talking about data as the new oil and concentrate instead on acting on its true importance. This is the view of Doug Laney author of "Infonomics," who gave the opening keynote talk at Data Summit 2022 in Boston.
Posted May 17, 2022
Around 85% of analytics, big data, and AI projects will fail, despite massive investments of money. It's not new news, but it still reflects on how powerfully design affects speed, scale, and usage. At Data Summit 2022, Brian O'Neill, founder and principal, Designing for Analytics presented his session, "Technically Right, Effectively Wrong: How to Avoid Creating the ML or Analytics Application No Customer Wants to Use."
Posted May 17, 2022
The case for increased data automation is clear. "Data teams are spending significant amounts of time on service requests like infrastructure, user provisioning, and incident coordination and communication," said Tina Huang, CTO and founder of Transposit. "Teams today are often manually creating tickets, Slack channels, and Zoom meetings, plus communicating with stakeholders. Data teams must ensure internal customers using data have access to the data they need and real-time updates about interferences with that data." Other tasks ripe for automation include log parsing, correlation, permissions and access, and more.
Posted May 16, 2022
Couchbase has announced version 7.1 of Couchbase Server, a new release that delivers advancements in performance, storage capacity, and workload breadth, including expanded operational analytics support with direct Tableau integration-all while reducing deployment cost. According to Couchbase, with 7.1, enterprise architects and development teams reduce the cost of building and running applications while gaining operational efficiency. "More organizations are experiencing the drawbacks of deploying first-generation cloud architectures, and one of the main disadvantages is the cost of cloud instance sprawl," said Ravi Mayuram, chief technology officer at Couchbase.
Posted May 10, 2022
Domino Data Lab, provider of a leading enterprise MLOps platform is introducing Domino 5.2, continuing Domino's progress towards helping enterprises become model-driven.
Posted May 09, 2022
Alluxio, the developer of the open source data orchestration platform for data driven workloads such as large-scale analytics and AI/ML, is releasing version 2.8 of its Data Orchestration Platform, featuring enhanced interface support for the Amazon S3 REST API; security improvements for sensitive applications with strict encryption compliance and regulatory requirements; and strengthened automated data movement functionality across heterogeneous storage systems.
Posted May 04, 2022
The volume, velocity and veracity of today's data deluge has put immense pressure on underlying data platforms and organizations' abilities to manage them effectively. And the pandemic has only exacerbated the problem. According to a 2021 survey, nearly half of digital architects are under high or extremely high pressure to deliver digital projects, but 61% blame legacy technology for making it difficult to complete modernization efforts. That said, databases of all types—SQL, NoSQL, or NewSQL—be they on-prem, cloud, hybrid, or edge, are struggling to navigate this new reality.
Posted May 04, 2022
The value of normalization is in understanding the data well enough to create the normalized design. Pulling out the business rules, business terms, and relationships from the mass of jumbled together raw content is critical. The business rules that result from performing the normalization exercise establish the requirements that need to be satisfied by solutions, whether they are either built or purchased. When an organization creates and maintains a normalized design for the data within the important areas of their business, they reduce work on all future systems.
Posted May 04, 2022
The 9th annual Data Summit conference will be held May 17-18, 2022, at the Hyatt Regency Boston. Pre-conference workshops will take place on May 16, 2022. The program is available for review and a variety of pass options are available to suit individual requirements.
Posted May 04, 2022
It is well known that a database is the fundamental building block for any data-based initiative. Databases are used when collecting, storing, processing, and analyzing data. A database is the silent component that drives business decisions and operational improvements or simply keeps track of inventory. As much as the database should be the almost invisible part of these processes, it is crucial to make the right choice. While it might look easy to select a suitable database, there are a few things to evaluate when making a decision.
Posted May 04, 2022
Having access to the latest version of open source databases is important to optimize your workloads for availability, performance, security, and more. In February 2022, AWS launched MariaDB version 10.6 for Amazon RDS for MariaDB alongside a number of other exciting capabilities.
Posted May 03, 2022
Many organizations are working hard to move to the cloud, but find that with a migration there is also complexity. Recently, Derek Swanson, CTO of Silk, offered advice on what to evaluate to successfully take advantage of all cloud has to offer, the issues to consider when determining what infrastructure will best serve each workload, and the risks of going to the cloud with the wrong strategy.
Posted May 02, 2022
Ocient, a hyperscale data analytics solutions company, is releasing version 19 of the Ocient Hyperscale Data Warehouse, enabling organizations to execute previously infeasible workloads in interactive time. With Ocient, organizations can tackle CPU-intensive workloads with ease, including large-scale joins and full-table scans with extreme I/O performance, returning results in seconds or minutes versus hours or days, according to the vendor.
Posted April 29, 2022
LogDNA, a leading observability data platform, is introducing several platform capabilities that empower companies to get more out of log data while maintaining control over costs. Enterprise users can now access Variable Retention and Enterprise Organizations, while all users benefit from new log control features, including Log Data Restoration, Usage Quotas, and Index Rate Alerting.
Posted April 29, 2022
Arcion is partnering with Databricks offer preconfigured, validated data replication for users of Databricks through that company's new Partner Connect program. Arcion's product enables faster, more agile analytics and AI/ML by empowering enterprises to integrate mission-critical transactional systems with their Databricks Lakehouse in real time, at scale, and with guaranteed transactional integrity, according to the vendor.
Posted April 28, 2022
Airbyte, creators of an open-source data integration platform, is releasing its cloud service for data movement in the U.S. "With Airbyte Cloud, we remove the headache of building and maintaining custom data infrastructure by providing a simple, economical way for enterprises to move data as needed," said Michel Tricot, co-founder and CEO, Airbyte.
Posted April 28, 2022
Google unveiled a variety of new services and innovations that allow customers to work with limitless data, across all workloads, and extend access to everyone. These new enhancements were revealed during its Data Cloud Summit.
Posted April 28, 2022
Each year, Data Summit features industry-leading experts covering the topics that matter most for data professionals who want to stay on top of the latest technologies and strategies. The conference program is now available for review, and a variety of pass options are being offered, including special pricing for attendees who register early.
Posted April 28, 2022
Microsoft announced it has begun migrating its internal SAP systems to S/4HANA under the RISE with SAP umbrella, making SAP responsible for the licensing, technical management, hosting, and support of its SAP applications under a single SLA. The migration to S/4HANA will serve a dual purpose for Microsoft: modernizing its legacy SAP systems before the end of mainstream support in 2027 and demonstrating to customers that it is capable of hosting and running one of the largest and most complex SAP installations in the world within the RISE framework.
Posted April 27, 2022
At AWS Summit San Francisco, AWS announced that Amazon Aurora Serverless v2 is generally available for both Aurora PostgreSQL and MySQL. Aurora Serverless is an on-demand, auto-scaling configuration for Amazon Aurora that allows a database to scale capacity up or down based on your application's needs.
Posted April 21, 2022
Quest Software has announced the launch of Foglight 6.1, a monitoring and optimization platform for the hybrid enterprise. Foglight enables businesses to confidently manage their IT infrastructure and databases by providing them with the tools for deep-dive database workload optimization and cloud cost management. New features include notification management for IT alert configuration, gMSA account integration for password security, and execution plan analysis for MySQL.
Posted April 20, 2022
IBM has announced Db2 13 for z/OS, which, the company says, enhances the availability, security and resiliency of data and applications. According to IBM, Db2 13 for z/OS provides the ability to develop large-scale AI-insights through an innovative, database-integrated approach, infuse AI within any application to improve operations and reduce costs, and enhance resiliency, efficiency, and application stability for maximum availability. Availability is planned for May 31, 2022.
Posted April 18, 2022
DBI Software, a provider of best-in-class performance monitoring, tuning, and trending tools for IBM Db2 LUW and SQL Server databases, has announced the release of version 7 of its Database Performance Web Suite.
Posted April 18, 2022
Startups are always emerging to address challenges and leverage opportunities in innovative ways. The companies bring fresh approaches to accelerating digital transformation, expanding what's possible with analytics, breaking down silos, and more. Here are 15 startups DBTA thinks are worth watching in 2022.
Posted April 18, 2022
dbt Labs, a pioneer in analytics engineering, is providing dbt Cloud on Databricks Partner Connect, allowing Databricks customers to experience the benefits of dbt Cloud on the lakehouse.
Posted April 15, 2022
Snowplow, an industry leader in data creation and behavioral data, announced that it has successfully joined the Snowflake Partner Network as a Premier Partner, along with achieving the Snowflake Ready Technology Validation. The Snowflake Ready Technology Validation program recognizes organizations that have completed a third party technical validation to confirm optimization with Snowflake integrations.
Posted April 14, 2022
For years, Oracle Exadata has been the hardware/software platform of choice for running Oracle databases—a resource deployed when organizations are looking to simplify digital transformations, increase database performance, and reduce costs. However as enterprises continue their cloud migration, questions arise about how to effectively migrate database workloads off of Oracle's Exadata Database Machine and onto the public cloud. Technology executives sizing up the risk against the often-significant rewards are particularly concerned about database performance, resiliency, and cost.
Posted April 13, 2022