Five years ago, the industry was debating the notion of a hybrid cloud and many boards and investors only saw the value in the public cloud. In 2018, AWS announced Outposts and the debate stopped. With 94% of IT spend and 80% of the data on-prem, the world is very much hybrid. The only thing we’re debating now is how to manage multiple clouds. Multi-cloud is complex and requires orchestration of data, security and networking. Cloudera has emerged as a leader if not the leader in hybrid, end-to-end data management.
Last year I wrote an article discussing how Cloudera is helping companies to manage data wherever it may reside. Later I followed up with another article discussing the “360-degree view of data” exacerbated by the Covid-19 pandemic. Cloudera has general availability (GA) with Cloudera Data Platform (CDP) for public and private clouds and has 1,000s of PB of data already under management. It is time to understand what makes this platform particularly powerful and how Cloudera will build on this momentum in the future.
“True” hybrid and multi-cloud data management
I have long maintained that hybrid and multi-cloud are table stakes to make a modern architecture function. Today the debate is more about which tier – public or private – to place applications and data, how to burst between the two, and the importance of security and governance.
To achieve a “true” hybrid cloud requires integration between the private and public clouds that allows applications and data to move seamlessly across the disparate infrastructures and for the combined environments to be managed and secured holistically. I would argue that many hybrid solutions today comprised of multiple point products from different vendors are not a “true” hybrid solution and a high-risk proposition. Each component will be subject to upgrades at various times regardless of your integration. Security and governance models will vary across products that put you at risk of data loss or security vulnerabilities.
I am excited about the availability of Cloudera’s Data Platform on both private and public clouds. Maybe it was my many comments on the hybrid future! But Cloudera recognized early that an enterprise architecture for hybrid and multi-cloud data was the destination.
Cloudera’s Data Platform is a “true” hybrid solution with the same architecture across private and public clouds and the same security and governance to avoid compliance issues and to reduce the risk of security breaches. There is a single pane of glass to manage your data, whether on-premises or in the three major public clouds; take your pick; AWS, Azure, and Google.
Cloudera delivers the agility, flexibility you expect from the cloud, and the cost controls you have on-premises as you can move your data seamlessly between the clouds. Cloudera has achieved a “true” hybrid solution for managing data.
Addressing the entire lifecycle of data
In 2020, before making CDP generally available (GA), Cloudera worked with a group of customers to understand how the private, public tier strategy would work and the level of interoperability across multi-cloud. Technical previews also gave customers experience with the security and governance platform Cloudera Shared Data Experience (SDX). CDP with SDX provides security and governance for your data, managed from a single pane of glass regardless of where the data resides. Security and governance are a significant part of the move to hybrid. SDX collects metadata that moves with the data from on-premises or the public cloud. There is always an audit trail to see who touched the data, how it has changed, who changed it, factors necessary to comply with regulations.
Cloudera has relationships with big enterprises, many of which have been understandably cautious, given the regulatory frameworks and the data privacy laws worldwide. But now, Cloudera sees cloud acceleration that has increased during the pandemic with hundreds of petabytes of data under management. More and more businesses are in the process of learning which workloads need the high performance of cloud and merit the expense while other workloads make more sense to run on prem at a different cost. With freedom of choice that a hybrid cloud model brings, workloads can now be tied directly to budgets and the business value of that workload.
CDP private and public services address the entire lifecycle of data, managing data from the point of origination, process it in real-time, reporting, direct actions using machine learning, and workload placement for optimum performance and cost.
The enterprise data cloud, including a hybrid private cloud, combines the ease of use of public cloud infrastructure with the performance, security, and cost controls in the data center.
Let the migrations begin!Now that CDP is available, Cloudera’s challenge now is to get customers on the platform as quickly and painlessly as possible to solve real business problems.
Migration involves blending business data and cloud strategy to achieve the desired result, best explained by a couple of industry examples.
The first example involves manufacturing and connected cars. Many use cases emerge from analyzing car data, such as the potential to improve journeys, reduce accidents, and predict service events. Analytical models in the cloud or even onboard can predict service events notifying an upcoming issue and directing the driver to the nearest dealership. Warranty analytics have proven to be effective at identifying emerging issues several months faster than traditional warranty processes.
The manufacturer will have data coming from the vehicle that needs to be meshed and analyzed with data coming out of production systems sitting within the manufacturing plant. Cloudera looked at the data landscape, with data at the edge, plant data, production data, and maintenance data that potentially can contribute to the analysis. Fundamental questions such as the quantity of data, sources, locations, and how the cost can be optimized form the basis for a reference architecture. Workloads not utilized in CDP are disabled, so customers are not incurring unnecessary cloud operating costs.
Another example involves the banking industry. The Fundamental Review of the Trading Book (FRTB) is a suite of capital rules developed for banks’ wholesale trading activities and liquidity. In this example, the bank used the CDP public cloud to house new data that required more proactive monitoring and did not fit the existing data warehouse structure. CDP private cloud housed historical data. Visibility to the banks’ overall liquidity resulted from blending data from both sources.
Solutions involve understanding the data landscape and deciding whether to locate in the cloud or on-premises or hybrid and then build the data architecture and cloud architecture based upon those requirements. Cloudera will help with the analysis with tools based on repeatable models that help optimize cloud and upfront costs.
At this point, I would say that Cloudera has emerged as a leader if not the leader in hybrid multi-cloud data analytics. If you want to manage data end-to-end, from the edge, on-premises, and multiple public clouds, your choice today is Cloudera that already has 1,000s of PBs under management.
With Cloudera’s Data Platform private and public cloud both generally available, the new focus is customer business problems, ultimately leading to new product innovation in the form of additional cloud-native services offered on the CDP platform. New integrated workloads typical across the customer base will result in a faster time to value.
To assist customers in migrating workloads to CDP, Cloudera uses a library of repeatable reference architectures increasing over time. Common patterns emerge from the bespoke work Cloudera performs for individual customers.
I look forward to writing the next chapter as customers solve real business problems on this platform across the hybrid cloud.
Note: Moor Insights & Strategy writers and editors may have contributed to this article.