Enterprise

The cloud backlash has begun: Why big data is pulling compute back on premises

Comment

Monsoon, Rainy Season background . cloud rain and thunderbolt hanging on blue sky. paper art style.vector.
Image Credits: lim_pix / Getty Images

Thomas Robinson

Contributor

Thomas Robinson is COO of Domino Data Lab, where he is responsible for revenue and go-to-market, leading sales, marketing, professional services, customer support and partnerships.

The great cloud migration has revolutionized IT, but after a decade of cloud transformations, the most sophisticated enterprises are now taking the next generational leap: developing true hybrid strategies to support increasingly business-critical data science initiatives and repatriating workloads from the cloud back to on-premises systems. Enterprises that haven’t begun this process are already behind.

The great cloud migration

Ten years ago, the cloud was mostly used by small startups that didn’t have the resources to build and operate a physical infrastructure and for businesses that wanted to move their collaboration services to a managed infrastructure. Public cloud services (and cheap capital in a low interest-rate economy) meant such customers could serve a growing number of users relatively inexpensively. This environment enabled cloud-native startups such as Uber and Airbnb to scale and thrive.

Over the next decade, companies flocked en masse to the cloud because it lowered costs and expedited innovation. This was truly a paradigm shift and company after company announced “cloud-first” strategies and moved infrastructures wholesale to cloud service providers.

The growing backlash

However, cloud-first strategies may be hitting the limits of their efficacy, and in many cases, ROIs are diminishing, triggering a major cloud backlash. Ubiquitous cloud adoption has given rise to new challenges, namely out-of-control costs, deepening complexity and restrictive vendor lock-in. We call this cloud sprawl.

The sheer quantity of workloads in the cloud is causing cloud expenses to skyrocket. Enterprises are now running core compute workloads and massive storage volumes in the cloud — not to mention ML, AI and deep learning programs that require dozens or even hundreds of GPUs and terabytes or even petabytes of data.

The costs keep climbing with no end in sight. In fact, some companies are now spending up to twice as much on cloud services as they were before they migrated their workloads from on-prem systems. Nvidia estimates that moving large, specialized AI and ML workloads back on premises can yield a 30% savings.

Furthermore, new regulations are complicating cloud environments. U.S. and European data sovereignty laws require enterprises to manage and isolate data in multiple regions according to varying compliance regulations, with compute attached to each one. This makes a single-region, single-cloud design no longer feasible for sophisticated global enterprises, further adding to the cost and complexity of infrastructure.

Lastly, cloud service providers have continued to move up the stack from providing not just infrastructure as a service (IaaS), but also platform as a service (PaaS) and software as a service (SaaS) in one convenient, integrated cloud deployment. These PaaS and SaaS offerings are a double-edged sword; while they provide ease of use and expedite time-to-value, they also have higher prices/margins and lead to vendor lock-in.

Unlike S3, AWS’ storage layer with an API that has become standard across clouds and on-prem storage providers, higher-level services like GCP Vertex are unique offerings with no intracloud compatibility. The net effect of building these higher-in-the-stack services into your IT architecture is being locked into a specific cloud provider.

The great repatriation

Thankfully, many of today’s most sophisticated companies realize that this new cloud paradigm is untenable and are developing a hybrid multicloud approach consisting of more than one public cloud provider and on-prem systems.

Andreessen Horowitz studied 50 top publicly traded software companies’ disclosed cloud costs and found that, “for every dollar of gross profit saved, market caps rise on average 24-25X the net cost savings from cloud repatriation. This means an additional $4B of gross profit can be estimated to yield an additional $100B of market capitalization among these 50 companies alone. Extending this analysis to the broader universe of scale public companies that stand to benefit from related savings, we estimate that the total impact is potentially greater than $500B.”

Walmart, for example, recently disclosed a years-long project to diversify its infrastructure to include edge compute at their store locations to augment what was previously provided by their Azure and GCP cloud infrastructure. Walmart’s new multicloud structure enables it to “switch seamlessly” between Google’s and Microsoft’s web-based services and its proprietary servers. Walmart said that “the system has saved as much as 18% annually on overall cloud expenditures and mitigates the potential for outages.”

If the Fortune 50 organizations repatriating workloads back to on-prem isn’t convincing, take a look at cloud service providers. All of the major cloud providers have invested in new products to add on-prem resources to their cloud stacks. AWS has debuted Outposts with K8s, Google is developing Anthos, and Azure is touting ARC.

If Amazon, Google and Microsoft are pouring vast sums into product development for on-prem capabilities, clearly the writing is on the wall for future customer demand. Even cloud stalwarts like Snowflake are making pivots into the hybrid/on-prem space.

Multicloud is the new cloud, and multicloud now includes on-prem.

What should companies do next?

The good news is that now more than ever, companies have more flexibility in how they develop their infrastructures.

One of the great second-order benefits of the great cloud migration has been the development of a whole new category of technologies incubated by cloud service providers. Better DevOps automation tools and cloud-native application design with technologies like containerization and Kubernetes have proliferated from the cloud world and become accessible to any organization (e.g., Google had the wisdom to allow Craig McLuckie and Joe Beda to develop Kubernetes and open source it to the world).

These new tools and technologies have reduced the cost and operational overhead for companies to manage their own infrastructure. The cloud-native approach provides more flexibility for organizations to move workloads between different underlying IaaS stacks. While not all companies are ready for a wholesale repatriation, there are a few things that can be done to ensure future flexibility for where workloads are created.

  1. Don’t lock yourself in. Avoid cloud services that are higher up the stack and designed to create lock-in. IaaS services have given way to PaaS and SaaS offerings. Each of these are only available from a single cloud service provider and serve to create lock-in to that particular cloud. If you spend time migrating your data science and ML workloads to AWS Sagemaker, for example, then migrating that back to on-prem or another cloud won’t be possible. The PaaS stack is completely different with Google Vertex and there are no easy paths to migrate those workloads. Instead, opt for a vendor-agnostic stack that has portability to other clouds.
  2. Make portability a priority for your architectural review committee. When developing applications, establish an architecture review process to determine if software will have hybrid-compatible architecture underpinnings and ensure that applications embrace the architectural principles of the cloud to deliver cost savings, security and efficiencies, whether in the cloud or on-prem.
  3. Start investing in hybrid multicloud and decide for yourself. This can take the form of beachhead projects — e.g., picking one large AI training workload to run on-prem — or it can take the form of vendor exploration from providers who are betting on hybrid, such as VMware, RedHat and NetApp.

Choose wisely

While cloud computing was once a panacea for savings and innovation, returns are now diminishing as AI and ML workloads drive both data volumes and the need for accelerated compute upwards, impacting bottom lines. However, the remarkable success of the cloud has given us powerful new ways of building and managing IT.

Today, the flexible architectural underpinnings that enabled cloud-first giants to permeate enterprise stacks on and off premises, making them just as nimble for addressing architectural needs. The choice between data centers and clouds — even which cloud to use — is yours. Make it wisely.

More TechCrunch

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The deck included some redacted numbers, but there was still enough data to get a good picture.

Pitch Deck Teardown: Cloudsmith’s $15M Series A deck

The company is describing the event as “a chance to demo some ChatGPT and GPT-4 updates.”

OpenAI’s ChatGPT announcement: What we know so far

Unlike ChatGPT, Claude did not become a new App Store hit.

Anthropic’s Claude sees tepid reception on iOS compared with ChatGPT’s debut

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Look,…

Startups Weekly: Trouble in EV land and Peloton is circling the drain

Scarcely five months after its founding, hard tech startup Layup Parts has landed a $9 million round of financing led by Founders Fund to transform composites manufacturing. Lux Capital and Haystack…

Founders Fund leads financing of composites startup Layup Parts

AI startup Anthropic is changing its policies to allow minors to use its generative AI systems — in certain circumstances, at least.  Announced in a post on the company’s official…

Anthropic now lets kids use its AI tech — within limits

Zeekr’s market hype is noteworthy and may indicate that investors see value in the high-quality, low-price offerings of Chinese automakers.

The buzziest EV IPO of the year is a Chinese automaker

Venture capital has been hit hard by souring macroeconomic conditions over the past few years and it’s not yet clear how the market downturn affected VC fund performance. But recent…

VC fund performance is down sharply — but it may have already hit its lowest point

The person who claims to have 49 million Dell customer records told TechCrunch that he brute-forced an online company portal and scraped customer data, including physical addresses, directly from Dell’s…

Threat actor says he scraped 49M Dell customer addresses before the company found out

The social network has announced an updated version of its app that lets you offer feedback about its algorithmic feed so you can better customize it.

Bluesky now lets you personalize main Discover feed using new controls

Microsoft will launch its own mobile game store in July, the company announced at the Bloomberg Technology Summit on Thursday. Xbox president Sarah Bond shared that the company plans to…

Microsoft is launching its mobile game store in July

Smart ring maker Oura is launching two new features focused on heart health, the company announced on Friday. The first claims to help users get an idea of their cardiovascular…

Oura launches two new heart health features

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: OpenAI considers allowing AI porn

Garena is quietly developing new India-themed games even though Free Fire, its biggest title, has still not made a comeback to the country.

Garena is quietly making India-themed games even as Free Fire’s relaunch remains doubtful

The U.S.’ NHTSA has opened a fourth investigation into the Fisker Ocean SUV, spurred by multiple claims of “inadvertent Automatic Emergency Braking.”

Fisker Ocean faces fourth federal safety probe

CoreWeave has formally opened an office in London that will serve as its European headquarters and home to two new data centers.

CoreWeave, a $19B AI compute provider, opens European HQ in London with plans for 2 UK data centers

The Series C funding, which brings its total raise to around $95 million, will go toward mass production of the startup’s inaugural products

AI chip startup DEEPX secures $80M Series C at a $529M valuation 

A dust-up between Evolve Bank & Trust, Mercury and Synapse has led TabaPay to abandon its acquisition plans of troubled banking-as-a-service startup Synapse.

Infighting among fintech players has caused TabaPay to ‘pull out’ from buying bankrupt Synapse

The problem is not the media, but the message.

Apple’s ‘Crush’ ad is disgusting

The Twitter for Android client was “a demo app that Google had created and gave to us,” says Particle co-founder and ex-Twitter employee Sara Beykpour.

Google built some of the first social apps for Android, including Twitter and others

WhatsApp is updating its mobile apps for a fresh and more streamlined look, while also introducing a new “darker dark mode,” the company announced on Thursday. The messaging app says…

WhatsApp’s latest update streamlines navigation and adds a ‘darker dark mode’

Plinky lets you solve the problem of saving and organizing links from anywhere with a focus on simplicity and customization.

Plinky is an app for you to collect and organize links easily

The keynote kicks off at 10 a.m. PT on Tuesday and will offer glimpses into the latest versions of Android, Wear OS and Android TV.

Google I/O 2024: How to watch

For cancer patients, medicines administered in clinical trials can help save or extend lives. But despite thousands of trials in the United States each year, only 3% to 5% of…

Triomics raises $15M Series A to automate cancer clinical trials matching

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Tap, tap.…

Tesla drives Luminar lidar sales and Motional pauses robotaxi plans

The newly announced “Public Content Policy” will now join Reddit’s existing privacy policy and content policy to guide how Reddit’s data is being accessed and used by commercial entities and…

Reddit locks down its public data in new content policy, says use now requires a contract

Eva Ho plans to step away from her position as general partner at Fika Ventures, the Los Angeles-based seed firm she co-founded in 2016. Fika told LPs of Ho’s intention…

Fika Ventures co-founder Eva Ho will step back from the firm after its current fund is deployed

In a post on Werner Vogels’ personal blog, he details Distill, an open-source app he built to transcribe and summarize conference calls.

Amazon’s CTO built a meeting-summarizing app for some reason

Paris-based Mistral AI, a startup working on open source large language models — the building block for generative AI services — has been raising money at a $6 billion valuation,…

Sources: Mistral AI raising at a $6B valuation, SoftBank ‘not in’ but DST is