GPU

Inference.ai matches AI workloads with cloud GPU compute

GPUs’ ability to perform many computations in parallel makes them well-suited to running today’s most capable AI. But GPUs are becoming tougher to procure, as companies of all sizes increase their

CentML lands $27M from Nvidia, others to make AI models run more efficiently

Contrary to what you might’ve heard, the era of large seed rounds isn’t over — at least in the AI sector. CentML, a startup developing tools to decrease the cost — and improve

SQream calls in $45M to expand its GPU-based big data analytics platform

Back in 2010, Israeli data analytics startup SQream made a bet on the potential of GPUs as a cornerstone of enabling the processing and querying of big datasets, an area that it believed would only gr

How Index Ventures jumped to the front of the AI GPU line

Earlier this week, The New York Times shone a light on some of the desperation that founders are experiencing as they try and fail to secure compute power for their nascent artificial intelligence sta

Welcome to the trillion-dollar club, Nvidia

Given that the stock market has battered tech stocks in recent quarters, how is Nvidia breaking away from the pack? What can we learn from its rise?

Apple resizes the iPad’s workflow with Stage Manager

Though the iPad was a huge hit from the beginning based on its user-friendly interface and single-application focus, it had begun feeling a bit stale for those who hunger for more depth. Long one of t

Vultr will now let you rent a share of Nvidia’s A100 GPUs in its cloud

Vultr, the cloud platform that specializes in providing access to basic infrastructure services at a relatively low cost, today announced the launch of Vultr Talon, a new service that will offer devel

Google Cloud now lets you suspend and resume VMs

Google Cloud today launched its Suspend/Resume feature for virtual machines into general availability. Before it launched this feature as an alpha a couple of years ago, the only option developers had

Run:ai raises $75M for its AI platform

Tel Aviv-based Run:ai, a startup that makes it easier for developers and operations teams to manage and optimize their AI infrastructure, today announced that it has raised a $75 million Series C fund

Run:AI raises $30M Series B for its AI compute platform

Run:AI, a Tel Aviv-based company that helps businesses orchestrate and optimize their AI compute infrastructure, today announced that it has raised a $30 million Series B round. The new round was led

WaveOne aims to make video AI-native and turn streaming upside down

Video has worked the same way for a long, long time. And because of its unique qualities, video has been largely immune to the machine learning explosion upending industry after industry. WaveOne hope

AWS launches its next-gen GPU instances

AWS today announced the launch of its newest GPU-equipped instances. Dubbed P4d, these new instances are launching a decade after AWS launched its first set of Cluster GPU instances. This new generati

Nvidia will power world’s fastest AI supercomputer, to be located in Europe

Nvidia is going to be powering the world’s fastest AI supercomputer, a new system dubbed “Leonardo” that’s being built by the Italian multi-university consortium CINECA, a g

Arm launches new chip designs for autonomous systems

Chip designer Arm today announced the launch of a new set of solutions for autonomous systems for both automotive and industrial use cases. These include the Arm Cortex-A78AE high-performance CPU, the

Apple unveils its super fast five nanometer A14 chip, shipping in the new iPad Air next month

No iPhone 12 announced today, but Apple unveiled a new chip that will power the next generation of its hardware (including that phone whenever it’s launched). The A14 Bionic, which will ship fir

Adding an external GPU to your Mac is probably a better upgrade option than getting a new one

Apple recently announced they would be transitioning their Mac line from Intel processors to their own, ARM-based Apple Silicon. That process is meant to begin with hardware to be announced later this

Nvidia begins shipping the A100, its first Ampere-based data center GPU

Nvidia announced today that its NVIDIA A100, the first of its GPUs based on its Ampere architecture, is now in full production and has begun shipping to customers globally. Ampere is a big generationa

China Roundup: Ant Financial’s new boss and Tencent’s army of new apps

Hello and welcome back to TechCrunch’s China Roundup, a digest of recent events shaping the Chinese tech landscape and what they mean to people in the rest of the world. This week, we are looking at

You can now use Azure to manage resources anywhere, including on AWS and Google Cloud

With the preview of Azure Arc, Microsoft today announced a major step in the evolution of its hybrid cloud story. Azure Arc takes the work the company has done on projects like Azure Stack, throws in

Nvidia breaks records in training and inference for real-time conversational AI

Nvidia’s GPU-powered platform for developing and running conversational AI that understands and responds to natural language requests has achieved some key milestones and broken some records tha