Skip to main content
GPUBeat Chips & Hardware DeepInfra Secures $107M to Expand AI…

DeepInfra Secures $107M to Expand AI Inference Cloud Platform

DeepInfra has raised $107 million in Series B funding to enhance its AI inference cloud platform, tackling the rising demands for performance and security in AI workloads.

NVIDIA — ai-infrastructure — NVIDIA
DeepInfra Secures $107M to Expand AI Inference Cloud Platform Source: GPUBeat

DeepInfra, an AI cloud provider, has successfully raised $107 million in a Series B funding round aimed at expanding its AI inference cloud platform and global capacity. The investment, co-led by 500 Global and Georges Harik, attracted participation from notable investors including Nvidia, Samsung Next, and Supermicro.

This funding arrives as demand for high-throughput inference capabilities rises. DeepInfra's CEO and co-founder, Nikola Borisov, shared the company's vision on LinkedIn, highlighting the shift towards inference as a key driver of AI workloads. He noted, "Most cloud platforms weren’t built for this. So we built our platform from the ground up for high-throughput inference, optimizing for cost, performance, and security at production scale."

Founded in 2022 and headquartered in Palo Alto, California, DeepInfra aims to leverage the growing parity between open-source models and proprietary systems. With a focus on agent-based systems, the company is positioned to meet the ongoing, high-volume demand for AI processing. Currently operating from eight data centers across the United States, DeepInfra plans to expand its geographical presence, though specific new locations have not been announced.

Investment Landscape

The Series B funding round underscores a trend among investors to support advancements in AI infrastructure. With backing from significant entities like Nvidia, the investment bolsters confidence in DeepInfra's strategic direction. The company had previously raised $18 million in a Series A round completed in April 2025, demonstrating a rapid growth trajectory in a competitive market.

Technological Advancements

DeepInfra's platform is already using Nvidia's Blackwell GPUs, with plans to incorporate the upcoming Vera Rubin GPUs soon. This aligns with the company's commitment to delivering advanced technology that enhances its inference capabilities. As the AI sector continues to evolve, efficiently and securely processing large volumes of data will be critical.

See also  New Tool Ranks Local LLMs for Optimal Hardware Use

Future Outlook

Looking ahead, DeepInfra's emphasis on expanding its infrastructure and optimizing its platform for high-throughput inference positions it as a significant player in the AI cloud sector. As businesses increasingly adopt AI solutions, the demand for specialized cloud platforms designed for efficient inference is expected to rise. DeepInfra's innovative approach could set a new standard for AI cloud providers, marking a notable shift in how AI workloads are managed and executed globally.

GD

GPUBeat Desk

Desk · joined 2026

GPUBeat Desk covers AI infrastructure — chips, foundation models, inference economics, datacenter buildouts, and the geopolitics of compute.