DeepInfra, an AI cloud provider, has successfully raised $107 million in a Series B funding round aimed at expanding its AI inference cloud platform and global capacity. The investment, co-led by 500 Global and Georges Harik, attracted participation from notable investors including Nvidia, Samsung Next, and Supermicro.
This funding arrives as demand for high-throughput inference capabilities rises. DeepInfra's CEO and co-founder, Nikola Borisov, shared the company's vision on LinkedIn, highlighting the shift towards inference as a key driver of AI workloads. He noted, "Most cloud platforms weren’t built for this. So we built our platform from the ground up for high-throughput inference, optimizing for cost, performance, and security at production scale."
Founded in 2022 and headquartered in Palo Alto, California, DeepInfra aims to leverage the growing parity between open-source models and proprietary systems. With a focus on agent-based systems, the company is positioned to meet the ongoing, high-volume demand for AI processing. Currently operating from eight data centers across the United States, DeepInfra plans to expand its geographical presence, though specific new locations have not been announced.
Investment Landscape
The Series B funding round underscores a trend among investors to support advancements in AI infrastructure. With backing from significant entities like Nvidia, the investment bolsters confidence in DeepInfra's strategic direction. The company had previously raised $18 million in a Series A round completed in April 2025, demonstrating a rapid growth trajectory in a competitive market.
Technological Advancements
DeepInfra's platform is already using Nvidia's Blackwell GPUs, with plans to incorporate the upcoming Vera Rubin GPUs soon. This aligns with the company's commitment to delivering advanced technology that enhances its inference capabilities. As the AI sector continues to evolve, efficiently and securely processing large volumes of data will be critical.
Future Outlook
Looking ahead, DeepInfra's emphasis on expanding its infrastructure and optimizing its platform for high-throughput inference positions it as a significant player in the AI cloud sector. As businesses increasingly adopt AI solutions, the demand for specialized cloud platforms designed for efficient inference is expected to rise. DeepInfra's innovative approach could set a new standard for AI cloud providers, marking a notable shift in how AI workloads are managed and executed globally.



