DeepInfra, an AI cloud provider, has raised $107 million in a Series B round to expand its AI inference cloud platform and global capacity. The round was co-led by 500 Global and Georges Harik, with participation from investors including Nvidia, Samsung Next, and Supermicro.
The funding comes as demand for high-throughput inference grows. DeepInfra's CEO and co-founder, Nikola Borisov, described the company's vision on LinkedIn, emphasizing the shift toward inference as a key driver of AI workloads. He stated, "Most cloud platforms weren’t built for this. So we built our platform from the ground up for high-throughput inference, optimizing for cost, performance, and security at production scale."
Founded in 2022 and based in Palo Alto, California, DeepInfra aims to capitalize on the narrowing gap between open-source models and proprietary systems. With its focus on agent-based systems, the company is positioned to serve the sustained, high-volume demand for AI processing that such systems generate. DeepInfra currently operates from eight data centers across the United States and plans to expand geographically, although it has not yet disclosed specific new locations.
Investment Landscape
The Series B round reflects a broader trend of investors backing AI infrastructure. Participation from companies such as Nvidia signals confidence in DeepInfra's strategic direction. The company previously raised $18 million in a Series A round completed in April 2025, which points to rapid growth in a competitive market.
Technological Advancements
DeepInfra's platform already uses Nvidia's Blackwell GPUs, and the company plans to adopt the upcoming Vera Rubin GPUs. This is consistent with its aim of using advanced hardware to improve its inference capabilities. As the AI sector evolves, the ability to process large volumes of data efficiently and securely will be critical.
Future Outlook
Looking ahead, DeepInfra's focus on expanding its infrastructure and optimizing its platform for high-throughput inference positions it as a significant player in the AI cloud sector. As businesses increasingly adopt AI, demand for specialized cloud platforms designed for efficient inference is expected to grow. DeepInfra's approach could set a new standard for AI cloud providers and change how AI workloads are managed and executed globally.


