Trending
Building pay-per-intelligence for AI agents: How Ampersend uses Amazon Bedrock AgentCore Payments Microsoft proposes ratepayer-protection tariff in Nevada PLDT files to establish and float data center REIT in Philippines DCD Podcast – What data centers should expect from the next UK Prime Minister Karis eyes potential data center development outside Chicago, Illinois Top spy agencies say AI cyber threats will impact you within months. Here’s why Data Centers Take Training into Their Own Hands Amid Talent Shortages DataBank files for 200MW data center campus outside Atlanta, Georgia Gigawatt-scale data center campus proposed in Kansas 87-acre ‘Project Tallmadge’ to be built in Strasburg, Virginia Running ComfyUI workflows on Amazon SageMaker AI processing jobs Centuria Capital Group raises AU$300m in equity for ResetData AI cloud business AI-Native Leaders: The Organizational Playbook for Engineering Transformation at Scale New chip could help tiny robots traverse complex environments The $400 million machine powering the future of chipmaking

QumulusAI’s $124M Deal Spotlights AI Infrastructure’s Utilization Challenge

Qumulus’ $124M Deal Highlights the Next Hurdle in AI Infrastructure. 3 Minute Read. As workloads shift from training to production inference, idle resources can be the most expensive issue, according to Getty.

Since the year 2024, companies offering AI infrastructure have been striving to accommodate a greater number of GPUs on their premises. They are confronted with a new challenge: finding ways to keep them occupied. According to QumulusAI, it has secured over $124 million in three-year AI infrastructure deals related to Nvidia Blackwell implementations, such as a partnership with AI cloud service provider Hyperbolic.

The agreements focus on inference tasks, where the economic factors typically rely more on the effective utilization of GPU resources rather than acquiring them. “Ensuring the largest and most adaptable clusters were the main concern,” QumulusAI CEO Mike Maniscalco explained to Data Center Knowledge. Today, an increasing number of customers are keen on deploying models in large-scale production while still desiring the adaptability to perform minor training or adjustments on the same infrastructure.

The alteration in consumer behavior extends far beyond AI cloud providers.

 

Join the conversation

Your email address will not be published. Required fields are marked *