GMI Cloud, a fast-growing GPU-as-a-Service provider tailored for AI workloads, today announced its role as one of the first contributors to NVIDIA’s DGX Cloud Lepton platform and marketplace. This collaboration connects AI developers globally to GMI Cloud’s high-performance GPU infrastructure, including NVIDIA’s latest Blackwell architecture, through a unified platform designed to simplify AI development, training, and deployment.
Unlocking Scalable, Reliable GPU Access for AI Builders
Access to dependable, high-performance GPU resources is a key challenge for AI developers. NVIDIA DGX Cloud Lepton meets this need by providing a unified, integrated platform that streamlines workflows from prototype to production. It leverages NVIDIA’s comprehensive software stack—such as NVIDIA NIM microservices, NeMo, Blueprints, and Cloud Functions—to accelerate AI innovation.
GMI Cloud’s Strategic Contribution
As an NVIDIA Cloud Partner, GMI Cloud brings:
- Direct access to NVIDIA GPU clusters optimized for cost-effectiveness, scale, and performance
- Globally distributed infrastructure with strategic regional availability for compliance and low latency
- Full-stack ownership enabling competitive pricing and operational efficiency
- Fast deployment pipelines powered by NVIDIA’s integrated software ecosystem
Starting with 16-node GPU clusters available on the marketplace, GMI Cloud supports diverse use cases from large language model (LLM) training to autonomous systems and real-time AI inference.
CEO Perspective: Building AI Without Limits
“DGX Cloud Lepton reflects everything we believe in at GMI Cloud: speed, sovereignty, and scale without compromise,” said Alex Yeh, CEO of GMI Cloud. “We built our infrastructure from the silicon up to help developers build AI without limits. This partnership accelerates that vision by giving developers unmatched access to powerful, scalable GPU resources.”
By joining NVIDIA DGX Cloud Lepton, GMI Cloud enhances the global AI ecosystem with accessible, high-performance GPU infrastructure. This partnership empowers AI builders to innovate faster, scale smarter, and deploy with confidence across diverse applications and geographies.