Region
Global
Industry
Industry Cloud Computing
InpharmD is a leading AI-driven pharmaceutical-intelligence platform that synthesizes drug data, clinical guidelines, and real-world evidence to help pharmacists, researchers, and clinicians make critical treatment decisions. Its engine continuously ingests and processes large data sets—often in batch mode—to deliver precise, timely, and customized recommendations.
To guarantee top-tier performance and availability during unpredictable surges in demand, InpharmD adopted Avesha’s Elastic GPU Service (EGS) to seamlessly burst inference workloads from its primary on-premises Kubernetes cluster to Nebius Cloud—with zero code changes and full operational transparency.
InpharmD’s inference platform depends heavily on GPUs for real-time and batch processing of complex clinical queries. Running exclusively on its on-prem cluster created three bottlenecks:
With Avesha EGS, InpharmD created a hybrid GPU workspace spanning its on-prem cluster and two Nebius regions. Each time the platform spins up a new inference endpoint, EGS automatically evaluates capacity and latency:
• If on-prem has headroom, the endpoint is deployed locally for the lowest possible latency.
• If on-prem is full, EGS transparently bursts the workload to the optimal Nebius region—no DevOps tickets, no redeploy, no downtime.
EGS offers two complementary bursting modes:
Bursting Mode | How It Works | Outcome for InpharmD |
1. CustomerManaged Workloads in EGS Workspace | InpharmD deploys its AI services into the EGS-managed Kubernetes workspace. EGS provisions GPU nodes in Nebius as needed and schedules pods to the region with the best capacity-vs-latency profile. | Instant access to Nebius GPUs without managing remote clusters or rewriting manifests. |
2. Model-Centric Bursting with AutoProvisioning | InpharmD specifies a model (e.g., Hugging Face transformer, NVIDIA NIM, proprietary clinical-NLP). EGS finds capacity, deploys the model, and scales it automatically. | Rapid R&D iterations and ondemand clinical queries—no infrastructure management required. |
“With Avesha EGS, we dynamically optimize every workload—maximizing performance without overpaying for idle resources. We scale seamlessly and keep research costs predictable.”
— Tulasee Rao Chintha, CTO, InpharmD— Tulasee Rao Chintha, CTO, InpharmD
Key Benefits for InpharmD
Strategic Impact
By shifting from a fixed on-prem model to a hybrid GPU environment that “chases capacity,” InpharmD can:
InpharmD’s success with Avesha EGS showcases the power of intelligent GPU bursting between on-prem infrastructure and Nebius specialized GPU Cloud. With frictionless workload mobility, model-aware deployment, and deep observability, EGS gives InpharmD the agility, reliability, and cost-efficiency required to lead in AI-powered pharmaceutical intelligence.