Field composites

Composite 01 · AI Startup Founder

Karthik 32 years · Bengaluru · Indic-NLP startup · Bhashini-adjacent

Karthik's eight-person startup builds NLP tooling for Indic languages — Kannada, Tamil, and Marathi initially — with a Bhashini API integration for speech recognition. His model-training pipeline needs ~400 GPU-hours per training run, currently rented at market rates from a hyperscaler at ₹180–220 per GPU-hour. He applied for IndiaAI Compute Portal subsidised access in Q1 2026, targeting the stated subsidised rate.

The composite-modelled friction: the portal approval queue runs longer than the 45-day SLA in the scheme documents. The subsidised-rate portal does not yet price-publish H200 separately from H100. Karthik's training runs are H100-optimised; the allocated node is an AMD MI300X-comparable. Retooling the pipeline costs two engineering weeks. The compute is technically "delivered." The utility is discounted.

The IndiaAI dataset platform lists 140+ datasets. Karthik's team evaluates 22 relevant to Indic NLP. Eleven are metadata-only — no download available. Six have restrictive non-commercial licenses incompatible with his SaaS revenue model. Five are usable. The platform's promise and the usable-data inventory are two different numbers.

Failure mode: access lag + pricing gap + dataset usability mismatch
Composite 02 · CPaaS Compute-Provider Bidder

Suman 40 years · Mumbai · Compute-provider · Yotta + Tata comparable

Suman leads the AI infrastructure division of a mid-scale Indian datacenter operator bidding on the IndiaAI CPaaS tender. The bid covers 2,000 H100-equivalent GPU nodes across two facilities — one in Pune, one in Navi Mumbai. The capex per H100 server cluster is approximately ₹3.2–3.8 crore per 8-GPU node, or ₹80–95 crore for 200 nodes as a first tranche.

The bidding economics require a utilisation guarantee from MeitY of at least 65% to service the debt on the capex. The tender document references a 70% guaranteed-floor commitment for empanelled providers. The composite-modelled gap: the guarantee is conditioned on IndiaAI portal demand materialising within 18 months of empanelment. The demand-side signal — how many Karthiks actually convert from application to paid compute hour — is not yet disclosed.

Power and cooling are the physical constraint. H100/H200 clusters require 30–40 kW per rack. Suman's Navi Mumbai facility is rated for 22 kW per rack under the current power-purchase agreement. Upgrading to 35 kW requires a MSEDCL capacity augmentation that runs 9–14 months in approval. The GPU procurement lead-time from NVIDIA (or AMD's MI300X equivalent) is 6–9 months from confirmed PO. These two timelines do not arrive together.

Failure mode: capex exposure + utilisation risk + power-cooling timing mismatch
Composite 03 · Senior MeitY Policy Officer

Vikram 45 years · New Delhi · IndiaAI programme delivery lead

Vikram's mandate: translate the ₹10,372 crore cabinet outlay into operational compute, dataset infrastructure, and foundation-model grants. He manages three parallel tracks — CPaaS empanelment, the dataset platform curation pipeline, and FMaaS (Foundation Model as a Service) disbursement to research institutions.

Three structural items from the modelled scenario surface the delivery gap:

  • The IndiaAI Compute Portal's "GPU-hours available" dashboard shows allocated capacity, not live-deployed capacity. The gap between empanelled provider capacity and GPUs physically racked and powered is not currently a published metric.
  • Milestone disbursement for CPaaS providers is triggered on "infrastructure readiness certification," defined as facility audit + network connectivity + OS layer. GPU-on-floor count and actual GPU-hour delivery to startups are downstream of disbursement, not conditions for it. The accountability link between rupee out-the-door and compute delivered to beneficiary is structurally weak.
  • Foundation-model grant winners retain IP on base weights but must license derivative fine-tunes under "open-access" terms if trained on public IndiaAI dataset platform data. The definition of "open-access" in the grant agreement — whether it permits commercial sublicensing — is ambiguous in the current draft terms reviewed by industry bodies.
Failure mode: milestone disbursement decoupled from delivery + IP ambiguity + GPU-count opacity