All sessions

Bharat’s Sovereign AI: Scaling Innovation for a Billion+

Contents

Executive Summary

This AI summit session addresses India's strategic imperative to develop sovereign AI infrastructure and capabilities, framing sovereignty not as data localization alone but as intentional control across the entire technology stack—from hardware and compute to models and applications. The session features expert panels on infrastructure requirements and startup showcases demonstrating practical AI solutions built in India for India and the Global South, emphasizing that sovereign AI is essential for competing globally while serving domestic needs at scale.

Key Takeaways

  1. Sovereignty is Multi-Layered & Technical: Sovereignty extends beyond data to include hardware choice, cloud platform control, software ownership (open-source), model development, and metadata governance. A "kill switch" can exist at any layer—intentional design at every layer is mandatory.

  2. Energy & Capital Are the Binding Constraints: Physical infrastructure (power generation, cooling, networking) and financial infrastructure (risk capital for startups) are currently more limiting than technical capability. Solve these, and compute scaling becomes possible.

  3. Global South Cooperation is an Alternative to Isolation: Federated learning and differential privacy enable India to lead collaborative AI development in emerging economies without compromising data sovereignty—repositioning India from dependency to partnership.

  4. India Has Solved Diversity at Scale (Language, Geography, Population); Now Apply It to AI: With 55+ Indian languages, 1.5B population, 60% agricultural workforce, and presence across tier 1–3 cities, India's AI models must reflect this complexity. Western horizontal models don't solve these problems; vertical, custom models do.

  5. The Strategic Window is Now: Government support through India AI Mission, improving capital availability, and clear policy frameworks have created a rare moment for India to build sovereign AI infrastructure and avoid long-term technological dependency. Decisions made in the next 1–2 years will determine India's AI trajectory for decades.

Key Topics Covered

  • Sovereignty Definition & Framework: Data localization, infrastructure ownership, compute control, software independence, and model development as interconnected pillars
  • Compute Infrastructure & Heterogeneous Hardware: The need for diverse hardware options (not locked into single vendors) to avoid dependency and enable flexible workload deployment
  • Energy & Power Infrastructure: Nuclear SMRs/BSRs, on-site power generation, and sustainable energy as foundational constraints for scaling data centers
  • Network & Connectivity: Role of 5G/6G, undersea cables, edge computing, and latency optimization for distributed AI deployment to tier 2/3 cities
  • Global South Cooperation Model: Federated learning, differential privacy, and collaborative frameworks for knowledge sharing without compromising data sovereignty
  • Risk Capital & Ecosystem Development: Access to venture capital and private equity essential for scaling AI startups beyond infrastructure layer
  • Quantum Computing Integration: Long-term vision for QPU integration with existing HPC platforms (5–10 year timeline)
  • Responsible AI Governance: Need for global conventions and multi-disciplinary frameworks (contrasted with current fragmented company-level guardrails)
  • Startup Solutions: Real-world implementations in multilingual AI, industrial automation, and foundation models built under India AI Mission

Key Points & Insights

  1. Sovereignty is Intent-Based, Not Checklist-Based: Tarun (E2 Networks) emphasizes that true sovereignty requires intentional design decisions at every layer—data localization alone is insufficient. It demands control over data centers (Indian ownership), compute infrastructure, cloud operators, software stacks, and increasingly, model development itself.

  2. Hardware Heterogeneity = Strategic Independence: Locking into a single hardware type (e.g., GPUs only) creates overdependence. India needs access to diverse hardware with standardized APIs and programming models so developers can choose the right compute for each workload without proprietary lock-in.

  3. Energy is the Binding Constraint for Scaling: Multiple panelists identify power generation and cooling as the single most critical infrastructure decision for the next 1–2 years. On-site nuclear power (SMRs/BSRs) at data center parks could eliminate reliance on grid infrastructure and enable 24/7 operation at high reliability.

  4. Networks Are the "Invisible Layer": Neil (Tata Communications) underscores that network connectivity—via undersea cables, 5G/6G, and edge compute POPs—is as foundational as data centers themselves. Sub-10ms latency requirements for emerging use cases (robotic surgery, real-time inference) necessitate distributed edge infrastructure.

  5. Federated Cooperation Model for Global South: Rather than shutting borders entirely, India can share model weights and apply differential privacy to collaborate regionally (e.g., agricultural and healthcare models) while keeping raw data local. This enables leadership in Global South partnerships without compromising sovereignty.

  6. Control Plane & Metadata Are Often Overlooked: Neil highlights that while user data may reside in India, orchestration, provisioning, and metadata can still be controlled externally. Full sovereignty requires the entire cloud management platform (Kubernetes, OpenStack) to be open-source and under Indian control.

  7. Compute Constraints Are Rapidly Resolving: Tarun notes that past constraints on capital availability and policy support have been significantly eased by government backing (favorable taxation, data center policies). India now has the ability to build AI compute capacity "very rapidly."

  8. Quantum Security Threats Require Proactive Solutions: While quantum computing remains 5–10 years away from practical deployment, panelists warn that existing cryptographic security will break at scale. Solutions must be designed now to avoid retroactive vulnerabilities.

  9. Risk Capital Depth Is the Next Bottleneck: While data center investment is attracting capital, the startup ecosystem lacks sufficient venture and private equity funding to scale AI innovation companies beyond infrastructure layer. This gap threatens long-term competitiveness.

  10. "People with AI Will Replace People Without AI": Sorav (Shuna Labs) reframes the AI displacement debate—the issue is not whether AI will automate tasks, but ensuring all communities have access to AI tools in their own languages and contexts. Neglecting long-tail languages creates AI haves/have-nots divides.


Notable Quotes or Statements

"Sovereignty is not a checklist... it's about intent. How do we intend to remain in control of our destiny?" — Tarun, E2 Networks

"Sovereignty in one word is to ensure that you don't have a kill switch with somebody else who can turn off the knob." — Neil Kant Wanker, Tata Communications

"It's people with AI who are going to replace people without AI." — Sorav Bandupad, Shuna Labs

"When you control the foundation, you control the trajectory." — Rohitas Sharma, Socket AI Labs (referring to foundation models)

"Compute is extremely important, but we have to bring in innovation as well... We developed this with four GPUs." — Sorav Bandupad (on building multilingual translation models efficiently)

"The genie is already out of the bottle. There is no controlling it. The point is getting to a point where we can compete against the world." — Ravindra Kumar, Technodate AI (on industrial automation necessity)


Speakers & Organizations Mentioned

Panelists

  • Tarun — E2 Networks (Indian cloud infrastructure provider)
  • Lakshmi — [Organization/role not explicitly stated in transcript] (Infrastructure/hardware focus)
  • Neil Kant Wanker — VP & Global Head of Cloud AI & Edge, Tata Communications
  • Amit — Session moderator

Startup Founders

  • Sorav Bandupad — Founder & Chief Scientist, Shuna Labs (multilingual AI, speech-to-text)
  • Ravindra Kumar — Founder & CEO, Technodate AI (industrial automation co-pilot)
  • Rohitas Sharma — Head of Business & Partnerships, Socket AI Labs (foundation models)

Government & Industry Bodies

  • NASCOM — Supporting open-sourcing of multilingual models
  • Google Vani Project — Collecting long-tail language data across Indian geographies
  • India AI Mission — Government initiative providing funding and strategic direction
  • Indian Air Force — Potential deployment partner for maintenance co-pilot systems

Companies & Projects Referenced

  • Tata Communications — Submarine cable infrastructure, edge compute networks
  • Fujitsu — Quantum computing (64-qubit mockup on display)
  • Shuna Labs — 55-language real-time translation model (55M parameter, open-weights)
    • Pingala V1 — ASR model supporting 204 languages (lowest word error rate globally)
  • Technodate AI — Agentic AI co-pilot for industrial automation (concept design, programming, troubleshooting)
  • Socket AI LabsProject Aar (120B-parameter open-source multilingual foundation model)
    • Aar Code — Math and code foundation model (in-house secure development)
    • Aar Defense — Mission-critical intelligence for defense/infrastructure

Technical Concepts & Resources

AI Models & Projects

  • Project Aar — 120B-parameter open-source multilingual foundation model (India AI Mission)
  • Aar Code — Foundation model for secure code generation and AI-assisted engineering
  • Pingala V1 — Multilingual ASR (Automatic Speech Recognition) model, 204 languages
  • 55-Language Real-Time Translation Model — 1.3B parameters, open-weights, developed with 4 GPUs

Infrastructure & Hardware

  • Heterogeneous Hardware: GPUs, CPUs, QPUs (quantum processing units), specialized accelerators
  • Edge Computing POPs (Points of Presence): Distributed GPU inference for real-time, low-latency workloads
  • Nuclear SMRs/BSRs (Small Modular Reactors / Bharat Small Reactors): On-site power generation for data centers
  • 5G/6G Connectivity: Sub-10ms latency targets (5G); improved stability, cost efficiency (6G)
  • Undersea Submarine Cables: Tata Communications network infrastructure for inter-continental data transport

Software & Platforms

  • OpenStack — Open-source cloud management platform (sovereignty-aligned alternative to proprietary cloud)
  • Kubernetes — Open-source container orchestration (avoids vendor lock-in)
  • PyTorch/MLflow — Standard ML development frameworks
  • Hugging Face — Model registry for open-weights distribution
  • Open-Source Leaderboards — Verifiable ASR/NLP benchmarks (not proprietary)

Methodologies & Concepts

  • Federated Learning — Decentralized and centralized variants for collaborative AI without centralizing data
  • Differential Privacy — Technique to share insights while protecting individual data points
  • Agentic AI / AI Agents — Multi-step reasoning systems for design, programming, troubleshooting (Technodate use case)
  • Foundation Model Development — Vertical, task-specific models vs. horizontal commodity models (Shuna Labs approach)
  • 3D Coordinates & Physical AI — Translating AI instructions to real-world robotic/CNC operations (Technodate)

Data Resources

  • Google Vani Project — Open dataset of long-tail Indian language speech data (district-level collection)
  • NASCOM Support — Open-weight model distribution and governance framework
  • Longtail Languages Dataset — Addressing languages with no prior coverage in AI

Emerging Concerns

  • Quantum Cryptography Threats — Post-quantum security solutions needed proactively (5–10 year horizon)
  • Kill Switches & API Restrictions — API rate limits, policy changes, sanctions, licensing terms as sovereignty risks
  • Token Tax — Per-token pricing models that incentivize external dependency vs. on-premise deployment

Policy & Governance Insights

  • India AI Mission: Provides strategic funding, compute allocations, and ecosystem support for sovereign AI development
  • Global AI Conventions: Currently absent; fragmented company-level governance (ethics offices, guardrails) insufficient; multi-disciplinary, multi-country framework needed (analogy: UN Law of the Sea)
  • Data Center Taxation Policy: Government clarity on tax treatment attracting global compute operators while maintaining sovereignty
  • Defense & Critical Infrastructure: Outsourcing intelligence to foreign systems is not strategically viable; domestic capability (Aar Defense) is foundational

End of Summary