The New Digital Commons: Building India’s Open AI Public Goods

Contents

Executive Summary

This panel discussion explores how open AI platforms can function as digital public goods in India and the Global South, with emphasis on democratizing AI access, building inclusive solutions for underserved populations, and creating sustainable governance models. The speakers demonstrate through real-world implementations—from farmer chatbots serving 10 million queries to multilingual speech systems—that open, collaborative AI infrastructure can deliver measurable social impact while reducing dependency on proprietary models.

Key Takeaways

Open AI platforms must combine technical openness, quality assurance, institutional capacity-building, and governance frameworks. Openness without benchmarking, auditing, and trust-building yields neither innovation nor impact.
Knowledge and platform modularity drive exponential diffusion. Once a solution is packaged as reusable infrastructure (as with Mahavisttar), deployment time shrinks from months to weeks, enabling rapid scaling across regions and populations.
The success of AI-as-public-good hinges on institutional alignment, not just model availability. Government departments, civil society, and enterprises must be educated, incentivized, and held accountable—requiring cultural and organizational change, not just technology transfer.
India's DPI legacy (Aadhaar, UPI) provides a proven template for AI infrastructure. Lessons on data privacy, interoperability, and federated governance directly apply to building trustworthy, inclusive AI systems for the Global South.
Startup survival will depend on impact and distribution, not IP or model access. As powerful models become commoditized, competitive advantage shifts to understanding user needs, building distribution networks, and creating tangible economic or social value—not to owning foundational models.

Key Topics Covered

Open AI platforms as digital public infrastructure (DPI) — positioning AI as fundamental infrastructure like electricity, roads, and ports
Multilingual and multimodal AI systems — addressing language barriers and contextual nuances in agriculture, healthcare, and governance
Data democratization and the data economy — models for data access, ownership, and compensation in open ecosystems
Institutional capacity building — moving from pilots to production-grade adoption through education, trust-building, and organizational alignment
India's DPI legacy — applying lessons from Aadhaar and UPI to AI infrastructure
The India AI Mission's seven chakras — human capital, inclusion, trust, resilience, science & innovation, resources, and social good
Governance of open platforms — ensuring accountability, transparency, and preventing surveillance in government-led AI systems
Startup viability in an AI-commoditized landscape — what business models will survive when powerful models are accessible to all
Public-private partnerships — mechanisms for collaboration between government, civil society, tech, and enterprises
Digital divide and accessibility — ensuring marginalized populations benefit from AI advancements

Key Points & Insights

Open ≠ Complete; Quality & Safety Essential
- Openness requires accompanying benchmarks, evaluations, and quality assurance. Praep Gupta emphasized that uploading 650 models and 250 datasets to Hugging Face is insufficient without running them through rigorous benchmarks so "the entire world can see the quality of that openness."
The Three Layers of Open AI Ecosystems (Jibu's Framework)
- Layer 1: Open model ecosystem (Falcon, Swiss models, apertures) reducing adoption barriers
- Layer 2: Open infrastructure & public platforms (e.g., Albert for French bureaucrats)
- Layer 3: Scientific research collaboration (MONAI, MIA for medical imaging)
Data, Evaluation, and Models as Interdependent Components
- Venit Singh demonstrated that foundational models lack domain-specific data (e.g., agricultural terminology in Hindi). The solution requires: open datasets → open evaluation frameworks → fine-tuned open models using LoRA transformers.
Production-Grade Adoption Requires Institutional Trust, Not Just Technical Openness
- Tanvi Lal's Mahavisttar project showed that government departments must be incentivized and trained to implement solutions. Trust develops over time through education, risk management protocols, and demonstrated accountability—not a one-time switch.
Exponential Scaling Through Knowledge Reusability
- Mahavistar took 9 months to launch; the Ethiopia adaptation took 3 months; Gujarat dairy farmer rollout (36 million farmers) took 3 weeks. This 100x acceleration demonstrates that knowledge and platform modularity enable rapid diffusion.
Cost of Adoption Can Drop by 10x with Open Systems
- Digital Green reduced the cost of agricultural practice adoption from $35 to ~66 US cents per query through open voice/multilingual AI. This directly enables inclusion of small-holder farmers in the Global South.
Trust ≠ Trustworthiness; Verification & Public Auditing Are Missing
- Jibu warned that many celebrated pilot programs lack independent auditing. Claims about impact remain unverified, and there is often misalignment between what is claimed and ground reality.
Governance & Data Privacy Must Prevent Surveillance While Enabling PPPs
- Shavant Taluru stressed that government AI platforms should facilitate rather than surveil. Data traceability restrictions are essential to build private-public partnerships, drawing from UPI's success with data privacy as an architectural pillar.
Data Economy Models Are Emerging (Incentive-Based, Not Just Open-Source)
- Crowdsourcing models compensate frontline workers for voice/text data contributions. Kar (not-for-profit) partners with healthcare workers who receive payment per voice note, aligning incentives across the ecosystem.
Inclusion Means Reaching the Bottom 30% Without Fast Internet
- Jibu's responsible computing initiatives address the fact that 70% of India lacks reliable broadband. Web accessibility projects (84 websites reaching 1M users) and tribal women's self-help training (7,000 participants) demonstrate impact-focused inclusion beyond technology deployment.

Notable Quotes or Statements

Praep Gupta (Nvidia): "AI will become that fundamental infrastructure which every country will need... Being open is super critical because if you keep the systems closed, you cannot really drive the innovation."
Tanvi Lal (People + AI): "Openness means collaboration across civil society, tech, government and all the other bodies. We will only grow if the ecosystem learns from each other."
Venit Singh (Digital Green): "The cost of adoption went down by 10x... and what we are seeing right now is 66 cents in US dollars per query. That gives real agency and inclusion."
Shavant Taluru (Afar AI): "The government shouldn't get into surveillance. It should be more about facilitating. Traceability of data otherwise [would prevent] private-public partnerships from taking off."
Jibu (Mozilla Foundation): "Openness doesn't always translate into trustworthiness. Trustworthiness only happens when these systems are properly contested, audited, and governed with the public interest in mind."
Moderator (UNDP): "If India is leapfrogging towards making AI as public good, we are not just doing it for the 1.4 billion people in India but we are setting a powerful template for the global south."

Speakers & Organizations Mentioned

Speaker	Role/Organization	Key Focus
Praep Gupta	VP Industry Solutions & Architecture, Nvidia	AI infrastructure, compute acceleration, open-source model ecosystem (Neatron)
Tanvi Lal	Director of Strategy, People + AI (Astep Foundation)	Human-centered AI, policy design, Mahavisttar agricultural platform
Venit Singh	CTO, Digital Green Foundation	Agricultural chatbots, multilingual AI for farmers (Farmer Chat: 10M queries, 1M downloads)
Shavant Taluru	CEO, Afar AI	Generative AI, immersive tech, unified AI interface (India AI Mission), DPI layer
Jibu	Fellow, Mozilla Foundation	Open internet, digital rights, responsible AI, accessible AI literacy
Moderator	UNDP Regional Team	Bridging policy and people; focusing on public goods perspective

Organizations/Initiatives Mentioned:

Astep Foundation (founded by Nandan & Rohini Nilekani, Shankar Marwaha) — creators of Aadhaar, UPI
Digital Green Foundation — 1.5M+ farmers served, North Africa operations
Nvidia (Inception Program, Neatron initiative)
Mozilla Foundation (Common Voice, Mozilla AI, Responsible Computing Challenge)
India AI Mission (seven chakras framework)
Mahavisttar project (20+ partner organizations, Maharashtra Department of Agriculture)
Kar (not-for-profit; healthcare worker incentive model)
IUDX (Indian Urban Data Exchange, IISC, Ministry of Urban Affairs)
MOSPI (Ministry of Statistics & Program Implementation) — MCP server for ESAN portal
NIT Warangal (tribal AI literacy training)
Hugging Face (model repository, benchmarking)

Technical Concepts & Resources

AI Models & Architectures:

Neatron (Nvidia) — open-source models with accompanying datasets, recipes, and tools
LLaMA (Meta) — foundational open-source language model
Deepseek, Kiwi — emerging open-source models
Falcon (UAE) — example of domestic open-source model reducing adoption barriers
ASR/TTS Systems — AI for Bharat's speech models (relevant to Indian languages)
Fine-tuning via LoRA (Low-Rank Adaptation) — efficient method for adapting open models to domain-specific tasks

Datasets & Data Platforms:

Common Voice (Mozilla) — open-source speech dataset to democratize speech technology
Mozilla Data Collective — recent initiative for responsible data sharing
AI-ready data systems — focus on making statistical and administrative data accessible for AI
ESAN Portal (MOSPI) — government statistical datasets published via MCP server

Tools & Infrastructure:

Hugging Face — repository for models and datasets with benchmarking; 650 Nvidia contributions in prior year
IUDX (Indian Urban Data Exchange) — interoperable government data pipeline (Pune smart city example)
Web accessibility conversion tools — 84 websites, 1M users (Jibu's project)
Multimodal, multi-turn conversational systems — core design pattern for Farmer Chat

Research/Evaluation Frameworks:

Benchmarking & evaluation standards — essential to ensure quality of open-source contributions
Medical imaging collaboration platforms (MIA)
Clinical research collaboration platforms (MONAI)
Responsible computing initiatives — university-based capacity building (NIT Warangal, tribal women's groups)

Key Methodological Insights:

Sandboxing & vulnerability testing — responsibility of builders on open-source; essential before production deployment
Prompt engineering — user-level skill development to maximize model utility
Cost-per-query metrics — measurable adoption and sustainability indicator (66 cents/farmer query in India)
Multi-turn conversation design — enables user feedback, error correction, and iterative trust-building

Policy & Governance Implications

Institutional alignment over technology transfer — governments must invest in organizational change, staff training, and accountability mechanisms alongside technology deployment
Data privacy as architectural principle — prevent surveillance while enabling PPPs (from UPI model)
Public auditing of pilot programs — claimed impact should be independently verified; many current implementations lack validation
Incentive-based data economy models — compensating contributors (farmers, healthcare workers) for data rather than only open-source approaches
Federated governance for DPI — multiple stakeholders (government, civil society, private sector, academia) co-design and co-govern systems
Digital divide as equity issue — offline or low-bandwidth accessibility essential for inclusion of bottom 30% in India

Word Count: ~2,900 | Timestamp: Conference talk discussing India's AI public goods ecosystem and global South applicability