Sarvam AI Unveils Indic Dataset ‘Samvaad’: A Gateway to Indian Conversational AI

TL;DR:

  • Sarvam AI introduces “Samvaad,” a comprehensive dataset tailored for Indian conversational AI, featuring 100,000 multi-turn conversations in English, Hindi, and Hinglish.
  • The company partners with Microsoft Azure to democratize access to its Indic Voice Large Language Model (LLM), aiming to enhance the development and deployment of generative AI applications.
  • Sarvam AI unveils OpenHathi-Hi-v0.1, the first installment in the OpenHathi series of Hindi Large Language Models (LLMs), bolstering linguistic innovation for Indic languages.
  • Recent Series A funding of USD 41 million, led by Lightspeed, signals confidence in Sarvam AI’s mission to not only build open-source Indic LLMs but also foster AI-powered applications on a massive scale.

Main AI News:

Sarvam AI, a pioneering figure in the realm of artificial intelligence, has launched “Samvaad,” an expansive collection of meticulously curated datasets tailored specifically for the Indian market. This groundbreaking release comprises 100,000 impeccably crafted, multi-turn conversations, encompassing a staggering 700,000 turns across English, Hindi, and Hinglish languages.

Marking a significant milestone in the evolution of Indic-focused AI, these datasets are now readily available on the Hugging Face platform. For developers and enthusiasts entrenched in the Indic landscape, this unveiling heralds a treasure trove of invaluable resources, with the promise of even more dynamic releases looming on the horizon.

In a strategic alliance with Hugging Face, Sarvam AI extends a cordial invitation to the community, urging them to remain vigilant for forthcoming updates. 

Sarvam AI Forges Alliance with Microsoft Azure to Democratize Access to Indic Voice LLM

Sarvam AI, at the forefront of revolutionizing the landscape of language models, has entered into a groundbreaking partnership with Microsoft Azure, propelling its Indic Voice Large Language Model (LLM) onto the Azure platform. With an unwavering commitment to fostering the development and deployment of generative AI applications within the Indian milieu, this collaboration marks a pivotal moment in the journey towards precision and cost-effectiveness.

Leveraging the prowess of generative AI, Sarvam AI endeavors to cater to the diverse linguistic tapestry of India, enriching the user experience with a seamless voice-based interface. Initially debuting in Hindi, Sarvam AI’s Indic Voice LLM is poised to transcend boundaries, with plans underway to incorporate additional Indian languages while adeptly navigating colloquial language nuances.

Pioneering Advancements: Sarvam AI Propels OpenHathi-Hi-v0.1 into the Limelight

Sarvam AI continues to shatter barriers and redefine the contours of linguistic innovation with the release of OpenHathi-Hi-v0.1, the inaugural installment in the OpenHathi series of Hindi Large Language Models (LLMs). Engineered on a cost-effective framework, this model, an extension of Llama2-7B, mirrors the performance benchmarks set by GPT-3.5 for Indic languages.

Bolstered by a recent infusion of USD 41 million in Series A funding, spearheaded by Lightspeed and bolstered by Peak XV Partners and Khosla Ventures, Sarvam AI is poised to transcend conventional boundaries. With a steadfast commitment to not merely constructing open-source Indic LLMs, but to nurturing an ecosystem conducive to the proliferation of AI-driven applications on a monumental scale, Sarvam AI emerges as a beacon of innovation in the global AI landscape.

Conclusion:

Sarvam AI’s recent initiatives signify a significant stride towards revolutionizing the Indian conversational AI landscape. By offering tailored datasets, democratizing access to advanced language models, and fostering innovation in linguistic technology, Sarvam AI is poised to reshape how AI-powered applications are developed and deployed in the Indian market. With substantial funding backing its endeavors, Sarvam AI is well-positioned to drive impactful change and solidify its position as a frontrunner in the global AI industry.

Source