Sunday, November 17, 2024
Google search engine
HomeData Modelling & AIIndia’s AI Leap 🇮🇳 : 6 LLMs that are Built in India

India’s AI Leap 🇮🇳 : 6 LLMs that are Built in India

Introduction

In the world of big-league tech, where giant global players usually lead the AI race, India is making some exciting moves of its own. A whole new world of Indian-made Large Language Models (LLMs) and AI tools is starting to shine, each with its own special flair. We’re here to put these local heroes under the spotlight, showing off their cool features and groundbreaking progress. 

Ready for an adventure into the diverse and dynamic world of India’s own AI creations? Let’s jump in and discover what makes these Indian LLMs and AI tools not just smart, but truly remarkable. Fasten your seatbelts – we’re about to take a thrilling ride into the heart of India’s 🇮🇳 AI innovation!

OpenHathi

OpenHathi, with its name meaning “elephant” in Hindi, is not just a large language model, but a symbol of the growing power of Indian languages in the AI landscape. This 7B parameter model, developed by Sarvam AI, marks the first release in the OpenHathi series, designed to empower diverse applications in the Indian market. As the first publicly available Hindi Large Language Model (LLM), OpenHathi represents a pivotal moment in India’s AI evolution. 

OpenHathi | India's LLM

Key Features

  • Bilingual Training: OpenHathi leverages not just Hindi but also English and Hinglish data during training, enhancing its comprehension and generation capabilities across both languages.
  • Custom Tokenization: A unique sentence-piece tokenizer with a 16K Hindi vocabulary merges with the Llama2 tokenizer to significantly reduce tokenization overhead for Hindi text.
  • Phased Training: The model undergoes a three-phase training process:
    • Phase 1: Bilingual text translation using low-rank adapters, fostering cross-lingual understanding.
    • Phase 2: Bilingual next-token prediction with low-rank adapters, enabling context-aware language generation.
    • Phase 3: Supervised fine-tuning on internal datasets for specific tasks, tailoring the model’s ability to handle diverse applications.
  • Open-source Accessibility: The OpenHathi base model after phase 2 is publicly available via HuggingFace, allowing developers and researchers to fine-tune it for their specific needs and tasks.
  • Cross-lingual Potential: OpenHathi’s bilingual training opens doors for potential applications in cross-lingual translation, information retrieval, and other tasks that require seamless interaction between Hindi and English.

Click here to explore OpenHathi.

Tamil-LLAMA

Tamil-LLAMA is a cutting-edge large language model specifically designed for the Tamil language. Developed by Abhinand Balachandran, it builds upon the foundation of the LLaMA model but significantly enhances its capabilities in handling Tamil text.

Tamil-LLAMA | LLMs that are Built in India

Key Features

  • Enhanced vocabulary: The model’s vocabulary expands upon the original 32,000 tokens by incorporating an additional 16,000 Tamil-specific tokens, enabling more nuanced and accurate processing of Tamil language.
  • Efficient training: Leveraging the LoRA methodology, Tamil-LLAMA achieves optimal training efficiency while maintaining model robustness.
  • Multiple variations: Four variations are available: Tamil LLaMA 7B, 13B, 7B Instruct, and 14B Instruct. Each variation offers different parameter sizes and fine-tuning approaches, catering to diverse needs and computational resources.
  • Fine-tuning with focused datasets: To further refine its Tamil comprehension and generation abilities, the model undergoes additional training with a Tamil-translated version of the Alpaca dataset and a subset of the OpenOrca dataset, specifically chosen for their relevance to Tamil language tasks.
  • Open-source availability: The code, models, and datasets are all publicly available, fostering further research and development in Tamil language processing.

Overall, Tamil-LLAMA represents a significant leap forward in the field of Tamil language AI. Its combination of enhanced vocabulary, efficient training methods, focused fine-tuning, and open-source accessibility makes it a valuable tool for researchers, developers, and anyone interested in leveraging the power of AI for Tamil language applications.

Click here to explore this LLM built in India.

Krutrim

Krutrim ia an ambitious initiative from the Ola group, aims to revolutionize the AI landscape in India and beyond. It’s not just another model, but a comprehensive AI computing stack designed to empower individuals, businesses, and researchers across various domains.

LLMs that are Built in India | Krutrim

Key Pillars

  • AI Computing Infrastructure: Krutrim envisions building the hardware and software infrastructure that will power the next generation of AI applications. This includes high-performance computing resources, specialized AI accelerators, and efficient cloud infrastructure.
  • AI Cloud: Krutrim’s cloud platform will provide developers and researchers with easy access to AI tools and resources, enabling them to build and deploy AI applications without the need for extensive hardware investments.
  • Foundational Models: Krutrim is developing a suite of large language models, speech recognition systems, and computer vision models specifically tailored for the Indian market and its diverse languages and cultural nuances. These models will provide a foundation for building various AI applications.
  • AI-Powered End Applications: Krutrim’s ultimate goal is to create practical and impactful AI applications across various sectors, such as healthcare, education, agriculture, and finance. These applications will be designed to address the specific needs of India and its diverse population.

Click here to explore this LLM built in India.

Project Indus

Tech Mahindra has just unveiled a really cool project called Project Indus, which is all about making computers understand Hindi and its many dialects! It is at the forefront of a groundbreaking initiative in language technology, developing a pure Hindi Large Language Model (LLM) powered by AI. This model is notable for its substantial scale, encompassing 539 million parameters and a vast collection of 10 billion tokens from Hindi and its dialects.

The project’s ambitious goal is to build an Open Source LLM, aiming to revolutionize language technology and meet the needs of a quarter of the world’s population. This endeavor will create extensive language repositories, promising significant benefits for sectors like rural finance, retail, and logistics, thereby contributing to growth across India.

Project Indus

The initial phase of Project Indus focuses on Hindi and its 37 dialects, laying a solid foundation for future expansion. Over time, the project will incorporate additional languages and dialects, broadening its scope and impact. This initiative by Tech Mahindra is more than just a technological advancement; it’s a step towards bridging language barriers and fostering inclusivity on a global scale.

It’s set for beta testing, you can contribute here: https://www.projectindus.in/en/

Click here to explore this LLM built in India.

Bhashini

Bhashini, a landmark initiative by the Government of India, stands as a powerful answer to the digital divide within the country. Its focus transcends the scope of simply developing Large Language Models (LLMs). Instead, Bhashini represents a comprehensive, multi-faceted program aimed at democratizing internet and digital services access across various Indian languages.

Bhashini

Bhashini encompasses a diverse landscape of language technology projects, with LLM development as a crucial element. This holistic approach extends beyond individual languages, seeking to create bridgepoints between technology and India’s rich linguistic heritage. By breaking down language barriers, Bhashini envisions a future where digital inclusivity isn’t just a promise, but a lived reality for every citizen.

Bhashini’s core lies in the conviction that linguistic diversity should not be a barrier to digital empowerment. Through its various projects, it seeks to seamlessly integrate India’s diverse tongues with cutting-edge technologies. This dedication reflects a profound commitment to fostering a more inclusive digital landscape, ensuring that individuals across the country can access and utilize the full potential of the digital world.

While still in its beta phase, the Bhashini app marks a significant milestone in the program’s journey. Available for download on both Apple Store and Google Play Store, the app offers a glimpse into the transformative potential of Bhashini. As the program evolves and expands, its impact is expected to be felt across various domains, from education and healthcare to governance and economic development.

Bhashini holds undeniable potential to bridge the digital divide. Its long-term effectiveness hinges on factors like accessibility, technology development, and government support. Despite challenges, Bhashini’s ambitious vision offers hope for a future where linguistic diversity empowers in the digital age.

Click here to explore this made in India LLM.

CoRover.ai

CoRover stands out as a groundbreaking enterprise in the AI industry, boasting the distinction of being the world’s first human-centric Conversational and Generative AI platform that delivers the highest ROI. This platform is fortified with secure, scalable, and reliable patent-pending technology, encompassing AI, ML, NLP, AR, and VR. It’s versatile in its offerings, featuring Multi-Format capabilities with AI VideoBots, VoiceBots, and ChatBots, and provides Multi-Lingual support in over 100 languages, catering to a user base of over 1 billion. A key feature of CoRover is its Video-Voice Commerce Virtual Assistants, enabling complete transactional processes, including payments.

CoRover.ai |LLMs that are Built in India

The platform has further expanded its capabilities with BharatGPT, its proprietary Generative AI for text, voice, and video, which also integrates the option to use ChatGPT. CoRover’s mission is to revolutionize user-system interaction, making it as intuitive as conversing with an intelligent person. The company’s innovative strides have earned commendations from global leaders such as Microsoft’s Satya Nadella and India’s Prime Minister Narendra Modi, and it collaborates with numerous Fortune 100 companies, marking its significant impact in the AI realm.

Click here to explore this LLM made in India.

Conclusion

As we reach the end of our journey through the dynamic and inspiring realm of India’s AI innovations, it’s crucial to pause and offer a resounding round of applause to the brilliant minds and teams who are steering this remarkable revolution. From the innovative corridors of Sarvam AI with OpenHathi to the creative minds behind Tamil-LLAMA, from the visionary thinkers at Ola group for Krutrim to the tech pioneers at Tech Mahindra for Project Indus, and the dedicated officials championing Bhashini, each has contributed immensely to this rich narrative of technology and transformation.

While not all these projects are LLMs in the strictest sense, their inclusion is crucial due to their significant contributions to the field. Their unique features, from multilingual capabilities to domain-specific expertise, reflect a deep understanding of India’s multifaceted linguistic landscape. 

This story, however, is continually unfolding, and there may be chapters yet untold. If you know of other Indian LLMs or similar transformative projects that haven’t been mentioned here, let’s enrich this narrative together.

Share your insights in the comments below, and let’s celebrate the full spectrum of India’s vibrant AI landscape, a landscape where every innovation, big or small, plays a crucial role in shaping a technologically inclusive and culturally rich future.

Himanshi Singh

21 Dec 2023

I am a data lover and I love to extract and understand the hidden patterns in the data. I want to learn and grow in the field of Machine Learning and Data Science.

RELATED ARTICLES

Most Popular

Recent Comments