Overview
Tech Mahindra’s Indus GenAI solution empowers all Indic languages that have originated out of the great Indus Civilization. Tech Mahindra, with use of AWS powerful platform and services, have sought to build an Open-Source Large Language AI model to serve the needs of 25% of the world's population.
The Indus Project uses AWS S3 and EC2 instances and intends to make India based Foundational Model in Indic languages. In the current version Indus LLM supports Hindi and its 37 dialects.
AWS-Powered LLM model
The solution's robust architecture is deployed on AWS cloud platform and uses S3 and EC2 services for allowing full flexibility to run the solution.
- High performance GPU based instance: The model uses AWS EC2 G5 instances for graphic intensive application and ML inference.
- User Management and Content Delivery: Utilizes Amazon Cognito for secure authentication
- Storage Powerhouse: Application also uses Amazon S3 for scalable content storage
- Database Services: Solution uses RDS which makes it easier to setup, operate and scale relational database.
Key Features:
- Built Indus LLM from scratch which is 1.2 Bn parameter model in Hindi and its 37 dialects
- Built from 22 Bn tokens using cleaned Hindi and Dialects data.
- Most economical, sustainable and ethical model
- No spurious token has only Hindi and its dialects.
- Features above Llama 2.0 model on Indic LLM Leaderboard for abstract reasoning.
- Indus was deployed on AWS Sagemaker and tested for Inference to a decent performance.
- NLP Toolkit- This toolkit is used for cleaning of any data other than Hindi and also transliterates dates and English text. This also removes bad words
- Bias Removal Toolkit- We have annotated data created for bias and non bias. A cosine difference mechanism is applied as a classifier to find and remove biased sentences
- Bias Removal Dashboard- Dashboard to represent biases removed
Benefits:
- Helps to bridge digital gap for Hindi speaking masses of rural India Enables Indic Digital Transformation
- Can be finetuned for specific use cases to cater to the needs of native Hindi users
- Enables Digital transformation in Indic languages
Highlights
- India’s first grounds up LLM focused on Hindi and its 37 Dialects
Details
Unlock automation with AI agent solutions

Pricing
Custom pricing options
How can we make this page better?
Legal
Content disclaimer
Support
Vendor support
Please reach out to DL (AI.alliances@techmahindra.com ) for further communication