
Overview
This model, "Mistral-7B-Retail-Banking-v1", is a fine-tuned version of the "Mistral-7B-Instruct-v0.2" model, tailored for the Retail Banking domain. It is optimized to answer questions and assist users with banking transactions. It was trained on hybrid synthetic data generated with our NLP/NLG technology and our automated Data Labeling (DAL) tools.
The goal of this model is to show that a generic verticalized model makes customization for a final use case much easier. For example, if you are "ACME Bank", you can create your own customized model by starting from this fine-tuned model and fine-tuning it further on a small amount of your own data. An overview of this approach can be found at https://www.bitext.com/two-step/
Highlights
- Intended Use:
  - Recommended applications: This model is designed to be used as the first step in Bitext’s two-step approach to fine-tuning LLMs for the Retail Banking domain, providing customers with fast and accurate answers about their banking needs.
  - Out-of-scope: It should not be used for non-banking related inquiries or for providing advice on medical, legal, or critical safety issues.
- The model was trained using a dataset designed for Retail Banking interactions, now publicly available on Hugging Face at https://huggingface.co/datasets/bitext/Bitext-retail-banking-llm-chatbot-training-dataset. This dataset comprises 26 different intents such as check_balance, transfer_money, open_account, and more, each with around 1000 examples.
- The training dataset includes:
  - 25,545 question/answer pairs
  - 4.98 million tokens
  - 1224 entity/slot types
- Each entry consists of:
  - Instruction: User request
  - Category: High-level semantic category
  - Intent: Specific intent of the user request
  - Response: Example response from a virtual assistant
- The dataset covers a wide range of banking-related categories such as ACCOUNT, ATM, CARD, CONTACT, FEES, FIND, LOAN, PASSWORD, and TRANSFER, ensuring comprehensive training for handling diverse retail banking queries.
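As an illustration of the entry structure described above, the following minimal Python sketch models one dataset entry and counts examples per intent. The class name, the helper function, and the sample rows are hypothetical and written only for this sketch; the rows are not copied from the dataset, and the category assigned to each sample intent is a guess.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class BankingExample:
    """One dataset entry, using the four fields named in the listing."""
    instruction: str  # user request
    category: str     # high-level semantic category, e.g. "TRANSFER"
    intent: str       # specific intent, e.g. "transfer_money"
    response: str     # example response from a virtual assistant


def examples_per_intent(examples):
    """Count how many examples each intent has."""
    return Counter(example.intent for example in examples)


# Hypothetical rows invented for this sketch; not taken from the dataset.
SAMPLE = [
    BankingExample("How much money do I have?", "ACCOUNT", "check_balance",
                   "I can help you check your balance."),
    BankingExample("Send money to my savings account", "TRANSFER",
                   "transfer_money", "Sure, let's set up the transfer."),
    BankingExample("I need to move funds", "TRANSFER",
                   "transfer_money", "Happy to help with your transfer."),
]

counts = examples_per_intent(SAMPLE)

# Figures stated in the listing: 25,545 pairs spread over 26 intents.
average_per_intent = 25_545 / 26
```

The last line works out to 982.5 examples per intent on average, which is consistent with the "around 1000" figure quoted for each intent.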
Details
Pricing
| Dimension | Description | Cost/host/hour |
|---|---|---|
| ml.g5.2xlarge Inference (Real-Time), Recommended | Model inference on the ml.g5.2xlarge instance type, real-time mode | $0.00 |
| ml.p3.2xlarge Inference (Batch), Recommended | Model inference on the ml.p3.2xlarge instance type, batch mode | $0.00 |
| ml.g4dn.4xlarge Inference (Real-Time) | Model inference on the ml.g4dn.4xlarge instance type, real-time mode | $0.00 |
| ml.p3.2xlarge Inference (Real-Time) | Model inference on the ml.p3.2xlarge instance type, real-time mode | $0.00 |
| ml.g4dn.8xlarge Inference (Real-Time) | Model inference on the ml.g4dn.8xlarge instance type, real-time mode | $0.00 |
| ml.g5.xlarge Inference (Real-Time) | Model inference on the ml.g5.xlarge instance type, real-time mode | $0.00 |
| ml.g4dn.xlarge Inference (Real-Time) | Model inference on the ml.g4dn.xlarge instance type, real-time mode | $0.00 |
| ml.g4dn.2xlarge Inference (Real-Time) | Model inference on the ml.g4dn.2xlarge instance type, real-time mode | $0.00 |
| ml.g5.4xlarge Inference (Real-Time) | Model inference on the ml.g5.4xlarge instance type, real-time mode | $0.00 |
| ml.p3.8xlarge Inference (Batch) | Model inference on the ml.p3.8xlarge instance type, batch mode | $0.00 |
Vendor refund policy
This product is offered for free. If you have any questions, please contact us.
Delivery details
Amazon SageMaker model
An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
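A rough sketch of creating a real-time endpoint from the model package might look like the following. It assumes the SageMaker Python SDK with its v2-style `ModelPackage` class; the model package ARN, IAM role ARN, and endpoint name are placeholders you would supply from your own subscription and account, and the helper names are hypothetical. The instance list is copied from the real-time rows of the pricing table on this page.

```python
# Real-time instance types taken from the pricing table on this page.
REAL_TIME_INSTANCE_TYPES = (
    "ml.g5.2xlarge",   # marked "Recommended" for real-time inference
    "ml.g4dn.4xlarge",
    "ml.p3.2xlarge",
    "ml.g4dn.8xlarge",
    "ml.g5.xlarge",
    "ml.g4dn.xlarge",
    "ml.g4dn.2xlarge",
    "ml.g5.4xlarge",
)


def check_instance_type(instance_type):
    """Return the instance type if the listing supports it for real-time use."""
    if instance_type not in REAL_TIME_INSTANCE_TYPES:
        raise ValueError(f"{instance_type} is not listed for real-time inference")
    return instance_type


def deploy_endpoint(model_package_arn, role_arn, endpoint_name,
                    instance_type="ml.g5.2xlarge"):
    """Create a real-time endpoint from a subscribed model package.

    All three identifiers are placeholders supplied by the caller.
    """
    instance_type = check_instance_type(instance_type)

    # Imported lazily so the validation helper works without the SDK installed.
    import sagemaker
    from sagemaker import ModelPackage

    model = ModelPackage(
        role=role_arn,
        model_package_arn=model_package_arn,
        sagemaker_session=sagemaker.Session(),
    )
    return model.deploy(
        initial_instance_count=1,
        instance_type=instance_type,
        endpoint_name=endpoint_name,
    )
```

Although the listing price is $0.00, a running endpoint still uses an AWS instance, so it is worth deleting the endpoint when it is no longer needed.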
Version release notes
Add support for more instance types.
Additional details
Inputs
- Summary
The model accepts a text request that specifies the user query.
- Input MIME type
- text/plain, application/json
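To illustrate the two input MIME types, here is a minimal sketch that builds a request for the SageMaker Runtime `invoke_endpoint` call via `boto3`. The listing does not document the JSON schema, so the `"inputs"` key below is an assumption for illustration only; the function names and the endpoint name are likewise hypothetical.

```python
import json


def build_invoke_args(endpoint_name, query, as_json=False):
    """Build keyword arguments for SageMaker Runtime's invoke_endpoint."""
    if as_json:
        # Assumption: "inputs" is a placeholder key; the listing does not
        # specify the JSON schema the model expects.
        body = json.dumps({"inputs": query})
        content_type = "application/json"
    else:
        body = query
        content_type = "text/plain"
    return {
        "EndpointName": endpoint_name,
        "ContentType": content_type,
        "Body": body.encode("utf-8"),
    }


def ask(endpoint_name, query, as_json=False):
    """Send the query to a deployed endpoint and return the raw response text."""
    # Imported lazily so build_invoke_args works without AWS access.
    import boto3

    runtime = boto3.client("sagemaker-runtime")
    response = runtime.invoke_endpoint(
        **build_invoke_args(endpoint_name, query, as_json)
    )
    return response["Body"].read().decode("utf-8")
```

Keeping the payload construction separate from the network call makes it easy to inspect or log exactly what would be sent before an endpoint exists.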
Support
Vendor support
Please write to marketing@bitext.com to contact our support team. We are available Monday to Friday, 10am to 7pm PST. For queries received during these hours, we will respond within 4 hours. For queries received outside these hours, we will respond within 8 hours.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.