Listing Thumbnail

    Bitext FT Mistral-7B for Retail Banking

     Info
    Sold by: Bitext 
    Deployed on AWS
    Mistral-7B-Retail-Banking-v1

    Overview

    This model, "Mistral-7B-Retail-Banking-v1", is a fine-tuned version of the "Mistral-7B-Instruct-v0.2" model, specifically tailored for the Retail Banking domain. It is optimized to answer questions and assist users with various banking transactions. It has been trained using hybrid synthetic data generated using our NLP/NLG technology and our automated Data Labeling (DAL) tools.

    The goal of this model is to show that a generic verticalized model makes customization for a final use case much easier. For example, if you are "ACME Bank", you can create your own customized model by using this fine-tuned model and doing an additional fine-tuning using a small amount of your own data. An overview of this approach can be found at https://www.bitext.com/two-step/ 

    Highlights

    • Intended Use: - Recommended applications: This model is designed to be used as the first step in Bitext’s two-step approach to fine-tuning LLMs for the Retail Banking domain, providing customers with fast and accurate answers about their banking needs. - Out-of-scope: It should not be used for non-banking related inquiries or for providing advice on medical, legal, or critical safety issues.
    • The model was trained using a dataset designed for Retail Banking interactions, now publicly available on Hugging Face at https://huggingface.co/datasets/bitext/Bitext-retail-banking-llm-chatbot-training-dataset. This dataset comprises 26 different intents such as check_balance, transfer_money, open_account, and more, each with around 1000 examples.
    • The training dataset includes: 25,545 question/answer pairs 4.98 million tokens 1224 entity/slot types Each entry consists of: Instruction: User request Category: High-level semantic category Intent: Specific intent of the user request Response: Example response from a virtual assistant The dataset covers a wide range of banking-related categories such as ACCOUNT, ATM, CARD, CONTACT, FEES, FIND, LOAN, PASSWORD, and TRANSFER, ensuring comprehensive training for handling diverse retail banking queries.

    Details

    Sold by

    Delivery method

    Latest version

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Bitext FT Mistral-7B for Retail Banking

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (10)

     Info
    Dimension
    Description
    Cost/host/hour
    ml.g5.2xlarge Inference (Real-Time)
    Recommended
    Model inference on the ml.g5.2xlarge instance type, real-time mode
    $0.00
    ml.p3.2xlarge Inference (Batch)
    Recommended
    Model inference on the ml.p3.2xlarge instance type, batch mode
    $0.00
    ml.g4dn.4xlarge Inference (Real-Time)
    Model inference on the ml.g4dn.4xlarge instance type, real-time mode
    $0.00
    ml.p3.2xlarge Inference (Real-Time)
    Model inference on the ml.p3.2xlarge instance type, real-time mode
    $0.00
    ml.g4dn.8xlarge Inference (Real-Time)
    Model inference on the ml.g4dn.8xlarge instance type, real-time mode
    $0.00
    ml.g5.xlarge Inference (Real-Time)
    Model inference on the ml.g5.xlarge instance type, real-time mode
    $0.00
    ml.g4dn.xlarge Inference (Real-Time)
    Model inference on the ml.g4dn.xlarge instance type, real-time mode
    $0.00
    ml.g4dn.2xlarge Inference (Real-Time)
    Model inference on the ml.g4dn.2xlarge instance type, real-time mode
    $0.00
    ml.g5.4xlarge Inference (Real-Time)
    Model inference on the ml.g5.4xlarge instance type, real-time mode
    $0.00
    ml.p3.8xlarge Inference (Batch)
    Model inference on the ml.p3.8xlarge instance type, batch mode
    $0.00

    Vendor refund policy

    This product is offered for free. If there are any questions, please contact us for further clarifications.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Amazon SageMaker model

    An Amazon SageMaker model package is a pre-trained machine learning model ready to use without additional training. Use the model package to create a model on Amazon SageMaker for real-time inference or batch processing. Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.

    Deploy the model on Amazon SageMaker AI using the following options:
    Deploy the model as an API endpoint for your applications. When you send data to the endpoint, SageMaker processes it and returns results by API response. The endpoint runs continuously until you delete it. You're billed for software and SageMaker infrastructure costs while the endpoint runs. AWS Marketplace models don't support Amazon SageMaker Asynchronous Inference. For more information, see Deploy models for real-time inference  .
    Deploy the model to process batches of data stored in Amazon Simple Storage Service (Amazon S3). SageMaker runs the job, processes your data, and returns results to Amazon S3. When complete, SageMaker stops the model. You're billed for software and SageMaker infrastructure costs only during the batch job. Duration depends on your model, instance type, and dataset size. AWS Marketplace models don't support Amazon SageMaker Asynchronous Inference. For more information, see Batch transform for inference with Amazon SageMaker AI  .
    Version release notes

    Add support for more instance types.

    Additional details

    Inputs

    Summary

    The model accepts text requests which specifies the user query.

    Input MIME type
    text/plain, application/json
    <s>[INST] I want to transfer money, can you help me make a bank transfer? [/INST]
    https://github.com/bitext/bitext-mistral-7b-banking/blob/main/examples/sample_input.txt

    Support

    Vendor support

    Please write to marketing@bitext.com  to contact our support team. We are available Monday to Friday, 10am to 7pm PST. For queries during this time we will revert within 4 hours. For queries beyond this time, we would revert back in 8hours.

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    No customer reviews yet
    Be the first to review this product . We've partnered with PeerSpot to gather customer feedback. You can share your experience by writing or recording a review, or scheduling a call with a PeerSpot analyst.