
    LakeFusion Master Data Management

    Deployed on AWS
    LakeFusion brings multidomain Master Data Management (MDM) to the Databricks Lakehouse across industries. It simplifies entity resolution, unifies data, and ensures data consistency at scale, making it your trusted partner in achieving data excellence.

    Overview


    LakeFusion is a Databricks-native Master Data Management (MDM) solution designed to help organizations take full control of their data quality and integrity at scale. Built to run entirely within the Databricks Lakehouse, LakeFusion resolves entities across multiple sources by eliminating duplicates and inconsistencies, while generating accurate, unified master records for streamlined operations and data-driven decision-making. By creating golden records and establishing a single source of truth, LakeFusion ensures trusted, high-quality data across the enterprise. It also standardizes and enriches incoming data to maintain consistency across systems.

    Whether you're working with millions or billions of records, LakeFusion scales effortlessly to handle large datasets with speed and precision, making it an ideal choice for organizations looking to harness the full potential of their data within the Databricks ecosystem.

    To learn more about experiencing seamless MDM within Databricks, visit https://www.lakefusion.ai/ 

    Highlights

    • The only MDM solution built natively for Databricks, delivering unmatched simplicity and cost savings.
    • From demo to implementation in under 6 weeks, delivering results three times faster than external vendors.
    • Perform all MDM operations within Databricks, streamlining workflows and reducing operational friction.

    Details

    Delivery method

    Supported services

    Delivery option
    Lakefusion-ECS-Fargate

    Latest version

    Operating system
    Linux


    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

    Pricing

    LakeFusion Master Data Management

    Pricing is based on a fixed subscription cost. You pay the same amount each billing period for unlimited usage of the product. Pricing is prorated, so you're only charged for the number of days you've been subscribed. Subscriptions have no end date and may be canceled any time.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

    Fixed subscription cost

    $6,666.66/month
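
    As a rough illustration of the proration described above, the sketch below computes a partial-period charge from the listed monthly price. It assumes simple per-day proration over the number of days in the billing period, rounded to the cent; that formula is an assumption, and AWS Marketplace's actual billing calculation may differ. The function name prorated_charge is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

# Fixed subscription cost taken from this listing.
MONTHLY_PRICE = Decimal("6666.66")


def prorated_charge(monthly_price, days_subscribed, days_in_period):
    """Estimate the charge for a partial billing period.

    Assumption: the price is spread evenly across the days of the
    billing period. This is an illustration, not AWS's billing logic.
    """
    if days_in_period <= 0 or not 0 <= days_subscribed <= days_in_period:
        raise ValueError("days_subscribed must be between 0 and days_in_period")
    amount = monthly_price * days_subscribed / days_in_period
    # Round to whole cents.
    return amount.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # 10 days of a 30-day period under the assumption above.
    print(prorated_charge(MONTHLY_PRICE, 10, 30))
```

    Decimal is used instead of float so that currency amounts are not affected by binary rounding.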

    Vendor refund policy

    LakeFusion does not offer refunds for any products or services listed on AWS Marketplace. All purchases are final. For support, contact us at support@lakefusion.ai or visit https://www.lakefusion.ai. In limited cases and at its discretion, AWS may issue refunds for purchases. Please contact AWS Marketplace for more information on eligible refunds.


    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA).

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information


    Delivery details

    Lakefusion-ECS-Fargate

    Supported services:
    • Amazon ECS
    Container image

    Containers are lightweight, portable execution environments that wrap server application software in a filesystem that includes everything it needs to run. Container applications run on supported container runtimes and orchestration services, such as Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS). Both eliminate the need for you to install and operate your own container orchestration software by managing and scheduling containers on a scalable cluster of virtual machines.

    Version release notes

    Version: Initial-deployment
    Release Date: 20 June 2025
    Deployment Type: Container

    Additional details

    Usage instructions

    LakeFusion Deployment on AWS (via CloudFormation + Fargate + Terraform)

    1. Prerequisites
    Before deploying LakeFusion, ensure you have the following:
    a. AWS Account Requirements
      • AWS account with permissions to:
        • Launch CloudFormation stacks
        • Create IAM roles and policies
        • Use ECS Fargate
        • Write to CloudWatch Logs
    b. Databricks Configuration
      • Workspace URL (e.g., https://<your-org>.databricks.com)
      • Workspace ID
      • Personal Access Token (PAT)
      • App Client ID & Secret
      • Domain name
    c. AWS Networking Setup
      • VPC (default)
      • Subnet
      • Security Group
    2. Deployment Architecture Overview
    This deployment leverages the following components:
      • CloudFormation Template: Collects user inputs and provisions resources
      • ECS Fargate Task: Automatically launched by CloudFormation to run the LakeFusion installer
      • Terraform Automation: Executed inside the container to provision:
        • IAM roles and policies
        • Networking resources
        • Databricks integration
      • All logs and operations are visible in CloudWatch Logs.
    3. Step-by-Step Deployment Guide
    Launch the CloudFormation Stack
      • Log in to the AWS Console
      • Navigate to CloudFormation > Create Stack > With new resources (standard)
      • Upload the provided LakeFusion CFN template file or provide its S3 URL
      • Click Next
    Provide Stack Parameters
      • DatabricksWorkspaceURL (e.g., https://<your-org>.databricks.com)
      • WorkspaceID (numeric only)
      • PersonalAccessToken
      • AppClientID, AppClientSecret
      • Click Next, then Next, and finally Create Stack.
    Fargate Task Execution
    After stack creation begins:
      • CloudFormation provisions an ECS Fargate Task
      • The task runs Terraform scripts inside a container
      • Monitor deployment progress via CloudWatch Logs: go to ECS > Clusters, locate your stack's task, and view its logs in CloudWatch
    4. Accessing LakeFusion
      • Once the deployment completes successfully, open the URL in your browser to access the LakeFusion interface
    5. Additional Resources & Support
    For a detailed installation guide, troubleshooting steps, and configuration examples, visit: https://support.lakefusion.ai/portal/en/kb/articles/lakefusion-installation-guide-via-aws-url
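
    For teams that prefer to script the stack launch instead of using the console, the stack parameters named in the deployment steps above can be validated and assembled ahead of time. The following is a minimal sketch, not the vendor's tooling. The parameter keys come from this listing; the helper names build_stack_parameters and create_lakefusion_stack, the use of boto3, the TemplateURL argument, and the CAPABILITY_NAMED_IAM capability are assumptions that should be checked against the actual LakeFusion template.

```python
from urllib.parse import urlparse


def build_stack_parameters(workspace_url, workspace_id, pat,
                           app_client_id, app_client_secret):
    """Validate inputs and return them as a list of
    ParameterKey/ParameterValue dicts, the shape CloudFormation's
    CreateStack API accepts for Parameters."""
    parsed = urlparse(workspace_url)
    if parsed.scheme != "https" or not parsed.netloc:
        raise ValueError("DatabricksWorkspaceURL must be an https:// URL")

    workspace_id = str(workspace_id)
    # The listing states that WorkspaceID is numeric only.
    if not (workspace_id.isascii() and workspace_id.isdigit()):
        raise ValueError("WorkspaceID must be numeric only")

    values = {
        "DatabricksWorkspaceURL": workspace_url,
        "WorkspaceID": workspace_id,
        "PersonalAccessToken": pat,
        "AppClientID": app_client_id,
        "AppClientSecret": app_client_secret,
    }
    missing = [key for key, value in values.items() if not value]
    if missing:
        raise ValueError("Missing values for: " + ", ".join(missing))

    return [{"ParameterKey": k, "ParameterValue": v} for k, v in values.items()]


def create_lakefusion_stack(stack_name, template_url, parameters):
    """Hypothetical helper: launch the stack with boto3 (third-party).

    Assumption: the template creates named IAM resources, so
    CAPABILITY_NAMED_IAM is passed; adjust to what the template needs.
    """
    import boto3  # imported here so the validation above runs without it

    cfn = boto3.client("cloudformation")
    return cfn.create_stack(
        StackName=stack_name,
        TemplateURL=template_url,
        Parameters=parameters,
        Capabilities=["CAPABILITY_NAMED_IAM"],
    )
```

    Validating locally catches a malformed workspace URL or a non-numeric workspace ID before CloudFormation starts the Fargate task. Secrets such as the Personal Access Token should be read from a secure store rather than hard-coded.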

    Resources

    Support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

    0 ratings, 0 AWS reviews
    No customer reviews yet
    Be the first to review this product. We've partnered with PeerSpot to gather customer feedback. You can share your experience by writing or recording a review, or scheduling a call with a PeerSpot analyst.