Overview
LakeFusion Dashboard
LakeFusion Master Data Management MDM Native to Databricks Intuitive Dashboard for seamless data management and processing
LakeFusion Dashboard
LakeFusion Golden Record
LakeFusion Data Quality

Product video
LakeFusion is a Databricks-native Master Data Management (MDM) solution designed to help organizations take full control of their data quality and integrity at scale. Built to run entirely within the Databricks Lakehouse, LakeFusion resolves entities across multiple sources by eliminating duplicates and inconsistencies, while generating accurate, unified master records for streamlined operations and data-driven decision-making. By creating golden records and establishing a single source of truth, LakeFusion ensures trusted, high-quality data across the enterprise. It also standardizes and enriches incoming data to maintain consistency across systems.
Whether you're working with millions or billions of records, LakeFusion scales effortlessly to handle large datasets with speed and precision, making it an ideal choice for organizations looking to harness the full potential of their data within the Databricks ecosystem.
To learn more about experiencing seamless MDM within Databricks, visit https://www.lakefusion.ai/Â
Highlights
- The only MDM solution built natively for Databricks, delivering unmatched simplicity and cost savings.
- From demo to implementation in under 6 weeks, delivering results three times faster than external vendors.
- Perform all MDM operations within Databricks, streamlining workflows and reducing operational friction.
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
- $6,666.66/month
Vendor refund policy
LakeFusion does not offer refunds for any products or services listed on AWS Marketplace. All purchases are final. For support, contact us at support@lakefusion.ai or visit https://www.lakefusion.ai . In limited cases and subject to their discretion, AWS may issue refunds for purchases. Please contact AWS Marketplace for more information on eligible refunds
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Lakefusion-ECS-Fargate
- Amazon ECS
Container image
Containers are lightweight, portable execution environments that wrap server application software in a filesystem that includes everything it needs to run. Container applications run on supported container runtimes and orchestration services, such as Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS). Both eliminate the need for you to install and operate your own container orchestration software by managing and scheduling containers on a scalable cluster of virtual machines.
Version release notes
Version: Initial-deployment Release Date: 20 June 2025 Deployment Type: Container
Additional details
Usage instructions
LakeFusion Deployment on AWS (via CloudFormation + Fargate + Terraform)
- Prerequisites a. Before deploying LakeFusion, ensure you have the following:
- AWS Account Requirements
- AWS account with permissions to:
- Launch CloudFormation stacks
- Create IAM roles and policies
- Use ECS Fargate
- Write to CloudWatch Logs b. Databricks Configuration
- Workspace URL (e.g., https://<your-org>.databricks.com)
- Workspace ID
- Personal Access Token (PAT)
- App Client ID & Secret
- Domin name c. AWS Networking Setup
- VPC (default)
- Subnet
- Security Group
- Deployment Architecture Overview This deployment leverages the following components:
- CloudFormation Template: Collects user inputs and provisions resources
- ECS Fargate Task: Automatically launched by CloudFormation to run LakeFusion installer
- Terraform Automation: Executed inside the container to provision: IAM roles and policies Networking resources Databricks integration
- All logs and operations are visible in CloudWatch Logs.
- Step-by-Step Deployment Guide
Launch the CloudFormation Stack
- Log in to the AWS Console
- Navigate to CloudFormation > Create Stack > With new resources (standard)
- Upload the provided LakeFusion CFN template file or provide its S3 URL
- Click Next Provide Stack Parameters DatabricksWorkspaceURL (e.g. https://<your-org>.databricks.com) WorkspaceID (numeric only) PersonalAccessToken AppClientID, AppClientSecret Click Next, then Next, and finally Create Stack. Fargate Task Execution After stack creation begins CloudFormation provisions an ECS Fargate Task The task runs Terraform scripts inside a container Monitor deployment progress via CloudWatch Logs: Go to ECSClusters Locate your stack's task and view logs in CloudWatch
- Accessing LakeFusion
- Once the deployment completes successfully: Open the URL in your browser to access the LakeFusion interface
- Additional Resources & Support For a detailed installation guide, troubleshooting steps, and configuration examples, visit: https://support.lakefusion.ai/portal/en/kb/articles/lakefusion-installation-guide-via-aws-urlÂ
Resources
Vendor resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.