Overview

lakeFS is the control plane for AI data. It provides Git-like version management and governance for object stores, giving data and AI teams a reliable foundation for building and operating large-scale, production-grade AI systems. lakeFS addresses the core challenges of managing fast-changing, high-volume data by creating a consistent, programmable layer on top of your data lake. This enables organizations to bring order, predictability, and automation to the full lifecycle of AI data.
With lakeFS, teams create isolated data branches to test feature engineering, retrain models, or run analytics without disrupting production datasets. Every change to data is atomic and reversible, allowing safe iteration, continuous experimentation, and predictable promotion of validated data to production. lakeFS turns any object store into a structured environment with commits, merges, rollbacks, and lineage, enabling full reproducibility and control across all AI and analytics workflows.
lakeFS works as a scalable, low-overhead control plane that integrates with existing compute engines, pipelines, and governance tools. It supports Spark, Python, Hive, Presto, Flink, and modern AI and ML workload patterns. Because it runs on top of S3-compatible storage, lakeFS requires no changes to the underlying data layout and can operate at the scale of enterprise and hyperscale data platforms.
Key capabilities include zero-copy branching for isolated model training and data preparation, reproducible data snapshots for compliance and auditing, automated checks that validate data before merging, and instant rollback to recover from bad data, pipeline failures, or unintended model drift. lakeFS ensures every version of your AI data is tracked, discoverable, and available for debugging, analysis, and governance.
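To make the ideas of zero-copy branching, pre-merge checks, and rollback concrete, here is a minimal in-memory sketch in Python. It is a toy model for illustration only and is not the lakeFS API: `ToyRepo`, `Commit`, `no_empty_addresses`, the object paths, and the `addr-v1`/`addr-v2` addresses are all hypothetical names. It assumes a branch is just a pointer to an immutable commit, which is why creating one copies no data.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Optional


@dataclass(frozen=True)
class Commit:
    """Immutable snapshot mapping object paths to (hypothetical) storage addresses."""
    objects: Dict[str, str]
    message: str
    parent: Optional["Commit"] = None


Check = Callable[[Dict[str, str]], bool]


class ToyRepo:
    """Toy model of branches as pointers to commits (not the lakeFS API)."""

    def __init__(self) -> None:
        self.branches: Dict[str, Commit] = {"main": Commit({}, "initial commit")}

    def create_branch(self, name: str, source: str) -> None:
        # Zero-copy: the new branch points at the same commit; no data is copied.
        self.branches[name] = self.branches[source]

    def commit(self, branch: str, changes: Dict[str, str], message: str) -> None:
        # The branch pointer moves to a new snapshot in one step.
        head = self.branches[branch]
        self.branches[branch] = Commit({**head.objects, **changes}, message, head)

    def merge(self, source: str, dest: str, checks: Iterable[Check] = ()) -> None:
        candidate, target = self.branches[source], self.branches[dest]
        # Run every validation check before the destination is touched.
        if not all(check(candidate.objects) for check in checks):
            raise ValueError("pre-merge check failed; destination left unchanged")
        # Simplification: only fast-forward merges are modeled.
        node: Optional[Commit] = candidate
        while node is not None and node is not target:
            node = node.parent
        if node is None:
            raise ValueError("only fast-forward merges are modeled here")
        self.branches[dest] = candidate

    def rollback(self, branch: str) -> None:
        # Move the branch pointer back to the previous snapshot.
        parent = self.branches[branch].parent
        if parent is None:
            raise ValueError("no earlier commit to roll back to")
        self.branches[branch] = parent


def no_empty_addresses(objects: Dict[str, str]) -> bool:
    """Example check: every path must reference a non-empty address."""
    return all(objects.values())


repo = ToyRepo()
repo.commit("main", {"data/train.parquet": "addr-v1"}, "load training data")
repo.create_branch("experiment", source="main")
repo.commit("experiment", {"data/train.parquet": "addr-v2"}, "new features")
repo.merge("experiment", "main", checks=[no_empty_addresses])  # promote validated data
repo.rollback("main")  # recover the previous consistent state
```

In this sketch, `main` is unaffected by work on `experiment` until the merge, and a failed check raises before the destination pointer moves. A real deployment would use the lakeFS clients and hooks rather than anything defined here.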
By using lakeFS as the control plane for AI data, organizations increase the quality and velocity of model development, reduce incidents caused by bad data, and achieve reliable and explainable AI behavior. lakeFS brings a modern software development lifecycle to data, enabling consistent, trustworthy, and governed AI operations at scale.
For custom pricing, a EULA, or a private contract, please contact AWS-Marketplace@treeverse.com to request a private offer.
Highlights
- Data team efficiency - eliminates repetitive manual tasks such as rolling back production data or reproducing data by hand. Automation saves data engineers time.
- High-quality data products - validate data coming into or analyzed within the lake before it is exposed to external users, using a CI/CD pipeline for your data to prevent inconsistencies and errors.
- Data resilience - quickly recover from mistakes and inconsistencies by rolling back the entire data lake to its previous consistent state.
Details
Pricing
| Dimension | Description | Cost/12 months |
|---|---|---|
| lakeFS Managed Service | Git-like version control for a data lake, in a fully managed service. | $85,000.00 |
The following dimensions are not included in the contract terms and are charged based on your usage.
| Dimension | Cost/unit |
|---|---|
| Additional cost per API call to lakeFS | $0.002 |
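As a rough illustration of how the two pricing tables combine, the following Python sketch adds the 12-month contract price to usage charges for additional API calls. The function name `estimate_annual_cost` and its `billable_api_calls` parameter are hypothetical. The listing does not state how many API calls, if any, are covered by the contract, so the caller is assumed to pass only the calls that are actually billed as additional.

```python
from decimal import Decimal

# Figures taken from the pricing tables above.
ANNUAL_CONTRACT_USD = Decimal("85000.00")
COST_PER_ADDITIONAL_API_CALL_USD = Decimal("0.002")


def estimate_annual_cost(billable_api_calls: int) -> Decimal:
    """Estimate the 12-month cost in USD.

    Assumption: `billable_api_calls` counts only calls charged at the
    additional per-call rate; any included quota is not modeled here.
    """
    if billable_api_calls < 0:
        raise ValueError("billable_api_calls must be non-negative")
    usage = COST_PER_ADDITIONAL_API_CALL_USD * billable_api_calls
    # Round to whole cents for display.
    return (ANNUAL_CONTRACT_USD + usage).quantize(Decimal("0.01"))


estimate = estimate_annual_cost(1_000_000)
```

Under these assumptions, 1,000,000 billable calls add $2,000.00 in usage charges, for an estimated total of $87,000.00. `Decimal` is used instead of floating point so that currency amounts do not pick up rounding errors.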
Vendor refund policy
We do not currently support refunds.
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
Reach support by email, through our website, or by reporting an issue in the built-in chat in lakeFS Cloud at https://lakefs.cloud.
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.