Listing Thumbnail

    Data on Amazon EKS

     Info
    The "Data on EKS" solution is a modern data lakehouse architecture built on AWS Elastic Kubernetes Service (EKS). It enables scalable, modular, and secure data ingestion, transformation, governance, and analytics using open-source and AWS-managed tools. The solution supports both real-time and batch processing to meet diverse enterprise data requirements.

    Overview

    Product Features

    Product Benefits

    Unified Architecture

    Open Standards

    High Performance

    Faster Insights

    Fine-Grained Governance

    Cost Optimization

    Enterprise Flexibility

    Key Components:

    Apache Iceberg: Open table format enabling fast queries, ACID compliance, and time travel

    Apache Kafka: Real-time data ingestion

    Apache Spark: Data transformation and enrichment

    Apache Airflow: Pipeline orchestration

    Trino: Query engine

    Superset: BI & dashboarding

    Unity Catalog: Metadata management

    Apache Ranger: Access control and governance

    Use Cases

    Use Cases Primary Use Cases:

    Unified batch and streaming pipelines for large-scale data processing

    Building enterprise-grade data lakes with fine-grained governance

    Democratizing data access through SQL and dashboards

    Real-time alerting and operational analytics

    Industry-Specific Applications:

    Retail

    Financial Services

    Healthcare

    Telecom

    Product-Specific Information

    AWS Services Used: Amazon EKS, Amazon S3, AWS DMS, IAM, CloudWatch, Secrets Manager

    Open-Source Tools: Apache Kafka, Spark, Iceberg, Trino, Airflow, Ranger, Superset

    Security Model: RBAC via Ranger, encryption via S3-SSE and TLS, scoped IAM roles

    Supported Workloads: Streaming ingestion, batch ETL, ad hoc SQL, interactive dashboards

    Scalability: Kubernetes-native autoscaling for pods and compute resources, modular microservices design

    Highlights

    • Modernizes data architecture with a governed, scalable, and cost-effective platform.
    • Enhances productivity across data roles, improving efficiency.
    • Supports rapid decision-making through real-time insights.

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Pricing

    Custom pricing options

    Pricing is based on your specific requirements and eligibility. To get a custom quote for your needs, request a private offer.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.