Listing Thumbnail

    MarkLogic Multi-Model Database: Enterprise Edition v. 10

     Info
    Deployed on AWS
    MarkLogic Server Enterprise Edition v. 10 with Semantics, Advanced Security, and Tiered Storage Options
    4.3

    Overview

    MarkLogic Server is the agile, scalable, and secure foundation of the MarkLogic Data Platform. A multi-model database with a wide array of enterprise-level data integration and management features, MarkLogic helps you create value from complex data - faster.

    MarkLogic Server natively stores JSON, XML, text, geospatial, and semantic data in a single, unified data platform. This ability to store and query a variety of data models provides unprecedented flexibility and agility when integrating data from silos. MarkLogic is the best, most comprehensive database to power an enterprise data platform.

    MarkLogic Server is built to securely integrate data, track it through the integration process, and safely share in it in its curated form. Meet business-critical goals and accelerate innovation with MarkLogic.

    Highlights

    • Best-in-class multi-model database: Advanced search, robust metadata management and semantic capabilities.
    • ACID Transactions: 100% ACID compliant, high-performance distributed transactions. Guaranteed strongly consistent read and write operations.
    • Secure and Governed: Granular role-based access controls and advanced security certifications. Includes features like BYOK, data loss prevention, ABAC policies, and more.

    Details

    Delivery method

    Delivery option
    64-bit (x86) Amazon Machine Image (AMI)

    Latest version

    Operating system
    AmazonLinux amzn2-ami-hvm-2.0.20220218.3-x86_64-gp2

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    MarkLogic Multi-Model Database: Enterprise Edition v. 10

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time. Alternatively, you can pay upfront for a contract, which typically covers your anticipated usage for the contract duration. Any usage beyond contract will incur additional usage-based costs.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (345)

     Info
    • ...
    Dimension
    Cost/hour
    r5.2xlarge
    Recommended
    $4.373
    m5ad.12xlarge
    $26.235
    g5.48xlarge
    $104.94
    r5n.12xlarge
    $26.235
    i3.xlarge
    $2.186
    r5a.4xlarge
    $8.745
    m5zn.6xlarge
    $13.118
    c6a.48xlarge
    $104.94
    m6i.2xlarge
    $4.373
    c5a.16xlarge
    $34.98

    Vendor refund policy

    We do not currently support refunds, but you can cancel at any time.

    How can we make this page better?

    Tell us how we can improve this page, or report an issue with this product.
    Tell us how we can improve this page, or report an issue with this product.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    64-bit (x86) Amazon Machine Image (AMI)

    Amazon Machine Image (AMI)

    An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.

    Version release notes

    This is the 10.0-11.1 release of MarkLogic on AWS Marketplace.

    See http://developer.marklogic.com/products/cloud/aws  for additional details.

    Additional details

    Usage instructions

    This AMI includes a MarkLogic Essential Enterprise Production license. The AMI is configured to store MarkLogic configuration and data on an attached EBS storage. When you launch this AMI via the EC2 Console, the storage will be pre-configured and it must remain on /dev/sdf device. Leave off the 'Delete-on-termination' checkbox, to enable you to keep your data. If you start the EC2 instance without using supplying any configuration data as described in the documentation (link below), then the MarkLogic server will initialize the server and create a default administrator account. You can access the Administration portal on port 8001 using username "admin" and the password equal to the EC2 instance ID (e.g. "i-001602692a5d518a4").

    MarkLogic also provides a Cloud Formation template for launching this AMI that provides the easiest way to gain the benefits of high-availability and scalability.

    FOR MORE DETAILED INSTRUCTIONS, SEE http://developer.marklogic.com/products/cloud/aws 

    Support

    Vendor support

    For support, Contact MarkLogic by creating a ticket at https://help.marklogic.com/  or sending an email to cloud-support@marklogic.com . Support is not included in hourly fee. Community-based support is available at http://developer.marklogic.com/qa . Free MarkLogic training is available here https://www.marklogic.com/learn/university/  https://help.marklogic.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Data Catalogs
    Top
    25
    In Financial Services, Databases
    Top
    100
    In Databases, Analytic Platforms

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Multi-Model Data Storage
    Natively stores JSON, XML, text, geospatial, and semantic data in a single unified platform
    ACID Transaction Support
    100% ACID compliant with high-performance distributed transactions and strongly consistent read and write operations
    Advanced Search and Metadata Management
    Advanced search capabilities with robust metadata management and semantic query functionality
    Role-Based Access Control
    Granular role-based access controls and attribute-based access control (ABAC) policies for data governance
    Tiered Storage Options
    Support for tiered storage architecture to optimize data management across different storage tiers
    Distributed SQL Database Architecture
    Cloud-native, distributed SQL database designed for high availability and global distribution across multiple regions and availability zones
    High Availability and Fault Tolerance
    Continues serving queries during node failures, availability zone failures, and AWS region failures without service interruption
    PostgreSQL Compatibility
    Postgres-compatible SQL interface enabling seamless integration with existing applications and tools
    ACID Transaction Support
    Supports ACID transactions ensuring data consistency and reliability across distributed deployments
    Multi-Region Data Placement
    Enables single unified database deployment across multiple AWS regions with configurable data locality and low-latency access
    Graph Database Engine
    Cloud-based graph database powered by ArangoDB supporting native graph query processing and relationship traversal for connected data analysis
    Multi-Model Data Support
    Unified platform supporting graph, JSON document, full-text search, and machine learning capabilities through a single query language
    Security and Access Control
    Advanced security features including private endpoints, single sign-on (SSO), and audit logging for access management and compliance
    High Availability and Disaster Recovery
    Data replication with multi-region cloud backups and fully-managed infrastructure ensuring business continuity
    Advanced Analytics and Machine Learning
    Integrated machine learning capabilities enabling predictive analytics, pattern detection, and insights extraction from connected data

    Contract

     Info
    Standard contract
    No

    Customer reviews

    Ratings and reviews

     Info
    4.3
    76 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    54%
    41%
    5%
    0%
    0%
    9 AWS reviews
    |
    67 external reviews
    External reviews are from G2  and PeerSpot .
    PranavChaudhari

    Centralized multi-model data platform has improved retrieval speed and supported trusted analytics

    Reviewed on May 28, 2026
    Review from a verified AWS customer

    What is our primary use case?

    Our main use case for MarkLogic  is as a centralized repository where we store our data. MarkLogic  functions as a NoSQL database, allowing us to store XML, JSON, and text format data. MarkLogic is also a very fast database, providing really fast results when we query something.

    We use MarkLogic on a daily basis. Our transactions include REST APIs that are created in MarkLogic. Day to day, we receive many calls that interact with and update the data in MarkLogic. We have the Data Hub Framework installed in MarkLogic, enabling data to come from multiple sources. We then tune this data and maintain it as a golden record after harmonization and curation.

    What is most valuable?

    MarkLogic is known for several strengths, particularly its multi-model database capability.

    The multi-model database capability in MarkLogic can handle documents, graphs, and relational data all in one place. This is where MarkLogic has helped us considerably because a single database is able to accomplish a lot of enterprise work. We do not have to reach out to many products, as a single product having extensive capability is beneficial for our organization.

    MarkLogic also covers many things around security and has strong search capability. The combination means we can confidently handle large, complex datasets in MarkLogic.

    MarkLogic has had a very positive impact on our client and our organization. Because it is a multi-model database, we can handle multiple data types. A single platform takes all the data from all sources, reducing our dependency on multiple systems. The transactions in MarkLogic are ACID, providing atomicity, consistency, isolation, and durability of the transactions and data. We have used it for many purposes, including a stewardship platform that runs complex code in MarkLogic and helps us perform deduplication by identifying duplicate data.

    We observed a significant improvement in data retrieval time with MarkLogic. Queries that previously took seconds or longer are now executed much faster because the data was previously in SQL and is now in a NoSQL XML format. We use CTS search, and the data we need already sits in the index, allowing us to query the documents we need. The indexing capability is an advantage that adds real value to saving query time.

    Regarding AI capabilities and its governance and security, MarkLogic has very good security. AI usage is not extensive in MarkLogic, but it provides fine-grained access and role-based access to every document or even element level. This means AI models or applications can access data while we restrict them to see only what we want them to see. MarkLogic has built-in data governance features such as metadata management, data lineage, and auditing. These features help track where the data is coming from and how it is used, which is important when feeding data into an AI system to ensure trust and accountability.

    MarkLogic itself is not an AI model, but it plays a critical role in providing high-quality, trusted data to an AI system. In terms of reliability, features like ACID transactions and consistency ensure that data remains correct and stable, which reduces the chance of incorrect and inconsistent AI results. Since AI is not heavily involved in MarkLogic, the data gives us correct results.

    What needs improvement?

    MarkLogic can be improved by allowing multiple language support, as it currently supports JavaScript, XQuery, and SQL, whereas other languages are not supported at this time.

    Regarding improvements to MarkLogic, the learning curve is an area that could be enhanced. MarkLogic offers many features, but for new developers or teams unfamiliar with the ecosystem, it can take time to become comfortable with concepts like XQuery and server-side JavaScript. Improving onboarding resources or simplifying development workflows could help teams adopt it faster. MarkLogic has its own courses, but it would be better if there were more documentation and videos available.

    For how long have I used the solution?

    I have been using MarkLogic for approximately four or five years.

    What other advice do I have?

    How we query MarkLogic depends on our data structure and how we are trying to obtain the output from it.

    Regarding improvements to MarkLogic, the learning curve is an area that could be enhanced. MarkLogic offers many features, but for new developers or teams unfamiliar with the ecosystem, it can take time to become comfortable with concepts like XQuery and server-side JavaScript. Improving onboarding resources or simplifying development workflows could help teams adopt it faster. MarkLogic has its own courses, but it would be better if there were more documentation and videos available.

    I rate MarkLogic as a nine out of ten. I give it a nine out of ten because it is a very powerful and reliable platform with strong performance and offers flexibility and enterprise-grade capabilities. The only reason I would not give it a ten is because of the learning curve and some limitations in tooling and integrations. Overall, it is a highly capable and robust platform that has delivered strong business outcomes for us.

    Which deployment model are you using for this solution?

    Private Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Anonymous

    Robust Data Management with MarkLogic's Advanced Features

    Reviewed on May 25, 2026
    Review provided by G2
    What do you like best about the product?
    I like Progress MarkLogic's ability to handle multiple types of data on a single platform while providing very fast search and query performance. It's a big plus that it combines tools for document storage, search, and integration, simplifying the architecture significantly. I appreciate its flexibility since it supports schema and flexible data models, allowing new data sources to be onboarded quickly without a lot of time spent on schemas. The built-in search capability with indexing and full-text search is powerful and performs well even with large enterprise data sets, which improves the application's performance and user experience. I also appreciate its scalability and reliability. It supports clustering, high availability, and enterprise-grade security features, making it suitable for mission-critical applications. Overall, the flexibility, strong search capabilities, scalability, and enterprise security are what I really like about MarkLogic.
    What do you dislike about the product?
    One big challenge is the learning curve. Progress MarkLogic has so many advanced features like semantic search and multimodal capabilities, which can make it take a long time for new developers to fully understand the ecosystem, query language, and best practices. This can slow down onboarding compared to more commonly used databases. Another area that needs improvement is the cost. Being an enterprise-grade solution, the licensing and infrastructure costs are relatively high, which can sometimes make it hard for small teams to adopt it compared to open-source alternatives. Additionally, the development and debugging experience could be better. Troubleshooting complex queries or performance issues can require deep platform knowledge. More effective debugging tools, monitoring dashboards, and documentation would make daily operations easier. Also, because it's such a feature-rich platform, I sometimes end up using only a small percentage of its capabilities. So for simpler use cases, it can occasionally feel heavier than necessary.
    What problems is the product solving and how is that benefiting you?
    Progress MarkLogic centralizes data management, enhancing fast search and real-time retrieval. It streamlines complex data integration from multiple formats, improving consistency without heavy transformations. Its built-in search engine boosts query performance even with large datasets, reducing development effort while offering strong security features.
    PranavChaudhari

    Unified document modeling has streamlined multi-format data integration and querying

    Reviewed on Apr 06, 2026
    Review from a verified AWS customer

    What is our primary use case?

    MarkLogic  has been instrumental in various data-related tasks throughout my projects. When I joined a project, I started using MarkLogic  for integrating data from multiple legacy systems. Since then, I have utilized it for data ingestion, transformation, and querying tasks. In one of our projects, we were integrating customer data coming from two different sources. One source was sending XML data, and another source was sending JSON data. We combined them and stored it in one format.

    We used MarkLogic to ingest both data into a collection and applied logic to transform it into a map where fields like address, customer ID, and customer detail were structured differently in both systems. We normalized them into a single model that our downstream system could use. When we introduced a new field into one of the source systems, instead of redesigning everything, we simply updated the transformation logic and started storing that field into the document.

    Another use case involved handling both XML and JSON data effectively. Once we adjusted our understanding of patterns, MarkLogic effectively handled those formats, making it easier to adapt to changes without major rework. MarkLogic offers the best features including handling both XML and JSON data effectively, and we have a lot of indexes. For example, we can index a specific element in the database of a document.

    What is most valuable?

    MarkLogic offers the best features, including handling both XML and JSON data effectively in tandem with flexible transformation logic where needed, without the issue of redesigning in case of format changes. MarkLogic made data handling easier without substantial rework. We have many indexes; for example, we can index a specific element in the document.

    We also manage role-based user access at the document level seamlessly, enhancing security.

    Regarding the impact, we reduced the time spent on data preparation and almost saved two weeks of time for everyone each quarter. MarkLogic made our process smoother and faster, enhancing collaboration and efficiency between teams. With efficient configurations, we completed more projects in less time, thereby improving productivity.

    What needs improvement?

    I wish I had known one thing earlier about MarkLogic, specifically regarding indexing. Initially, our focus was mostly on ingestion and transformation, and things seemed fine when the database was smaller. However, as our data size grew, queries started performing slowly. We realized the importance of configuring the right indexes. Performance heavily depends on well-indexed documents.

    It is about learning; concepts such as indexing, data modeling, and writing efficient queries are not very straightforward. MarkLogic should be more beginner-friendly, with resources providing hands-on experiences. Debugging  performance issues or unexpected results is sometimes challenging, necessitating extra log analysis.

    Regarding upgrades and environment management, we faced challenges planning for upgrades. It was not straightforward, and the impact on configurations and queries could not be easily estimated. Improved upgrade documentation and guidance would enhance the experience.

    I think it would be beneficial if MarkLogic allowed the use of Python for querying, as currently, the options are limited to XQuery and JavaScript.

    For how long have I used the solution?

    I have been working in my current field for about five years.

    What do I think about the stability of the solution?

    MarkLogic is stable.

    What do I think about the scalability of the solution?

    In terms of scalability, MarkLogic supports horizontal scaling, allowing more nodes to be added to distribute load. It can handle very large datasets efficiently. I find it quite scalable based on our project experiences.

    How are customer service and support?

    Regarding customer support, in my experience, it was generally good, although we did not have to rely on it too frequently. Most issues were resolved through documentation or internal knowledge. Support is reliable, and responsiveness is rated well in the industry. I would rate MarkLogic's customer support around seven point five out of ten.

    Which solution did I use previously and why did I switch?

    Previously, we did not use any specific solution; we relied on a combination of relational databases along with ETL scripts for data integration. Handling XML and JSON required separate logic, and maintaining pipelines took considerable effort. We faced challenges with changing requirements. We switched to MarkLogic for its unified approach to manage different data formats and reduce tool necessity, thereby enhancing efficiency.

    What was our ROI?

    In terms of return on investment, centrally managed data allowed for substantial storage, transformation, and migration. MarkLogic is efficient and low-maintenance, contributing significantly to our success.

    What's my experience with pricing, setup cost, and licensing?

    Regarding pricing, the setup cost of MarkLogic is quite high. It requires a larger budget, depending on data size, with larger data sizes demanding more clusters, directly influencing cost.

    Which other solutions did I evaluate?

    Before choosing MarkLogic, we evaluated options, including Oracle and MongoDB. Relational databases lacked flexibility, and MongoDB, despite its flexibility, required more custom work for advanced search and indexing. MarkLogic provided comprehensive advantages with inbuilt features and better native XML and JSON handling.

    What other advice do I have?

    The advice I would give to others looking into using MarkLogic is to emphasize the importance of indexes. Understand  your document structure and consider data integration needs when dealing with multiple formats. An improved indexing strategy significantly enhances performance. Start with a smaller pilot use case instead of a universal rollout at once. I would rate this review an eight out of ten.

    Mampi Bhattacharya

    Unified data hub has streamlined multi-format integration and improved data quality

    Reviewed on Apr 04, 2026
    Review provided by PeerSpot

    What is our primary use case?

    Our main use case for MarkLogic  is as a data hub for integrating and managing data from multiple sources. I mostly work on ingesting data from different systems and transforming it and storing it in a structured way so it could be used by downstream applications. For example, in one of our projects, we were pulling data from a couple of legacy systems and normalizing it into a common format using a MarkLogic  pipeline. I also use it for writing queries, mainly XQuery or JavaScript, to fetch and process data based on day-to-day business needs. Sometimes, it is about building APIs on top of MarkLogic to expose that data to other services. Overall, it is a mix of data ingestion, transformation, and querying, making that data usable for applications.

    When considering that use case, the biggest difference I notice with MarkLogic compared to traditional approaches or manual processes is how much it reduces the effort of handling different data formats. Before this, in some projects, we had to write separate pipelines or scripts to process XML, JSON, and other formats. This became quite fragmented. With MarkLogic, we could handle both structured and unstructured data in the same place, which simplified things a lot. For example, in one use case, we were getting data from two systems, one sending XML and another JSON. Instead of building separate transformation layers, we ingested both into MarkLogic and then harmonized them into a common model. That saved a lot of development time and also reduced complexity. Another thing that stands out is how quickly we could query and retrieve data. Since everything is stored in a flexible document model, we did not have to redesign schemas every time whenever requirements changed. That made iterations much faster compared to traditional setups. It is not always simple. There is a learning curve, especially with data modeling and writing efficient queries. Once the team gets comfortable, it becomes a very powerful platform for data integration.

    One thing I did not expect initially was how useful MarkLogic is for tracking data lineage and debugging data issues. When something looked wrong in the final output, we could actually trace back how the data was ingested and transformed step-by-step. Since everything is stored as documents and transformations are applied in stages, it became easier to identify where things went wrong. Regarding one of the cases I remember, a field was getting overwritten during harmonization, and instead of digging through multiple systems, we tracked it directly within MarkLogic and fixed it quickly. The flexibility of the data model helped a lot. When requirements kept changing, it really helped. Instead of redesigning schemas, we could adjust the document structure and update transformations, which made our workflow more adaptable. Overall, it helped with debugging and maintaining data quality, which was a nice advantage.

    What is most valuable?

    A few features really stand out for me, especially when working on data integration projects. One of the most valuable ones is its ability to handle both structured and unstructured data in a single platform. For example, we deal with XML, JSON, and some semi-structured data from different systems. MarkLogic allows us to store and query everything together without needing separate pipelines. That simplified our architecture quite a bit.

    Another strong feature is the flexible document model. Unlike relational databases, we do not have to strictly define schemas upfront. In one project, requirements kept changing and it was dynamic. Instead of redesigning tables, we could adjust document structures and just continue with that. That saved a lot of rework. I also found the search and indexing capabilities very powerful. Even with large datasets, we could retrieve data quickly. We had use cases where we needed to search across multiple kinds of fields and documents, and MarkLogic handled that efficiently without much tuning.

    One feature I personally relied on a lot is the built-in support for XQuery and JavaScript. It gave us flexibility in how we query and process data. For some transformations, XQuery worked really well, while for others we used JavaScript, depending on the complexity. Overall, what stands out is how all these capabilities are available in one platform, which reduces the need for multiple tools.

    The flexible document model probably had the biggest impact on our projects. I can share a good example from a project where we were integrating customer data from multiple systems. Each system had a slightly different structure: some had extra fields, some were missing fields, and formats were not consistent at all. If we used a relational database, we would have had to keep altering schemas or creating multiple tables to handle those variations. But with MarkLogic, we could store each record as a document and gradually harmonize it into a common model. At one point, the business added new attributes to customer data midway through the project. Instead of redesigning anything, we just updated the transformation logic and started storing the new fields in the document. That change was handled pretty smoothly without impacting existing data. That flexibility saved a lot of time and avoided rework. It also made our system more adaptable to changing requirements, which happens quite often in real projects.

    One small but really useful feature is the built-in transaction and consistency handling. In one scenario, we were ingesting data in batches and there was a failure midway due to a data issue. MarkLogic handled it in a way that partial data was not left in an inconsistent state, which saved us from a lot of cleanup work. Additionally, the security and role-based access control is quite granular. We had cases where certain data needed to be restricted to specific roles. We managed that at a document level, which was really very helpful. These are not things you notice immediately, but in real-world usage, they make the system much more reliable and easier to manage.

    MarkLogic has had a strong impact, especially in how we handle and use data across teams. Earlier, data was scattered across different systems, and getting a unified view required a lot of manual effort and coordination. With MarkLogic acting as a central data hub, that process became much more streamlined. Instead of pulling data from multiple sources every time, we had a consolidated dataset available, which made development and reporting faster. It reduced a lot of back and forth between teams and saved a few hours every week during development and testing stages. I think it also improved data quality. Since we had defined ingestion and transformation steps, inconsistencies were caught earlier rather than later in the pipeline. Overall, it helped improve efficiency, reduced duplication of work, and made it easier for teams to rely on consistent data.

    In terms of metrics, we did see measurable improvements over time. Earlier, when we had to pull and reconcile data from multiple systems manually, it used to take a few hours for certain use cases. After moving that into MarkLogic pipelines, the same process became almost near real-time or at least reduced to minutes in most cases. In terms of development effort, we saved around 20 to 30 percent of time on data handling tasks, mainly because we did not have to build separate transformation layers or repeatedly fetch data from different sources. We also saw fewer data-related issues during testing since data was being standardized during ingestion, so inconsistency reduced. Improvements were noticeable in faster data access and also reduced manual effort and better data consistency.

    What needs improvement?

    One area where I feel MarkLogic can improve is the learning curve. When I first started, concepts like data modeling, indexing, or writing efficient XQuery were not very intuitive. It took some time to understand how to structure data properly to get good performance. More beginner-friendly guidance or simplified onboarding would help new users. Another thing is tooling and debugging experience. While it is powerful, debugging queries or transformations can sometimes feel less straightforward compared to more modern developer tools. In a few cases, we spent extra time figuring out why a query was not performing well. Better integration with modern development ecosystems and more lightweight setup options would make it easier for teams to adopt quickly. It is a strong platform, but improving ease of use and developer experience would make a big difference.

    One small thing I have noticed is around monitoring and visibility. When things are running fine, it is great, but when something slows down, such as a query or ingestion job, it is not always very easy to quickly pinpoint the issue. We sometimes had to dig through logs or try different things to understand what was happening. Having more intuitive monitoring dashboards or clearer, cleaner insights into query performance and pipeline stages would be really helpful. Small improvements in documentation with more real-world examples would make a difference, especially for new developers trying to understand best practices. Nothing major, but these small things would make day-to-day usage smoother.

    For how long have I used the solution?

    I have been using MarkLogic for around two years.

    What do I think about the scalability of the solution?

    In my experience, MarkLogic has handled scalability quite well, especially as our data volumes grew. For example, in one project, we started with a relatively small dataset, but over time it scaled to millions of documents. We did not have to redesign the system; we mainly scaled by adding more resources and adjusting indexes. The platform handled that transition smoothly. On the user side, as more services started consuming data from MarkLogic, we did not see major issues in terms of performance. Query response times remained fairly consistent as long as indexing and data modeling were done properly. Overall, it scales well, but getting the best performance depends on how well you design and configure it.

    How are customer service and support?

    Regarding customer support for MarkLogic, in our experience, we did not rely heavily on customer support on a day-to-day basis because once the system was set up, it was quite stable, and most issues could be handled internally. That said, there were occasions, especially early on, where we needed help around configuration and performance tuning. In those cases, the support team was helpful and guided us in the right direction, although it sometimes took back and forth to fully resolve the issue. Overall, the support is considered above average, with good responsiveness and helpful guidance when needed. We mostly relied on internal expertise and documentation and reached out for complex issues.

    Which solution did I use previously and why did I switch?

    Before moving to MarkLogic, we were mainly using a combination of relational databases and some custom ETL scripts for handling data integration. The challenge with that setup was that it became quite complex as the data volume and variety increased. For example, handling both XML and JSON data required separate processing logic, and maintaining those pipelines took a lot of effort. We switched to MarkLogic because it offered a more unified approach, allowing us to handle different data formats in one place and reducing the need for multiple tools and scripts. The flexibility of the document model and built-in capabilities for ingestion and querying made the overall system simpler and easier to manage compared to what we had earlier.

    Which other solutions did I evaluate?

    We evaluated a few other options before choosing MarkLogic. We looked at traditional relational database solutions as well as a couple of NoSQL options like MongoDB. Relational databases were strong for structured data but did not handle changing schemas or mixed data formats well for our use case. MongoDB was closer to what we needed in terms of flexibility, but when it came to advanced search, indexing, and built-in support for both XML and JSON, MarkLogic had an edge.

    What other advice do I have?

    I suggest starting with a clear understanding of your data use case before adopting MarkLogic, especially if you are dealing with multiple data formats or large volumes of data. That is where it really adds value. From our experience, it is important to invest time early in data modeling and indexing. We initially underestimated that, leading to performance issues later. Once we optimized indexes properly, things improved a lot. Also, make sure your team gets comfortable with the basics of XQuery or JavaScript in MarkLogic. There is a learning curve, but once you get past it, development becomes much smoother. I recommend starting with a smaller use case or pilot project, which helped us understand how to structure data and pipelines before scaling it across the system. It is a powerful platform, but you get the best results when you plan the foundation properly and gradually build on it.

    Regarding additional thoughts on MarkLogic, one additional thought I would add is how MarkLogic fits as a long-term solution. From my experience, it is not just a tool you use for a short-term project. Once it is set up properly, it becomes a core part of your data architecture. More teams gradually started relying on it once they saw the value of having centralized and consistent data. It is important to have the right expertise in the team. It is a powerful platform, but to get the best out of it, you need people who understand how to design data models and optimize queries properly. It is a strong platform for enterprise use when you treat it as a long-term investment rather than just a quick solution.

    My overall rating for MarkLogic is 8 out of 10.

    reviewer2813067

    Unified search and data management has simplified complex XML and JSON workflows

    Reviewed on Apr 04, 2026
    Review from a verified AWS customer

    What is our primary use case?

    I used MarkLogic  for a total of two years.

    When it comes to main use cases, I used MarkLogic  as a backend service for handling complex structured data such as XML or JSON. I have REST API services and modules using XQuery where the system needed efficient storage, query, and transforms of large volumes of data. Additionally, I worked with real-time ingestion pipelines where data from multiple sources were processed and stored in MarkLogic, enabling real-time access and updates.

    MarkLogic is designed for multi-handle multi-model data, which means it can natively store and query XML documents, JSON documents, and unstructured and semi-structured data. Instead of normal database joins, MarkLogic works by querying inside a document efficiently using indexes. In one of my projects, we used MarkLogic to manage a large-scale document processing system, where we ingested data from multiple upstream systems in XML and JSON format, such as product or property-related data. As soon as the data was ingested, it became immediately searchable due to MarkLogic indexing. MarkLogic handles semi-structured data by storing it as a document, automatically indexing it, and allowing real-time query and updates using XQuery and strong consistency.

    My experience with MarkLogic demonstrates how we leveraged its features beyond just data storage. For example, I worked on optimizing queries written in multiple modules, mostly related to searching with text and applying structured filters, which significantly improved query accuracy and performance. Apart from basic features, I have worked on performance tuning, indexing strategies, and combining full-text search with structured query. I also used MarkLogic as both a database and search engine, which helped to simplify our architecture.

    In our use cases, MarkLogic's universal indexing and clustering have a direct impact on performance and scalability, and it has helped us significantly. In normal databases, we need to define indexes up front, and if a new query comes in, we often need a schema or indexes. In MarkLogic, all data such as XML and JSON were automatically indexed, and we did not need to pre-plan any query patterns. In real time, we had a dynamic search requirement with filters, pricing, location, and keywords, and instead of creating multiple indexes manually, we leveraged our universal index plus range index. For example, when a user searches with multiple filters plus keywords, queries are still fast because MarkLogic uses its internal index instead of scanning documents. Regarding clustering, we have our MarkLogic clustered environment. When multiple nodes work together, horizontal scaling is part of it, as we could add more nodes if data grew, ensuring high availability. For instance, if one node failed, another would handle that traffic. During high traffic, the system stayed stable, and we handled the large data volume without performance degradation. Universal indexing helps us avoid manual indexing management while still providing fast queries for dynamic searches, and clustering allows us to scale horizontally and handle high traffic with no latency and high availability.

    What is most valuable?

    MarkLogic offers several powerful features. First, there is universal indexing, in which it automatically indexes all the stored XML and JSON documents. Second, it can handle both XML and JSON unstructured data in a single database, which makes it flexible for complex and evolving data requirements. The third is the ACID property. Unlike many NoSQL databases, it provides strong consistency with ACID transactions, which is critical for real-time and reliable applications. It also supports horizontal scaling and clustering, which helps in handling large volumes of data and high traffic efficiently.

    One feature that stood out to me in our project is its ability to combine search and database capability on a single platform. The tight integration of full-text search with structured query makes it very powerful for building real-time search applications without relying on any external tools. It simplifies our architecture and reduces system complexity. I also appreciate its flexibility with data models, especially handling both XML and JSON seamlessly, which can be very useful in our use cases with multiple data resources.

    Using MarkLogic has had a significant positive impact on our organization, especially in terms of performance, flexibility, and reliability. With MarkLogic's universal indexing and built-in search, we have seen query response times improve noticeably. Complex searches dropped from a few seconds to sub-second response times in many cases. Users could perform combined keyword plus filter searches in real time, directly improving our application experience. Before implementing MarkLogic, we were using a relational database and NoSQL as separate search engines, requiring Elasticsearch and others. With MarkLogic providing a single platform for both storage and search, we reduced integration overhead, maintenance efforts, and failure points. The schema flexibility for XML and JSON allowed us to onboard new data sources faster. The ACID transactions that MarkLogic provides are crucial and something rarely supported by NoSQL databases. MarkLogic improved our system by enabling faster search, reducing the response time from seconds to sub-seconds, reducing architectural complexity by combining database and search, and improving reliability through ACID transactions and clustering.

    What needs improvement?

    While MarkLogic is powerful, there are areas where I feel it could improve. When I started with MarkLogic, I found that its learning curve and developer experience are not very comfortable for beginners. Technologies such as XQuery are less common compared to Java and Python, so new developers take time to get comfortable with it. Improving documentation and modern tooling would greatly aid onboarding. Second, the cost and licensing can be a concern for smaller teams and startups. MarkLogic's enterprise status makes it less likely to be the first choice for those teams. While it supports deployment in the cloud, the experience could be more seamless compared to fully cloud-native databases. Overall, MarkLogic is excellent for enterprise use cases, especially where search and structured data need to work together, but improving developer experience and ecosystem support would enhance its efficiency.

    A couple of additional areas where MarkLogic could improve are around integration, performance tuning, visibility, and support experiences. While MarkLogic supports REST APIs well, integrating with the modern data ecosystem sometimes requires extra effort compared to other platforms, as out-of-the-box connectors are limited. Although performance is strong, understanding query behavior can be challenging, and debugging slow queries or analyzing indexing usage is not always transparent. Regarding support and documentation, response times can vary depending on the issue or server availability. More real-world examples and troubleshooting guides would enhance developer productivity. Improvements in integration and modern tools in XQuery, along with better observability, are necessary.

    Beyond what I mentioned earlier, there are a few additional areas I can point to. While MarkLogic supports powerful querying via XQuery and JavaScript, many developers are more comfortable with SQL. An intuitive SQL-like query support or a better abstraction layer would enhance adoption across teams. Furthermore, migrating from other databases, whether relational or non-SQL, requires effort in data transformation. Better migration tooling with automated data mapping would also make transitions smoother.

    For how long have I used the solution?

    In my current field, I have been working for the last three years.

    What do I think about the stability of the solution?

    MarkLogic is pretty stable in my experience. It is highly stable and reliable.

    What do I think about the scalability of the solution?

    MarkLogic offers excellent scalability, especially for enterprise-scale applications. In our use case, as data and traffic increased, we were able to scale by adding nodes to the clusters without major changes to the applications, making the scaling very smooth and predictable.

    How are customer service and support?

    MarkLogic has been generally good and reliable in my experience. When I connect with them, it is very responsive. I have gone through support tickets, and proper tracking is available, so overall, it is a good support system, and I would rate it slightly higher than average.

    I would rate MarkLogic's customer support an eight due to its responsiveness, especially for higher priority issues. Support engineers demonstrate good product expertise, and the structure of the ticketing and enterprise support models work well. If someone inquires, I would suggest looking for alternatives if their team is small or they have cost constraints, but if there are no budget issues and their team is large, MarkLogic is reliable and comfortable, providing scalability.

    Which solution did I use previously and why did I switch?

    Before adopting MarkLogic, we were using a combination of traditional relational databases such as Oracle along with a separate search solution, such as Elasticsearch.

    The main reason for switching from Oracle and Elasticsearch to MarkLogic was simplifying our architecture by consolidating database and search into a single platform. With Oracle and Elasticsearch, we had two separate systems, and syncing between them was complex and error-prone. MarkLogic allowed us to manage these components on one platform. Given that our data was semi-structured, managing it in a relational database was tough, but MarkLogic's document model made schema evolution easier without extensive migration.

    How was the initial setup?

    We did not purchase MarkLogic through the AWS  marketplace.

    What was our ROI?

    We saw a clear return on the investment after implementing MarkLogic in terms of saving and personnel efficiency. Since we did not need a separate database and search system, we avoided building and maintaining integrations. This led to roughly a thirty to forty percent reduction in backend development effort. With flexible schema and universal indexing, new features and data sources were onboarded faster, reducing feature delivery time by around forty to fifty percent. In terms of infrastructure and maintenance, we also achieved twenty to thirty percent savings in infrastructure and maintenance overhead.

    What's my experience with pricing, setup cost, and licensing?

    My experience with MarkLogic's pricing and licensing is that it positions itself as an enterprise-grade product. The cost is relatively high compared to open-source alternatives. We use enterprise licensing models, which gives us access to enterprise features and official support. The initial setup cost is moderate to high, mainly due to infrastructure provisioning, licensing costs, and initial configuration and onboarding efforts.

    Which other solutions did I evaluate?

    Before finalizing MarkLogic, we evaluated a few alternatives. We looked at MongoDB, which is good for flexible document storage but required a separate search solution for advanced queries. We also considered using Oracle, which is strong and reliable but less flexible for semi-structured data. Therefore, we selected MarkLogic because it uniquely provides multi-model support along with built-in search and ACID transactions with real-time indexing.

    What other advice do I have?

    My experience with MarkLogic has been very positive. It is a powerful platform, especially for data-driven and search-driven applications where handling complex XML and JSON data and real-time querying is important. The combination of database and search capabilities along with strong consistency and scalability make it an excellent choice for enterprise use cases. However, there are areas such as developer experience, ecosystems, and the learning curve that could be improved to enhance accessibility. I would rate MarkLogic an eight overall.

    Which deployment model are you using for this solution?

    Hybrid Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    View all reviews