Listing Thumbnail

    Diffbot APIs

     Info
    Sold by: Diffbot 
    Diffbot's mission is to structure the world wide web to create the first complete map of public human knowledge.

    Overview

    Diffbot's APIs are the easiest way for developers to integrate data from the web. Instead of writing rule-based scrapers, which are prone to break and difficult to maintain, Diffbot automatically classifies any URL into one of 20 common page types and automatically extracts the page into structured data using computer vision and natural language processing. Use Diffbot on individual pages, build structured database by crawling entire sites, or query for facts across the entire web using the world's largest Knowledge Graph.

    Highlights

    • Automatic data extraction from articles, products, images, discussion threads, video pages and more.
    • With Crawlbot, crawl and extract data from entire sites.
    • See all of our tiers at https://www.diffbot.com/pricing

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Diffbot APIs

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (3)

     Info
    Dimension
    Cost/unit
    Startup Tier (1000 API calls)
    $1.176
    Plus Tier (1000 API calls)
    $1.059
    Pro Tier (1000 API calls)
    $0.941

    Vendor refund policy

    Please see seller website for refund details.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Email support available for Startup level. Phone and email support available for Plus and Professional. support@diffbot.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    |
    29 external reviews
    Star ratings include only reviews from verified AWS customers. External reviews can also include a star rating, but star ratings from external reviews are not averaged in with the AWS customer star ratings.
    Justin W.

    The most Competant Web Crawling Service I've used

    Reviewed on Feb 03, 2023
    Review provided by G2
    What do you like best about the product?
    Overall, Diffbot's tools are simple to use and understand outside of more complex use cases. We use several of their features to deliver content insights to our clients. I would recommend Diffbot to any person or organization that needs to pull large amounts of data from arbitrary web sources.

    The first tool we use is the crawlbot, which we appreciate is configurable and extremely capable. In most of our use cases - we just need to point to a URL and have it repeat every so often to discover new content. After crawling, the data is available via an easy-to-parse JSON file.

    We also use the Diffbot Knowledge Graph API. The powerful DQL language allows us to query a massive amount of data to find articles and entities. DQL is simple to use, and the GUI interface allows easy testing and iteration.

    Diffbot's customer service is also exceptional. Our contact has been very attentive in helping us learn how to properly use Diffbot's services to meet our needs. He has organized one-off Zoom meetings to walk us through the appropriate method for creating DQL queries and has expedited bug fixes required for our use cases.
    What do you dislike about the product?
    Diffbot is a powerful tool, and with its numerous capabilities, it can be difficult for those unfamiliar with it to understand how to use it properly. Fortunately, Diffbot provides excellent customer service, which can help guide you through the process of determining the best practices for your use case.
    What problems is the product solving and how is that benefiting you?
    Diffbot offloads the complex and difficult process of web crawling, scraping and analysis/parsing. Rather than writing our own in-house web crawler, we can spend our time elsewhere building features for our clients.

    Diffbot's Knowledge Graph allows us to find relationships between articles and entities across the web in near real-time. This feature has been invaluable in providing insightful information to our clients.
    Kurt L.

    Diffbot is a game-changer.

    Reviewed on Dec 07, 2022
    Review provided by G2
    What do you like best about the product?
    Diffbot makes the difficult task of managing data and extracting useful information much easier. They provide access to a seemingly infinite amount of company and contact information and are continuously improving their user interface to add even more value. I use Diffbot every chance I can!
    What do you dislike about the product?
    Diffbot is very responsive and always willing to help. Their interface still needs some improvements, but I have been their client for over a year now and have seen vast improvements.
    What problems is the product solving and how is that benefiting you?
    Diffbot is a better version of ZoomInfo with more capabilities beyond primary company, industry and contact info. They have additional tools which allow for data enrichment and are progressing towards in-depth market analytics. Indeed a total-package solution.
    Computer Software

    Diffbot Increases Efficiency

    Reviewed on Feb 25, 2021
    Review provided by G2
    What do you like best about the product?
    Prior to using Diffbot, we relied primarily on RSS feeds and a web scraping tool that is based on the visual layout and HTML of a webpage. We were very dependent on X Paths to get the data we wanted. We find that the Diffbot crawlers are more stable in the long term because they are not as impacted by website design changes. This saves us a lot of time that we would otherwise be spending on maintenance.
    What do you dislike about the product?
    The two issues that are most challenging for us are:

    1. Diffbot does not recognize PDF documents, and we frequently would like to ingest them as articles.

    2. We find it difficult to troubleshoot a crawler in situations where it is not bringing in data or it is not bringing in the data we are expecting.
    What problems is the product solving and how is that benefiting you?
    The biggest problem that Diffbot solved for us is reducing the amount of maintenance we have to do on our scraped websites. We use heavily Diffbot's full text capability and Diffbot’s metadata is also useful for us. The metadata that we use most is Diffbot’s language designation to ensure that our clients are seeing only articles in the languages that they choose.

    We also see great potential for using the bulk API to become more efficient in our content ingest process and we are excited to continue to explore this option.
    Venture Capital & Private Equity

    Great enrichment tool

    Reviewed on Feb 12, 2021
    Review provided by G2
    What do you like best about the product?
    1) Enrichment data
    2) Ability to query data in aggregate
    What do you dislike about the product?
    1) Being charged based on entities
    2) Being charged as we go (I wish there was a way to limit my queries)
    What problems is the product solving and how is that benefiting you?
    Lead enrichment
    Lead sourcing
    Customer profiling
    Tom W.

    Excellent and reliable service over 4 years

    Reviewed on Jan 21, 2021
    Review provided by G2
    What do you like best about the product?
    High detection accuracy and uptime: most of the time we can send API requests and know that the responses from Diffbot will be valid.
    What do you dislike about the product?
    Some old versions of Python are used (<3.0) and could be upgraded.
    What problems is the product solving and how is that benefiting you?
    We have been using the Article and Analyse APIs as a core part of our pipeline. After doing a build-vs-buy comparison, we realized that it would be far preferable to leave this step to an external best-in-class solution, rather than to build (and importantly *maintain*) in-house. Wherever the automated page structure analysis fails, our team can easily "teach" it the structure, and in the rare cases where that fails, the Diffbot team are very responsive to address issues.
    View all reviews