Overview
Diffbot's APIs are the easiest way for developers to integrate data from the web. Instead of writing rule-based scrapers, which are prone to break and difficult to maintain, Diffbot automatically classifies any URL into one of 20 common page types and automatically extracts the page into structured data using computer vision and natural language processing. Use Diffbot on individual pages, build structured database by crawling entire sites, or query for facts across the entire web using the world's largest Knowledge Graph.
Highlights
- Automatic data extraction from articles, products, images, discussion threads, video pages and more.
- With Crawlbot, crawl and extract data from entire sites.
- See all of our tiers at https://www.diffbot.com/pricing
Details
Unlock automation with AI agent solutions

Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Cost/unit |
---|---|
Startup Tier (1000 API calls) | $1.176 |
Plus Tier (1000 API calls) | $1.059 |
Pro Tier (1000 API calls) | $0.941 |
Vendor refund policy
Please see seller website for refund details.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
Email support available for Startup level. Phone and email support available for Plus and Professional. support@diffbot.comÂ
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Customer reviews
The most Competant Web Crawling Service I've used
The first tool we use is the crawlbot, which we appreciate is configurable and extremely capable. In most of our use cases - we just need to point to a URL and have it repeat every so often to discover new content. After crawling, the data is available via an easy-to-parse JSON file.
We also use the Diffbot Knowledge Graph API. The powerful DQL language allows us to query a massive amount of data to find articles and entities. DQL is simple to use, and the GUI interface allows easy testing and iteration.
Diffbot's customer service is also exceptional. Our contact has been very attentive in helping us learn how to properly use Diffbot's services to meet our needs. He has organized one-off Zoom meetings to walk us through the appropriate method for creating DQL queries and has expedited bug fixes required for our use cases.
Diffbot's Knowledge Graph allows us to find relationships between articles and entities across the web in near real-time. This feature has been invaluable in providing insightful information to our clients.
Diffbot is a game-changer.
Diffbot Increases Efficiency
1. Diffbot does not recognize PDF documents, and we frequently would like to ingest them as articles.
2. We find it difficult to troubleshoot a crawler in situations where it is not bringing in data or it is not bringing in the data we are expecting.
We also see great potential for using the bulk API to become more efficient in our content ingest process and we are excited to continue to explore this option.
Great enrichment tool
2) Ability to query data in aggregate
2) Being charged as we go (I wish there was a way to limit my queries)
Lead sourcing
Customer profiling