Diffbot
Diffbot’s APIs automatically extract structured data from any web page. Because Diffbot’s extraction combines computer vision with machine learning, it works autonomously and at scale.
Automatically Extract Content from Page
Automatic data extraction from articles, products, discussion...
Extract Discussions from Webpages
Automatically extracts clean threads, reviews, and comments f...
Extract Images from Webpage
Extract the primary image(s) of a submitted web page and get ...
Extract Products from Webpage
Automatically extract complete data from any shopping or e-co...
Extract Videos from Webpage
Automatically extract detailed video information—including mo...
News and Content Search
Search from Diffbot's entire database (currently about 800 mi...
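As a rough illustration of how one of the extraction APIs listed above might be called, the sketch below builds a request URL for a chosen API and fetches the JSON result. The base URL, the endpoint names, and the `token` and `url` query parameters are assumptions not stated in this listing, and `build_extract_url` and `fetch_extraction` are hypothetical helper names; check them against Diffbot's current documentation before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL and endpoint names (not confirmed by this listing).
BASE_URL = "https://api.diffbot.com/v3"
ENDPOINTS = {"analyze", "article", "discussion", "image", "product", "video"}


def build_extract_url(api, page_url, token):
    """Return the request URL for one extraction API (hypothetical helper)."""
    if api not in ENDPOINTS:
        raise ValueError(f"unknown API: {api!r}")
    # urlencode percent-escapes the target page URL so it survives as one parameter.
    query = urlencode({"token": token, "url": page_url})
    return f"{BASE_URL}/{api}?{query}"


def fetch_extraction(api, page_url, token, timeout=30):
    """Call the API and return the parsed JSON response (requires network access)."""
    with urlopen(build_extract_url(api, page_url, token), timeout=timeout) as resp:
        return json.load(resp)
```

For example, `build_extract_url("product", "https://example.com/item", "YOUR_TOKEN")` only builds the URL and makes no network call, which keeps the request format easy to inspect before spending API quota.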