WebScraper 4.7.2

WebScraper uses the Integrity v8 engine to quickly scan a website and can output the extracted data as CSV or JSON. It can also download images to a folder.

  • Easy to scan a site – just enter the starting URL and press “Go”
  • Easy to export – choose the columns you want
  • Plenty of extraction options, including HTML elements with certain classes or IDs, regular expressions, or the entire content in a number of formats (HTML, plain text, Markdown)
  • ‘Helper’ utilities within the app make it easy to find a suitable class / ID or produce a regular expression (regex) to extract the data you want
  • Since v4.1, can download all discovered images to a folder
  • Configuration of various limits on the crawl and the output file size
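
To make the extraction options above more concrete, here is a minimal Python sketch of the general idea: pull text from elements with a given class, pull values with a regular expression, then export the chosen columns as CSV or JSON. It is not WebScraper's own code or configuration format. The sample HTML, the class names (`name`, `price`) and the helper names (`ClassTextExtractor`, `extract_rows`, `to_csv`) are all assumptions made up for illustration.

```python
import csv
import io
import json
import re
from html.parser import HTMLParser


class ClassTextExtractor(HTMLParser):
    """Collect the text inside every element carrying a given class.

    Hypothetical helper for illustration; it does not special-case
    void elements such as <br> nested inside a matching element.
    """

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.depth = 0        # > 0 while inside a matching element
        self.results = []
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if self.depth or self.class_name in classes:
            self.depth += 1

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1
            if self.depth == 0:
                self.results.append("".join(self._buffer).strip())
                self._buffer = []

    def handle_data(self, data):
        if self.depth:
            self._buffer.append(data)


# Made-up page fragment standing in for a scanned page.
SAMPLE_HTML = """
<div class="product"><span class="name">Widget</span> <span class="price">$9.99</span></div>
<div class="product"><span class="name">Gadget</span> <span class="price">$24.50</span></div>
"""


def extract_rows(html):
    # Column 1: text of elements with class "name".
    names = ClassTextExtractor("name")
    names.feed(html)
    # Column 2: a regex capturing the digits after a dollar sign.
    prices = re.findall(r"\$(\d+\.\d{2})", html)
    return [{"name": n, "price": p} for n, p in zip(names.results, prices)]


def to_csv(rows, columns):
    # Only the chosen columns are written, in the given order.
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=columns,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


rows = extract_rows(SAMPLE_HTML)
csv_text = to_csv(rows, ["name", "price"])
json_text = json.dumps(rows)
```

Passing a shorter column list to `to_csv`, for example `["name"]`, mirrors the "choose the columns you want" step: the extra fields are simply left out of the export.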

What’s New in WebScraper

Version 4.7.4:

  • When opening a project file after running a scan or partial scan, any existing results are cleared from the table and the Go button is reset
  • Fixes and improvements to class/id helper:
    • if the search box had been used to filter the list, the wrong item could be sent after double-clicking a result to choose it
    • the search box above the list is now case-insensitive and searches the class names as well as the contents
  • Inherits some minor improvements to the scanning engine
  • Some ‘under the hood’ changes to enable some advanced options

Requirements for WebScraper

  • Intel, 64-bit processor
  • OS X 10.8 or later (macOS Mojave is supported)

Size: 7.21 MB