BatchURLScraper - Data extraction using XPath, CSSPath, XQuery and Regex

Started by majento, 11-17-2020, 09:52:13


majento (Topic starter)

Hello!

We would like to present BatchURLScraper, a free program for extracting data from web pages using XPath, CSSPath, XQuery and Regex.







BatchURLScraper features:

  • data parsing and extraction from a list of URLs
  • flexible configuration of parsing using XPath, CSSPath, XQuery and Regex extraction methods
  • export reports to Excel (CSV format)
Download page (5 MB): https://site-analyzer.pro/soft/batch-url-scraper/

We would be glad to receive any feedback and suggestions about the program.
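
For readers new to these extraction methods, here is a minimal Python sketch of the general idea: take a list of URLs, pull one field with an XPath-style query and another with a regular expression, then write the rows as CSV. It is not BatchURLScraper's own code. The sample pages, the `extract` and `to_csv` helpers and the price pattern are all assumptions made up for illustration, and the standard library only supports a limited XPath subset on well-formed markup.

```python
import csv
import io
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample pages keyed by URL.
# They stand in for pages that a real scraper would download.
PAGES = {
    "https://example.com/a": (
        "<html><body><h1>Widget A</h1>"
        "<p class='price'>Price: 19.99 USD</p></body></html>"
    ),
    "https://example.com/b": (
        "<html><body><h1>Widget B</h1>"
        "<p class='price'>Price: 5.00 USD</p></body></html>"
    ),
}

# Regex rule: capture a decimal number followed by "USD".
PRICE_RE = re.compile(r"(\d+\.\d{2})\s*USD")


def extract(url, markup):
    """Return one result row for a page."""
    root = ET.fromstring(markup)
    # XPath-style rule: text of the first <h1> anywhere in the page.
    title = root.findtext(".//h1", default="")
    # XPath-style rule with an attribute predicate, refined by the regex.
    price_text = root.findtext(".//p[@class='price']", default="")
    match = PRICE_RE.search(price_text)
    return {"url": url, "title": title, "price": match.group(1) if match else ""}


def to_csv(rows):
    """Serialize the result rows as CSV text that Excel can open."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["url", "title", "price"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


rows = [extract(url, markup) for url, markup in PAGES.items()]
report = to_csv(rows)
print(report)
```

Real pages are rarely well-formed XML, so a production scraper would normally use a tolerant HTML parser instead of `xml.etree.ElementTree`.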


majento (Topic starter)

New version BatchURLScraper 1.3







What's new:

  • raised the limit on pages per parsing run from 1000 to 5000 URLs
  • added the ability to scrape through HTML templates
  • added the ability to extract data through CSSPath attributes
  • added the ability to scrape through External and Internal HTML
  • added the ability to use proxy server lists
  • fixed a bug where the User-Agent was saved incorrectly
Homepage: https://site-analyzer.pro/soft/batch-url-scraper/
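
To illustrate what extracting data through an attribute means in practice, below is a small Python sketch that collects the `href` value of every link matching a simple selector (roughly `a.item` in CSS terms). This is only an assumed illustration of the concept, not how the program implements it. The `AttrCollector` class and the sample markup are hypothetical.

```python
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Collect one attribute from every tag with a given name and class."""

    def __init__(self, tag, css_class, attr):
        super().__init__()
        self.tag = tag
        self.css_class = css_class
        self.attr = attr
        self.values = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # A class attribute may hold several space-separated class names.
        classes = (attrs.get("class") or "").split()
        if tag == self.tag and self.css_class in classes and attrs.get(self.attr):
            self.values.append(attrs[self.attr])


# Hypothetical sample markup: two product links and one unrelated link.
SAMPLE = (
    '<ul>'
    '<li><a class="item" href="/p/1">One</a></li>'
    '<li><a class="item new" href="/p/2">Two</a></li>'
    '<li><a href="/about">About</a></li>'
    '</ul>'
)

collector = AttrCollector("a", "item", "href")
collector.feed(SAMPLE)
links = collector.values
print(links)
```

The link without the `item` class is skipped, so only the two product URLs end up in `links`.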


majento (Topic starter)

New version BatchURLScraper 1.4

What's new:

  • fixed an error in the validation of HTML templates
  • optimized the handling of regular expressions
  • added the ability to ignore duplicates in scraping results
  • fixed incorrect handling of pauses between requests to web pages
  • extended the range of pauses between requests to one and a half minutes
  • finalized and improved the translation
  • fixed memory leaks
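
As a rough illustration of two items from this list, here is a Python sketch of ignoring duplicates in results and pausing between requests within a configurable range. It shows the general technique only. The function names, the random choice of delay and the 90-second cap taken from "one and a half minutes" are assumptions, not a description of the program's internals.

```python
import random
import time

# Assumed upper bound, taken from "one and a half minutes".
MAX_PAUSE_SECONDS = 90


def dedupe(values):
    """Drop repeated values while keeping the first occurrence order."""
    seen = set()
    unique = []
    for value in values:
        if value not in seen:
            seen.add(value)
            unique.append(value)
    return unique


def pause_between_requests(min_s, max_s, sleep=time.sleep):
    """Sleep for a random delay in [min_s, max_s] and return that delay.

    The sleep function is injectable so the logic can be checked
    without actually waiting.
    """
    if not 0 <= min_s <= max_s <= MAX_PAUSE_SECONDS:
        raise ValueError("pause range must satisfy 0 <= min <= max <= 90")
    delay = random.uniform(min_s, max_s)
    sleep(delay)
    return delay


results = dedupe(["/p/1", "/p/2", "/p/1", "/p/3", "/p/2"])
print(results)
```

A polite scraper would call `pause_between_requests` once before each request, which spreads the load on the target site.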
