Microsys
  

Crawl and Analyze PDF Files When Crawling Websites with Website Scraper

You can have content in PDF documents analyzed during site crawl when using our website scraper tool.
Help: overview | previous | next

 To see all the options available, you will have to switch off easy mode 

 With options that use a dropdown list, any [+] or [-] button next to adds or removes items in the list itself 

Crawl and Analysis of PDF Files

First we need to enable a special setting to crawl content inside PDF files:

crawl PDF files

After this we point our website scraper to a PDF to TEXT conversion tool executable:

pdf files conversion tool

After having configured above, crawl your website as you normally would when using A1 Website Scraper - the scan will include analysis of PDF files like this example file.
A1 Website ScraperA1 Website Scraper | help | previous | next

Extract data from sites into CSV files. By scraping websites, you can grab data on websites and transform it into CSV files ready to be imported anywhere, e.g. SQL databases
This help page is maintained by

As one of the lead developers, his hands have touched most of the code in the software from Microsys. If you email any questions, chances are that he will be the one answering.
Share this page with friends   LinkedIn   Twitter   Facebook   Pinterest   YouTube  
 © Copyright 1997-2020 Microsys

 Usage of this website constitutes an accept of our legal, privacy policy and cookies information.