Microsys
  

Crawl PDF Files When Using Website Scraper

You can have content in PDF documents analyzed during site crawl when using our website scraper tool.

Crawl and Analysis of PDF Files

First we need to enable a special setting to crawl content inside PDF files:

crawl PDF files

After this we point our website scraper to a PDF to TEXT conversion tool executable:

pdf files conversion tool

After having configured above, crawl your website as you normally would when using A1 Website Scraper - the scan will include analysis of PDF files like this example file.
A1 Website Scraper
A1 Website Scraper | help | previous | next
Extract data from sites into CSV files. By scraping websites, you can grab data on websites and transform it into CSV files ready to be imported anywhere, e.g. SQL databases
This help page is maintained by
As one of the lead developers, his hands have touched most of the code in the software from Microsys. If you email any questions, chances are that he will be the one answering.
Share this page with friends   LinkedIn   Twitter   Facebook   Pinterest   YouTube  
 © Copyright 1997-2024 Microsys

 Usage of this website constitutes an accept of our legal, privacy policy and cookies information.