Website Scraper and e107 CMS Websites
e107 CMS Websites and Website Scraper Tool
Some websites uses a content management system. Such systems sometimes include code that prevent website crawling by unknown robots.
From reports by users of A1 Website Scraper it seems that
e107 CMS is one such system.
Program Website Scan Settings for e107 CMS
Scan website | Crawler engine: Set max simultaneous connections/threads to one.
Crawler engine | Advanced engine settings: Set GET as default for page requests.
Mask the identity of the crawler in our website scraper software:
- Mimic "user surfing website":
- In General options | Internet crawler set user agent to Mozilla/4.0 (compatible; MSIE 7.0; Win32).
- In Scan website | Webmaster filters disable/uncheck Download "robots.txt" and Obey "robots.txt" file if found.