Microsys
  

Website Scraper and NetSuite CRM/ERP Websites

If your website is powered by NetSuite, A1 Website Scraper has to be configured quite precisely to crawl it successfully.

NetSuite Sites and Website Scraper Program

From feedback received from customers who use A1 Website Scraper with NetSuite, we know that our software has to be configured quite carefully. The reason appears to be that NetSuite-powered websites reserve bandwidth and server resources for real visitors and search engines, i.e. unknown crawlers are bandwidth-throttled.

While our website scraper program features lots of intelligent behavior to crawl such websites, NetSuite websites require that you configure the crawler engine quite precisely.


Website Scraper Tool Settings for NetSuite


  • Remove all analysis and output file extension filters. Use the [-] button until all extensions in both lists have been deleted. This makes our website scraper rely on MIME filters only, which is necessary because NetSuite uses various uncommon file extensions for redirects etc.

  • In Scan website | Crawler engine, set max simultaneous connections/threads to one.

  • In Crawler engine | Advanced engine settings, enable GET as the default method for page requests.

  • At this point, there are two options: mask the website scraper either as a search engine crawler or as a user surfing the website.
    • Settings to mimic "user surfing website":
      • In General options and tools | Internet crawler set user agent to Mozilla/4.0 (compatible; MSIE 7.0; Win32).
      • In Scan website | Webmaster filters disable/uncheck Download "robots.txt" and Obey "robots.txt" file if found.
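
For readers who want to see what these settings amount to at the HTTP level, the following is a minimal Python sketch. It is not how A1 Website Scraper is implemented. It only illustrates the same ideas: one connection at a time, GET requests, a browser-like user agent, and filtering by MIME type instead of file extension. The names build_request, is_page, crawl_sequentially and PAGE_MIME_TYPES are hypothetical and chosen for this example only.

```python
import urllib.request

# User agent string taken from the settings listed above.
BROWSER_USER_AGENT = "Mozilla/4.0 (compatible; MSIE 7.0; Win32)"

# Assumption: MIME types that should be treated as pages worth analysing.
PAGE_MIME_TYPES = {"text/html", "application/xhtml+xml"}


def build_request(url):
    """Build a GET request that identifies itself as a browser."""
    return urllib.request.Request(
        url,
        headers={"User-Agent": BROWSER_USER_AGENT},
        method="GET",
    )


def is_page(content_type_header):
    """Decide by MIME type, not by file extension, whether a response is a page."""
    if not content_type_header:
        return False
    # Drop parameters such as "; charset=utf-8" before comparing.
    mime_type = content_type_header.split(";", 1)[0].strip().lower()
    return mime_type in PAGE_MIME_TYPES


def crawl_sequentially(urls, opener=urllib.request.urlopen):
    """Fetch URLs strictly one at a time (a single simultaneous connection)."""
    pages = []
    for url in urls:
        with opener(build_request(url)) as response:
            if is_page(response.headers.get("Content-Type")):
                pages.append((url, response.read()))
    return pages
```

Because is_page looks only at the Content-Type header, a URL with an unusual extension is still accepted as long as the server reports it as HTML. The opener parameter exists so the loop can be tried with a stand-in instead of a live website.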

 © Copyright 1997-2025 Microsys
