Same Website Crawl: Static vs Dynamic Database (e.g. PHP or ASP)
Abstract: The website crawler in A1 Sitemap Generator does not care about the platform your websites uses, if it is PHP or ASP, if dynamic or static etc. Navigation:A1 Sitemap Generator - Help Index
Website Crawlers Can not Tell
Using
virtual directories with URL rewriting?
Site crawlers do not care if directories in URLs are physical on the disk or virtual.
Websites using Cold Fusion, ASP.Net, JSP, PHP etc. as server side language has no consequence. Website crawlers only see the HTML generated by the server-side language.
Note:
In settings, the crawler in our site analysis tool can be set to accept/ignore URLs with certain file extensions.
If you have troubles, read about
finding all pages and links.
Sites dynamicly generated by scripts and databases are crawled without problems by site crawlers and robots.
Note:
Some search engine robots may slow when crawling URLs with ?.
However, that is mainly because search engine are worried spending crawling resourced on lots of URL with auto generated content.
To mitigate this, you can use mod rewrite or similar in your website.
Note:
Our sitemap generator and the
MiggiBot crawler engine does not care about how URLs look.
Verify Website HTML Output to Crawlers and Browsers
Normally, websites never cloak content based on
user agent string and IP address.
However, by setting the useragent ID,
you can check the
HTML source
search engines and browsers see when retrieving pages form your website.
How to Successfully and Completely Scan Your Website
Problematic Websites and Specific Website Platforms
Some few website platforms take measures against crawlers they do not recognize to
reserve bandwidth and server usage for real visitors and search engines. Here is a list of known solutions for those website platforms:
If your problem is still not solved, you can try check the solutions used for e.g.
NetSuite Websites.
You can often apply the same solutions to a wide range of websites.