While usually not recommended because of
duplicate content issues,
some websites mix domains, links and
www and
non-www usage in URLs.
In such cases, after configuring the site root scanned by sitemapper, which is usually the primary domain, make a list af
root aliases.
If you intend to create XML sitemaps, please check this page about
domains in XML sitemaps.
Note:
You need to use the
[+] button
to add a
root path alias into the
dropdown list.
In
Scan website | Crawler options
you can configure the sitemap generator tool to automatically add common root path aliases:
If you use
http://example.com/blogs/ as root, all paths
outside (excluding
root path aliases)
such as e.g.
http://example.com/forum/
will neither be included in
output nor for
analysis.
A better alternative might be to keep the website root as
http://example.com/, and start using
analysis filters
and
output filters
to control your website crawl and resulting output.
Websites with site
areas that have no
incoming links from within the rest of the website can sometimes cause a problem.
Remember that crosslinking
hidden pages will not
help if none of them are linked from rest of website.
This problem can easily be overcome in our sitemap generator software.
It is possible to start a website scan from multiple paths in addition to website directory root.
Note: It is often better to make sure your website is cross linked, so crawlers can find all pages in their own.