Microsys
        

Sitemaps Generated have "&" Entity Escaped into "&" in URLs

Making sure XML and HTML Documents are Valid and Entity Escaped

There are many rules for XML documents and HTML files. While most internet browsers are forgiving, most sitemap validators are not.

This includes when submitting XML Sitemaps to Google. It is desired behavior that A1 Sitemap Generator entity escapes convert
  • & into &
  • < into &lt;
  • > into &gt;
in URLs, titles, descriptions etc. when building HTML and XML sitemap files. By doing this, all HTML Validators, XML Validators, internet browsers etc. can correctly parse URLs when you submit/view sitemap files.

Notes:
  • When viewing XML sitemap files in internet browsers, they will usually show &amp; as the & ampersand character. To see the actual code of generated XML sitemaps, use text editor tools or view source in your internet browser.
  • You should not copy and paste URLs with converted & to &amp; into your internet browser address field. That will normally not work.


Information and references:
  • Official sitemaps protocol: entity escaping.
    • As with all XML files, any data values (including URLs) must use entity escape codes
  • W3C HTML4 document: character entity references.
    • Authors should use "&amp;" (ASCII decimal 38) instead of "&" to avoid confusion
    • "&amp;" represents the & sign.

Webmaster and website software tools


Business and desktop software utilities

Website and webmaster guides


Search engine optimization help

 © Copyright 1997-2012 Microsys