Microsys
  

Language Detection and Analysing Pages for Similar Content

You can improve analysis of similar and duplicate content by ensuring language identification is correct.

Page Language Detection in A1 Website Analyzer

Our software determines the primary page language by checking the following things:
  1. Checks if the webserver responds with content-language HTTP response header:
    • PHP pages: Insert this code <?php header("Content-Language: en"); ?>.

  2. The page is checked for content-language META tag:
    <meta http-equiv="content-language" content="en">

  3. The page is checked for lang inside the HTML tag:
    <html lang="en">

  4. The page is searched for Open Graph Protocol attribute property og:locale inside META tags.

  5. The page is checked for xml:lang inside the HTML tag:
    <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">

  6. The page is checked for alternate / hreflang inside the link tag:
    <link rel="alternate" href="http://example.com/name-of-page.html" hreflang="en">

  7. The page URL is checked for common language/culture and country codes.

    Note: This requires enabling option Scan website | Data collection | Inspect URLs to detect language. For more info see:
  8. Planned: Compare content against word lists for each language. Select best match.

A1 Website Analyzer
A1 Website Analyzer | help | previous | next
SEO website crawler tool that can find broken links, analyze internal link juice flow, show duplicate titles, perform custom code/text search and much more.
This help page is maintained by
As one of the lead developers, his hands have touched most of the code in the software from Microsys. If you email any questions, chances are that he will be the one answering.
Share this page with friends   LinkedIn   Twitter   Facebook   Pinterest   YouTube  
 © Copyright 1997-2020 Microsys

 Usage of this website constitutes an accept of our legal, privacy policy and cookies information.