Advanced SEO Techniques

Sitemap.xml

Overview
Structured Data
Structured Data Overview
10:18
501
Adding JSON-LD Structured Data
16:29
502
Site Name JSON-LD
503
Articles, News & Blog Post JSON-LD
504
Site Search JSON-LD
506
Client Reviews JSON-LD
506
Breadcrumbs JSON-LD
508
FAQ Page JSON-LD
509
Q&A Page JSON-LD
510
Twitter Cards
601
Embedding Rich Text in JSON
702
Validating & Debugging JSON-LD
703
Technical SEO
Controlling the Robots
801
Sitemap.xml
802
robots.txt
803
Removing Pages from Google
804
Removing Your WEBFLOW.IO Staged Site from Google
805
More Advanced SEO Techniques
810
Voice Search
810
Partially-Dynamic Headings
3:35
811
Full Site SEO Text Search
3:35
812
Influencing Google Search Appearance
Influencing Google's Search Appearance
901
No items found.
Published
November 2, 2022
Updated
in lightbox

Sitemap.xml is a special file in websites which helps search engine crawlers to index the site.

It provides two key functions;

  1. It gives crawlers a nice ToC of all of the pages on your site, so that it doesn't have to slowly discover them by navigating page-by-page through links.
  2. It helps focus a crawler's attention on what has changed, using the last-modified date.

Here's an example sitemap.xml, for this site.

Noteable Webflow Points

What Webflow includes;

  • All static pages
  • All individual Collection Pages

What Webflow does not include;

  • Pages with Collection List pagination
    Although these are legitimate discrete links, they are low value compared to the collection pages themselves. As a complication, the number of possible URLs multiplies out fast due to the way Collection List pagination works,

Webflow uses the default location of /sitemap.xml

Webflow does not include the last-modified date, possibly due to some complications with updating CMS items. However this is probably not needed, see notes below.

The Hostname Bug

No matter how many domain names you have on your site, Webflow will only generate the sitemap.xml with ONE of them.

If you have a default domain set, and republish your site, Webflow should always use that default domain as the one displayed in the sitemap.xml.

For example, on Sygnal's site, I have the www.sygnal.com domain specified as the default, and that results in a sitemap xml containing;

<loc>https://www.sygnal.com/</loc>

However if you DO NOT have a default domain name specified, the sitemap.xml gets a fairly random domain name in it.

For example, if I have domain names sygnal1.com and sygnal2.com, and I have not set any default name in Webflow, then Webflow will unpredictably choose one for the sitemap.xml- and then it will deliver that same sitemap.xml for both sites.

That means Google might request https://sygnal1.com, but the sitemap contains URLs pointing to https://sygnal2.com.

To Google, that's an invalid sitemap, since it's not pointing to the current site.

FAQs

There are some pages I don't want Google to index, how do I remove them from my sitemap.xml?

There is no way to exclude a page from Webflow's automatic sitemap.xml.

The only option Webflow offers is to replace your entire sitemap with a literal, custom-edited one, copy-paste style.

However there are almost zero situations where that's actually useful.

Sitemaps only have an "assist" role; they don't determine what should be indexed, or indicate what you want to appear in search results.

If you're trying to exclude a page, use META noindex instead.

References

Last-modified

Definition of last-modified

Notes about Google's use of the last-modified date are highly conflicted as to whether it's beneficial to ranking.

https://www.seroundtable.com/google-last-modified-date-xml-sitemaps-30026.html

https://webmasters.stackexchange.com/questions/117090/when-should-i-update-lastmod-value-in-the-sitemap

https://stackoverflow.com/questions/31349345/how-to-properly-format-last-modified-lastmod-time-for-xml-sitemaps?stw=2

Sitemap validation notes

https://support.google.com/webmasters/thread/148829982?hl=en&msgid=149001137

Videos
No items found.
Table of Contents
Comments
Did we just make your life better?
Passion drives our long hours and late nights supporting the Webflow community. Click the button to show your love.