An Introduction to Google Sitemaps

... and why I'm dying to finally get into the Google SERP

Have you also noticed that getting indexed by Google is becoming tougher and tougher, even though the Google crawler visits your site every day, to the point where it seems almost impossible in the short term? Between us, in the corridors of Google, they're talking about the notorious 'Google Sandbox' theory. According to this theory, a new website is first 'sandboxed' and doesn't get a ranking, even when its keywords are not incredibly competitive. The Google Sandbox is in fact a filter, put in place in March 2004, that prevents new websites from having immediate success in the Google search engine result pages. This filter "is only intended to reduce search engine spam". The sandbox filter is not permanent, which means you can only wait, wait and wait until Google releases you from it. In the meantime, don't sit back: write original, well-optimized content; write, publish and share articles; get links to your site placed on other websites, and so on.

An example:

I started wallies.info this year on April 1st and submitted the URL to Google, Yahoo and MSN Search on the same day. Two months later, when I search for 'http://www.wallies.info' and 'wallies.info', Google returns 1 result for each query, Yahoo! returns 65 results for each, and MSN Search returns 313 and 266 results. A remarkable difference, isn't it?! Clearly, Google has a huge backlog when it comes to indexing (new) pages. Two or three times a week I receive a Google Alert for these two searches, yet the pages it reports never show up in the Google search engine results pages (SERP).

With the introduction of Google Sitemaps (https://www.google.com/webmasters/sitemaps/), a beta service for reporting website updates, on Friday, June 3, 2005, I hope the wait in the Sandbox will get shorter. With a Sitemap, crawlers can more easily discover recently changed pages and immediately get a list of all current pages. Because Google Sitemaps is released under a Creative Commons license, all search engines can make use of it. It is important to know that Google Sitemaps will not influence the calculation of your PageRank.

Sitemaps uses its own XML-based format, called the 'Sitemap Protocol'. For each URL, some additional information, such as the last modified date, can be included.

There are several methods to create your XML Sitemap:

1. The Sitemap Generator (https://www.google.com/webmasters/sitemaps/docs/en/sitemap-generator.html) is a simple script that can be configured to automatically create Sitemaps and submit them to Google.

2. Make your own Sitemap script

3. With the Open Archives Initiative (OAI) protocol for metadata harvesting (http://www.openarchives.org/OAI/openarchivesprotocol.html)

4. With RSS 2.0 and Atom 0.3 syndication feeds

5. A simple list of URLs with one per line

In the current RSS era, the fourth method is obviously the most logical and the easiest. Roughly speaking, you only need to make a new XML template. For a working Sitemap example from the wallies.info blog, go to http://www.wallies.info/blog/gsm.php.

This XML Sitemap has to be submitted on the Google Sitemaps page (https://www.google.com/webmasters/sitemaps/). When you've updated your listed pages or your Sitemap has changed, you have to resubmit your Sitemap link for re-crawling. After I submitted the wallies.info Sitemap, it took between 3 and 4 hours before Google downloaded the file.

Please note that Sitemaps in no way influences the calculation of your PageRank, that Google doesn't add every submitted Sitemap URL to the Google index, and that Google doesn't guarantee anything about when or if your Sitemap pages will appear in the Google SERP.

Of course, it's easier to set up an automated job to submit this XML file.

You can do this with an automated HTTP request, as in this example (the URL of your sitemap, which is everything after /ping?sitemap=, has to be URL encoded):

www.google.com/webmasters/sitemaps/ping?sitemap=http%3A%2F%2Fwww.yoursite.com%2Fsitemap.xml
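As a minimal sketch of how such a ping URL could be assembled in a script, the Python snippet below URL-encodes a sitemap address and appends it to the ping endpoint. The function name build_ping_url and the http:// scheme in front of the endpoint are assumptions made for illustration; the article gives the endpoint without a scheme.

```python
from urllib.parse import quote

# Endpoint taken from the article; the "http://" scheme is an assumption.
PING_ENDPOINT = "http://www.google.com/webmasters/sitemaps/ping?sitemap="


def build_ping_url(sitemap_url):
    """Return the ping URL for a sitemap (hypothetical helper).

    safe="" makes quote() encode ':' and '/' as well, so the whole
    sitemap URL is percent-encoded as the article requires.
    """
    return PING_ENDPOINT + quote(sitemap_url, safe="")


if __name__ == "__main__":
    print(build_ping_url("http://www.yoursite.com/sitemap.xml"))
```

A scheduled job could then fetch the resulting URL, for example with urllib.request.urlopen, each time the sitemap changes.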

What is the Sitemap Protocol?

The Sitemap Protocol informs the Google search engine which pages in your website are available for crawling. A Sitemap consists of a list of URLs and may also contain additional information about those URLs, such as when they were last modified, how frequently they change, etc.

An example of the XML Sitemap format:

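<urlset>
  <url>
    <loc>http://www.wallies.info/blog/</loc>
    <lastmod>2005-06-07T05:34:36+02:00</lastmod>
    <changefreq>daily</changefreq>
    <priority>1.0</priority>
  </url>
  <url>
    <loc>http://www.wallies.info/blog/item/130/index.html</loc>
    <lastmod>2005-06-05T10:59:22+02:00</lastmod>
    <priority>1.0</priority>
  </url>
  ...
</urlset>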

The XML Sitemap Format uses the following XML tags:

- urlset: this tag encapsulates all other tags in this list;

- url: this tag encapsulates the changefreq, lastmod, loc and priority tags in this list;

- changefreq (optional): how frequently the content at the URL is likely to change. Valid values are 'always', 'hourly', 'daily', 'weekly', 'monthly', 'yearly' and 'never';

- lastmod (optional): the time the content at the URL was last modified. The timestamp has to be in ISO 8601 format;

- loc (required): the URL of a page on your site (fewer than 2,048 characters);

- priority (optional): the priority of the page relative to other pages on the same site, expressed as a number between 0.0 and 1.0 (default 0.5). This priority is only used to select between URLs on your site; it will not be compared to the priority of pages on other sites.
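To illustrate how the tags above fit together when you write your own Sitemap script, here is a rough Python sketch that builds the XML from a list of entries. The function name build_sitemap and the dictionary layout of each entry are assumptions for this example, not part of the Sitemap Protocol. The sketch uses only the tags and limits described in this article and leaves out any XML declaration or namespace attribute, since the article does not show them.

```python
from datetime import datetime, timedelta, timezone
import xml.etree.ElementTree as ET

# Values listed in the article for the changefreq tag.
VALID_CHANGEFREQ = {"always", "hourly", "daily", "weekly",
                    "monthly", "yearly", "never"}


def build_sitemap(entries):
    """Return Sitemap XML as a string (hypothetical helper).

    Each entry is a dict with a required "loc" and optional
    "lastmod" (timezone-aware datetime), "changefreq" and "priority".
    """
    urlset = ET.Element("urlset")
    for entry in entries:
        loc = entry["loc"]
        if len(loc) >= 2048:
            raise ValueError("loc must be shorter than 2,048 characters")
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc

        lastmod = entry.get("lastmod")
        if lastmod is not None:
            # isoformat() produces an ISO 8601 timestamp.
            ET.SubElement(url, "lastmod").text = lastmod.isoformat(timespec="seconds")

        changefreq = entry.get("changefreq")
        if changefreq is not None:
            if changefreq not in VALID_CHANGEFREQ:
                raise ValueError("invalid changefreq: %r" % changefreq)
            ET.SubElement(url, "changefreq").text = changefreq

        priority = entry.get("priority")
        if priority is not None:
            if not 0.0 <= priority <= 1.0:
                raise ValueError("priority must be between 0.0 and 1.0")
            ET.SubElement(url, "priority").text = "%.1f" % priority
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    cest = timezone(timedelta(hours=2))
    print(build_sitemap([{
        "loc": "http://www.wallies.info/blog/",
        "lastmod": datetime(2005, 6, 7, 5, 34, 36, tzinfo=cest),
        "changefreq": "daily",
        "priority": 1.0,
    }]))
```

Validating changefreq and priority before writing the file is a design choice of this sketch; it simply catches values outside the ranges listed above before the Sitemap is submitted.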

A urlset may contain up to 50,000 URLs, and the file must not be larger than 10 MB when uncompressed. Multiple Sitemaps can be gathered in a Sitemap index file, with a maximum of 1,000 Sitemaps for the same site.
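For a large site, these limits mean the URL list has to be split over several Sitemap files. The Python sketch below shows one possible way to do that. The function name split_into_sitemaps is an assumption for illustration, and the sketch only checks the URL and Sitemap counts, not the 10 MB file size.

```python
MAX_URLS_PER_SITEMAP = 50000     # limit per urlset, as stated in the article
MAX_SITEMAPS_PER_INDEX = 1000    # limit per Sitemap index file


def split_into_sitemaps(urls, chunk_size=MAX_URLS_PER_SITEMAP):
    """Split a list of URLs into chunks, one chunk per Sitemap file."""
    chunks = [urls[i:i + chunk_size] for i in range(0, len(urls), chunk_size)]
    if len(chunks) > MAX_SITEMAPS_PER_INDEX:
        raise ValueError("too many Sitemaps for a single Sitemap index file")
    return chunks


if __name__ == "__main__":
    # Placeholder URLs, purely for demonstration.
    urls = ["http://www.yoursite.com/page%d.html" % i for i in range(120000)]
    for number, chunk in enumerate(split_into_sitemaps(urls), start=1):
        print("Sitemap %d: %d URLs" % (number, len(chunk)))
```

Each chunk can then be written to its own Sitemap file, and the resulting files listed in a Sitemap index file.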

The Google Sitemaps URL: https://www.google.com/webmasters/sitemaps/

For feedback on this Sitemaps article, please feel free to visit http://www.wallies.info/blog/item/132/index.html

Walter V. is a self-employed internet entrepreneur and founder-webmaster of several websites, including wallies.info: A snappy blog about snappy blue things: blog | wiki | forum | links - http://wallies.info

mblo.gs: a snappy moblog community - http://mblo.gs
