Archive for the ‘Technical Issues’ Category
It appears that something is up with Google and its cache reporting. A lot of pages seem to have cache dates of August 12 or thereabouts, and sites that were previously cached regularly are reporting that they have not been cached.
So the questions now are:
Is it the cache date reporting that is wrong or
Is it that Google has suddenly ramped down their caching?
Are they ‘full’ again as they were a couple of years back?
Is something BIG, HUGE, MONUMENTAL about to happen? Are we in for another update of ‘Florida’ proportions?
Much has been said of late about the apparent worsening of web spam handling within the Google index. Could we be standing on the precipice, blissfully unaware?
Today I thought I would mention the case of the curiously disappearing pages in Google. I am now getting regular enquiries from people who are concerned about their site losing page saturation in the mighty G. No indexed pages = no traffic of course, which in turn = no money.
Having worked with a few sites that have suffered this, I have found that the main issues contributing to it were:
Weak pages.
By this I mean that the pages were pretty much duplicated, with only a small element changing on each page. This is especially a problem with e-commerce sites, as the header, nav and footer all stay the same. Visible content might look a little different, but when you look at the code you see that in hundreds of lines, maybe only 4 or 5 are unique.
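A rough way to see how little of a page is unique is to compare it line by line with a sibling page. The sketch below is only an illustration: the function name `unique_line_ratio` and the two sample pages are made up for the example, and this says nothing about how Google itself measures duplication.

```python
def unique_line_ratio(page: str, other: str) -> float:
    """Fraction of non-blank lines in `page` that do not appear in `other`."""
    lines = [ln.strip() for ln in page.splitlines() if ln.strip()]
    if not lines:
        return 0.0
    other_lines = {ln.strip() for ln in other.splitlines() if ln.strip()}
    unique = [ln for ln in lines if ln not in other_lines]
    return len(unique) / len(lines)


# Two hypothetical product pages that share the same header, nav and footer.
template = [
    "<header>Shop</header>",
    "<nav>Home | Cart</nav>",
    "{body}",
    "<footer>Contact</footer>",
]
page_a = "\n".join(template).replace("{body}", "<p>Red widget</p>")
page_b = "\n".join(template).replace("{body}", "<p>Blue widget</p>")

# Only one line in four differs between the two pages.
print(unique_line_ratio(page_a, page_b))
```

Run against real cart pages, a very low ratio across most of the catalogue would be a sign of the weak-page problem described above.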
Poor navigation/site structure
In this case I noticed that the actual site hierarchy was poorly designed. There was no clear structure for the search engines to apply weight to.
Poor linking structure
Here I saw poor linking, in as much as most pages were linked to from most other pages. This only served to water down PageRank (link juice) to a point where the pages were all seen as unimportant (again related to poor structure/architecture).
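The watering-down effect can be illustrated with the textbook PageRank iteration. This is a minimal sketch, assuming the published formula with a damping factor of 0.85. It is not Google's production algorithm, and the page names and both link graphs are invented for the example.

```python
def pagerank(links: dict[str, list[str]], damping: float = 0.85,
             iterations: int = 50) -> dict[str, float]:
    """Textbook power-iteration PageRank over a small link graph.

    `links` maps each page to the pages it links to; every target must
    also appear as a key.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outlinks in links.items():
            if not outlinks:
                # A page with no outgoing links spreads its rank evenly.
                for p in pages:
                    new_rank[p] += damping * rank[page] / n
            else:
                share = damping * rank[page] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical site where every page links to every other page.
names = ["home", "cat", "p1", "p2"]
mesh = {p: [q for q in names if q != p] for p in names}

# Hypothetical site with a clear hierarchy: home -> categories -> products.
hier = {
    "home": ["cat_a", "cat_b"],
    "cat_a": ["home", "a1", "a2"],
    "cat_b": ["home", "b1", "b2"],
    "a1": ["home", "cat_a"],
    "a2": ["home", "cat_a"],
    "b1": ["home", "cat_b"],
    "b2": ["home", "cat_b"],
}

mesh_rank = pagerank(mesh)
hier_rank = pagerank(hier)
print(mesh_rank)  # every page ends up with the same score
print(hier_rank)  # home scores highest, then categories, then products
```

In the fully cross-linked graph no page stands out, whereas the hierarchy gives the home and category pages visibly more weight than the product pages.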
Too many links
This was a common theme, as many carts have pop-out or drop-down option selections which look innocent enough but, on further investigation, can be seen to be causing problems. It is possible to have hundreds of links per page, and this isn’t good.
While a flat file structure is OK for a small site, a clear linking structure and hierarchy MUST be evident in a large site. This allows Google to apply its weight and its trust to each page, pro rata.
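Counting the links on a page is easy to do with Python's standard `html.parser` module. The following is a small sketch. The class name `LinkCounter`, the helper `count_links` and the sample markup are all made up for illustration.

```python
from html.parser import HTMLParser


class LinkCounter(HTMLParser):
    """Collects the href of every <a> tag that has one."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def count_links(html: str) -> int:
    """Return the number of <a href="..."> links found in `html`."""
    parser = LinkCounter()
    parser.feed(html)
    parser.close()
    return len(parser.hrefs)


# Hypothetical fragment of a drop-down menu; the last anchor has no href.
sample = (
    '<ul><li><a href="/red">Red</a></li>'
    '<li><a href="/blue">Blue</a></li>'
    '<li><a name="x">no href</a></li></ul>'
)
print(count_links(sample))
```

Feeding it the full source of a cart page, menus included, gives a quick idea of whether the page is carrying hundreds of links.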
Poor PageRank
Poor linking leads to poor PR spread, and that is not good in the eyes of Google. Despite what many say, actual PR matters to Google. It matters for many things, and gauging the value of a page is one of them.
All of the above serve only to confuse the search engines as to the importance of pages within your site. Each site has a page saturation level, which is worked out by the two main elements in the Google algorithm (yes, there are really only 2 when it is all boiled down):
1. Importance – this is a measure of value in the eyes of Google and is pretty much PageRank.
2. Relevance – this is a textual value.
The above are further split into the 250 or more sub-elements that make up the algorithm, but when all is said and done, it is those 2 that matter.
With most of a shopping cart’s pages being near-duplicate content, and the page cross-linking structure being higgledy-piggledy at best, how is Google supposed to know what is important or relevant? Put simply, it can’t.
The result of this is that while a site may be showing some 1500 or so pages indexed, when you get to between 200 and 300 pages, the cached versions stop showing. So the reality is even worse. No cache in the main index, yet oftentimes a cache when you visit the individual page?
While Google announced the scrapping of the infamous supplemental index, it appears that, just as in the case of Mark Twain (Samuel Langhorne Clemens), rumours of its death were greatly exaggerated.
Finally, I will say this. If you read the Google webmaster technical advice pages, they tell you not to make the errors above that contribute to page dropping.
Question:
I recently changed hosts, and at the same time I moved the store from domain.com/store to the root, domain.com/. Sales have disappeared and I am getting loads of 404 errors. What can I do? This is an absolute nightmare for me and the business.
Answer:
Sadly, you have made an all too common error in assuming that it is OK to simply move your site around without it having an effect. The search engines know all the URLs on your site, and expect them to be there. People will have bookmarked pages to come back later and buy from you. Now they are all getting 404 errors. The spiders feel unloved and, worse, potential buyers think you have gone bust.
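One common remedy for moved URLs is a permanent (301) redirect from each old address to its new one, so that spiders and bookmarks land on the right page. The sketch below is an assumption about how that mapping might look for the /store move in the question, not necessarily the fix the full post recommends. The function name `redirect_target` is hypothetical, and in practice the rule would normally live in the web server's configuration.

```python
from typing import Optional

OLD_PREFIX = "/store"  # old location from the question (domain.com/store)


def redirect_target(path: str) -> Optional[str]:
    """Return the new path for an old /store URL, or None if no redirect applies."""
    if path == OLD_PREFIX or path.startswith(OLD_PREFIX + "/"):
        new_path = path[len(OLD_PREFIX):]
        return new_path or "/"
    return None


# Hypothetical old URLs; the server would answer each match with
# "301 Moved Permanently" and a Location header set to the new path.
print(redirect_target("/store/widgets/red.html"))
print(redirect_target("/store"))
print(redirect_target("/storefront"))  # different path, so no redirect
```

Matching on `/store/` rather than just `/store` avoids catching unrelated paths that merely start with the same letters.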
Q: (privacy requested for obvious reasons)
Hi OWG, for some odd reason Google has suddenly started spidering our site under the wrong domain name. The home page and a couple of others have been spidered under XXX.co.uk while the rest are under the original domain of yyy.co.uk. We really are at a loss as to why this has happened. Can you help please, as yyy.co.uk is just an old domain name that we forward to our real site?
A: No problem! This often happens and in many cases is a time bomb waiting to go off. It is a way of setting yourself up to be knocked down, due to poor redirection of domains.
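As an illustration of tidy domain redirection, the usual approach is for every request that arrives on a secondary domain to get a 301 to the same path on the one real domain. This is a minimal sketch under that assumption. The host name `www.example.co.uk` and the function `canonical_redirect` are placeholders, not details from the question.

```python
from typing import Optional

# Hypothetical canonical host; stands in for the site's real domain.
CANONICAL_HOST = "www.example.co.uk"


def canonical_redirect(host: str, path: str) -> Optional[str]:
    """Return the URL to 301 to when a request arrives on a non-canonical host.

    Returns None when the request is already on the canonical host.
    Port numbers in `host` are not handled in this sketch.
    """
    if host.lower() == CANONICAL_HOST:
        return None
    return f"http://{CANONICAL_HOST}{path}"


print(canonical_redirect("old-domain.example", "/about"))
print(canonical_redirect("WWW.EXAMPLE.CO.UK", "/"))
```

The point of answering with a 301 rather than serving the same pages on both names is that the search engines then have only one domain to index.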