Archive for » October, 2007 «

Over the past two days, I have been writing about duplicate content. As discussed earlier, WordPress blogs are notorious for duplicate content. Duplicate content can confuse search engines, and can get you penalized by Google.

Today’s Lesson

While researching on Google’s website (webmaster tools), Google suggests using a robots.txt file as one way to avoid duplicate content.

The robots txt file, gives the crawlers, bots and spiders “instructions” as to what to crawl on your site.

With the robots txt. file, you can avoid sections of your blog from being crawled, thus, avoiding duplicate content.

In researching this issue, I find differing opinions. Some will say a definite “Yes”, you need a robots txt. file. Others claim, it’s not necessary.

Today’s Lesson

Having reviewed your site for duplicate content, do you deem it necessary to add a robots.txt file to your blog?

To learn more about robots txt. files, here’s a link that gives very valuable information.

To know what others are doing, Daniel, at Daily Blog Tips, wrote a great post, where he researched how others are dealing with this issue. He includes sites such as Problogger, John Chow, and TechCrunch. The results are quite interesting.

Adding a robots txt. file to your blog is a decision only you can make.

To see how your site looks to the robots, you can type in http://yoursitename.com/robots.txt

When you hit the search button, a new screen will appear. It may look like this:

User-agent: *
Disallow:

This (*) tells all crawlers, spiders and bots (user agents) to crawl your site. “Disallow:” means that they are allowed to crawl everything on your site.

What have you decided?

Do you feel comfortable setting up a robots txt. file?

Do you think you need one?

What I did was install a plugin for this purpose. It is called the KB Robots txt. plugin. and was written for WordPress blogs, by Adam R. Brown. It can be downloaded here. Many thanks, Adam.

Today’s Lesson

In an effort to reduce the amount of duplicate content I found on my blog pages, the first this was to use a plugin.

The plugin I am using, is named: Homepage Excerpts WordPress Plugin written by Daniel Scocco. Thank you Daniel.

What this easy to install, plugin does, is to give you the option of showing excerpts, instead of full posts.

I don’t necessarily like that the fact that a reader who comes into my blog via the homepage, can only read one full post, and then has to click on others to read the full content. However, if I can avoid the duplicate content issue, hopefully my readers will understand.

On the bright side, it does give readers a chance to scroll through previous posts fairly quickly, and they can determine which ones they may want to read. And…the excerpts take up less space than a full post would.

Today’s Assignment

Look at your homepage.

How many full posts does it include?

Should you consider adding the Homepage Excerpts WordPress Plugin? Or, use another method of excerpting your older posts?

If you are not using WordPress, does your blogging platform have an option you can use to avoid duplicate content?

Over the past week, I have been spending time on Google’s website researching how search engines may see our blogs.

Today’s Lesson

One issue that caught my eye, is duplicate content in blogs.

With WordPress, we have duplicate content throughout our blogs. One post can be “recorded” in many different areas…i.e. monthly archives, recently written, favorite posts, feeds, articles, and in numerous categories. That’s a lot of duplication in a blog.

To quote Google:

Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. Mostly, this is not deceptive in origin. Examples of non-malicious duplicate content could include:

* Discussion forums that can generate both regular and stripped-down pages targeted at mobile devices
* Store items shown or linked via multiple distinct URLs
* Printer-only versions of web pages

To address this issue, Google tries to “filter” some of this duplicate content, in an effort to give readers the best results for their search results. However, duplicate content can unknowingly, be classified by Google, as a deceptive practice.

Today’s Assignment

Review your site.

Where do you have duplicate content?

Do you think your site could be “labeled” as deceptive?

Does your blog platform/program, address this issue?

I have looked at my sites and found numerous areas where I have duplicate content. Based on my finding, I have made several adjustments.

Future lessons will address how I am dealing with these issues.

Related Posts with Thumbnails