Archive for » October, 2007 «

Over the past two days, I have been writing about duplicate content. As discussed earlier, WordPress blogs are notorious for duplicate content. Duplicate content can confuse search engines, and can get you penalized by Google.

Today’s Lesson

While researching on Google’s website (webmaster tools), Google suggests using a robots.txt file as one way to avoid duplicate content.

The robots txt file, gives the crawlers, bots and spiders “instructions” as to what to crawl on your site.

With the robots txt. file, you can avoid sections of your blog from being crawled, thus, avoiding duplicate content.

In researching this issue, I find differing opinions. Some will say a definite “Yes”, you need a robots txt. file. Others claim, it’s not necessary.

Today’s Lesson

Having reviewed your site for duplicate content, do you deem it necessary to add a robots.txt file to your blog?

To learn more about robots txt. files, here’s a link that gives very valuable information.

To know what others are doing, Daniel, at Daily Blog Tips, wrote a great post, where he researched how others are dealing with this issue. He includes sites such as Problogger, John Chow, and TechCrunch. The results are quite interesting.

Adding a robots txt. file to your blog is a decision only you can make.

To see how your site looks to the robots, you can type in http://yoursitename.com/robots.txt

When you hit the search button, a new screen will appear. It may look like this:

User-agent: *
Disallow:

This (*) tells all crawlers, spiders and bots (user agents) to crawl your site. “Disallow:” means that they are allowed to crawl everything on your site.

What have you decided?

Do you feel comfortable setting up a robots txt. file?

Do you think you need one?

What I did was install a plugin for this purpose. It is called the KB Robots txt. plugin. and was written for WordPress blogs, by Adam R. Brown. It can be downloaded here. Many thanks, Adam.

Today’s Lesson

In an effort to reduce the amount of duplicate content I found on my blog pages, the first this was to use a plugin.

The plugin I am using, is named: Homepage Excerpts WordPress Plugin written by Daniel Scocco. Thank you Daniel.

What this easy to install, plugin does, is to give you the option of showing excerpts, instead of full posts.

I don’t necessarily like that the fact that a reader who comes into my blog via the homepage, can only read one full post, and then has to click on others to read the full content. However, if I can avoid the duplicate content issue, hopefully my readers will understand.

On the bright side, it does give readers a chance to scroll through previous posts fairly quickly, and they can determine which ones they may want to read. And…the excerpts take up less space than a full post would.

Today’s Assignment

Look at your homepage.

How many full posts does it include?

Should you consider adding the Homepage Excerpts WordPress Plugin? Or, use another method of excerpting your older posts?

If you are not using WordPress, does your blogging platform have an option you can use to avoid duplicate content?

Over the past week, I have been spending time on Google’s website researching how search engines may see our blogs.

Today’s Lesson

One issue that caught my eye, is duplicate content in blogs.

With WordPress, we have duplicate content throughout our blogs. One post can be “recorded” in many different areas…i.e. monthly archives, recently written, favorite posts, feeds, articles, and in numerous categories. That’s a lot of duplication in a blog.

To quote Google:

Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. Mostly, this is not deceptive in origin. Examples of non-malicious duplicate content could include:

* Discussion forums that can generate both regular and stripped-down pages targeted at mobile devices
* Store items shown or linked via multiple distinct URLs
* Printer-only versions of web pages

To address this issue, Google tries to “filter” some of this duplicate content, in an effort to give readers the best results for their search results. However, duplicate content can unknowingly, be classified by Google, as a deceptive practice.

Today’s Assignment

Review your site.

Where do you have duplicate content?

Do you think your site could be “labeled” as deceptive?

Does your blog platform/program, address this issue?

I have looked at my sites and found numerous areas where I have duplicate content. Based on my finding, I have made several adjustments.

Future lessons will address how I am dealing with these issues.

It’s one thing to try and protect your blog posts from being plagiarized, however, now I’ve been asking myself, a different question.

Today’s Lesson

In the event of a natural, or man made disaster, how safe is my blog?

Recently, (week of 10/21/07), if you watch the news, you saw how a million (+/-) residents of the State of California were evacuated, due to wildfires. Many, were forced to leave, with only a few minutes notice. One man, when interviewed, stated that before he could get his pants on, the fire was already licking at the side of their home.

Years ago, we were evacuated due to the threat of fire. We were put “on standby”, so we took precautions, and packed a large truck, full of valuables and clothes we may need to last a few days. When we got the call to leave, we were ready. Fortunately, our home wasn’t affected by the fire, but it was an eye opening experience.

The recent terror in California, got me to thinking. Am I prepared, if we were stuck by a disaster? Just the thought of it, motivated me to save my blog information, not only to my hard drive, but on my web host’s server, as well.

In response to the self directed question, I wrote a post on my other blog, titled:How To Prepare For Disaster, with hopes of alerting others to the importance of being prepared.

Today’s Lesson

Are you backing up your blog information?

In the event of a disaster, would your blog be lost?

Could you be back online, in a short period of time?

If you lost all of your blog, would you care?

If you got the call to evacuate, what would you grab?

In my previous post, I wrote about adding internal links in your posts, to deter having your content plagiarized.

Today’s Lesson

How would you know if your content is being plagiarized, and if so, what will you do?

Shortly after I started blogging, I wanted to see how many of my articles were indexed, so I did a search for my site. I was indexed about 40 times. That confused me, as I had not written 40 posts.

Well, some of my posts were referenced by Digg, Technorati and Sphere. But than I found an odd entry. When I clicked on it, I saw my entire page, including all graphics, but at the very top, was a full banner of Google AdSense ads, all in a foreign language.

I wasn’t sure what to do, so I researched the URL, and found the IP address for the site. I then went to my webhosting account, and used the “IP Deny Manager” to block the IP address from having access to my site.

Todays Assignment

Go To Copyscape.com, a free service for checking your site for plagiarized content.

On Copyscape, you can check your site, read additional pages for suggestions of how to handle plagiarism, plus download a banner for your site.

Do a search for your site.

Are any of your posts are being plagiarized? If you find an instance of plagiarism, what action will you take?

BTW: I checked my sites with Copyscape. I found a new instance of plagiarism of my other blog. A site extracted a sentence from a product review I had done.

Related Posts with Thumbnails