Monday, January 4, 2010

SEM 101: Duplicate Content 101

What duplicate content is, how search engines deal with it, and how to avoid the pitfalls.
Search Engine Watch

By Ron Jones, SEW, Jan 4, 2010

If you're involved with SEO in any way, you've probably heard about duplicate content. If you're not exactly sure what it is and how it affects your SEO efforts, then this article is for you. This topic can get rather technical, but I'll keep it basic and won't get into technical details here.

What is Duplicate Content?

In the basic sense, duplicate content is when two or more Web pages, often on different sites, carry the same content. This isn't just the same subject or headers, but the exact same content word for word, or almost word for word.
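To make "almost word for word" concrete, here is a minimal sketch of one common way to measure text overlap: break each text into overlapping three-word runs ("shingles") and compare the two sets. This is an illustration only, not how any particular search engine works; the function names and the choice of three words are assumptions made for the example.

```python
import re


def shingles(text, k=3):
    """Return the set of overlapping k-word runs in the text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < k:
        # Very short text: treat the whole thing as a single shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def similarity(text_a, text_b):
    """Jaccard similarity of the two shingle sets: 1.0 = identical, 0.0 = no overlap."""
    a, b = shingles(text_a), shingles(text_b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    page_one = "the quick brown fox jumps over the lazy dog"
    page_two = "the quick brown fox jumps over the lazy cat"
    print(similarity(page_one, page_two))  # high score: nearly the same text
```

A score near 1.0 suggests the two pages are duplicates or near duplicates; where to draw the line is a judgment call.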

This may happen for several reasons. You may have written an article and another site or blog picked up the article and posted it word for word.

Another situation is when you have multiple sites on different domains with similar content. This is sometimes true for sites in different countries. Sometimes there are legitimate reasons for having the same content. Still, it's good to understand the pitfalls.

Why is Duplicate Content a Problem?

The goal of the search engine is to deliver the best value for a given search term or phrase. The more this happens, the more searchers will continue to use that search engine.

The search engine wants to avoid serving up many copies of the exact same Web page in the search results, because that would confuse the searcher and deliver a poor search experience. So it attempts to filter out the duplicate content, chooses one version based on certain criteria, and serves that one up.
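The filtering idea can be sketched in a few lines. The example below is an assumption-laden toy, not a description of any real search engine: it treats two pages as duplicates only if their text is identical after ignoring case and punctuation, and it simply keeps the first copy it sees. The function names and the sample URLs are hypothetical.

```python
import hashlib
import re


def fingerprint(text):
    """Hash the text after lowercasing and stripping punctuation and extra spaces."""
    normalized = " ".join(re.findall(r"\w+", text.lower()))
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def filter_duplicates(results):
    """Take (url, text) pairs in ranked order; keep one URL per distinct text."""
    seen = set()
    kept = []
    for url, text in results:
        fp = fingerprint(text)
        if fp not in seen:
            seen.add(fp)
            kept.append(url)
    return kept


if __name__ == "__main__":
    results = [
        ("http://example.com/article", "Duplicate content 101."),
        ("http://copy.example/article", "duplicate content 101"),
        ("http://example.org/other", "Something else entirely."),
    ]
    print(filter_duplicates(results))
```

Here the second URL is dropped because its text matches the first, which is exactly the risk the next paragraph describes: only one copy gets shown.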

The problem is that your page may not show up at all because the search engine chose another page with the same content. There is speculation about a penalty for duplicate content, but I don't think it's so much a penalty as a missed opportunity to show up in the SERPs.

How Search Engines Deal with Duplicate Content

Search engines send out a bot or program to surf the Internet and collect all of the content it finds. This content is indexed and placed into a database.

During this process, the content is compared against other content to find duplicates. Then an attempt is made to determine the original. Some clues that help it determine this are:

  • How trusted is the domain?

  • Are there links on one that point back to an original?

  • Where do most of the links point?

  • Where is the first place Google found the content?

  • Does any of the content appear to have been "scraped" or repurposed?

One is then picked and used and the others are discarded. This list should also give you some ideas on how you can combat duplicate content issues.
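As a rough illustration of how clues like these could be combined, the sketch below ranks a group of duplicate pages and picks one. Everything in it is hypothetical: the `Page` fields, the trust score, and especially the order in which the clues are weighed are assumptions for the example. Search engines do not publish how they actually make this choice.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Page:
    url: str
    domain_trust: float      # hypothetical score from 0.0 to 1.0
    inbound_links: int       # how many links point at this copy
    first_seen: date         # when the crawler first found this copy
    links_to: frozenset = frozenset()  # URLs of other copies this page links to


def pick_original(copies):
    """Pick one page from a group of duplicates using the clues listed above."""
    def score(page):
        # Copies that link back to this page are a hint that it is the source.
        backlinks = sum(
            1 for other in copies
            if other is not page and page.url in other.links_to
        )
        # Assumed priority: backlinks, then trust, then link count, then earliest seen.
        return (backlinks, page.domain_trust, page.inbound_links,
                -page.first_seen.toordinal())
    return max(copies, key=score)


if __name__ == "__main__":
    original = Page("http://example.com/article", 0.8, 40, date(2010, 1, 4))
    scraper = Page("http://scraper.example/copy", 0.2, 1, date(2010, 1, 6),
                   frozenset({"http://example.com/article"}))
    print(pick_original([scraper, original]).url)
```

The practical takeaway is the same as the list: a link from the copy back to your original, and being found first, both work in your favor.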

What Can You Do to Avoid Duplicate Content Issues?

Now that you have a good idea what duplicate content is and how it's dealt with, let's look at what you can do to avoid the pitfalls. First, duplicate content issues don't have anything to do with your site's HTML code, only your page content.

Another way to deal with this issue is by using a canonical tag. A canonical page is basically an authoritative page among a group of pages that have similar content.
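For readers who want to check whether a page declares a canonical URL, here is a minimal sketch using Python's standard library. The canonical tag is a link element with rel="canonical" in the page's head; the page markup and the example.com URL below are made up for illustration, and the class and function names are my own.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Remember the href of the first <link rel="canonical"> element seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attributes = dict(attrs)
        rel_values = (attributes.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attributes.get("href")


def find_canonical(html):
    """Return the canonical URL declared in the HTML, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


if __name__ == "__main__":
    page = """
    <html><head>
      <title>Duplicate Content 101</title>
      <link rel="canonical" href="http://www.example.com/duplicate-content-101" />
    </head><body>Article text</body></html>
    """
    print(find_canonical(page))
```

Each duplicate or near-duplicate page would carry the same link element, pointing at the one version you want search engines to treat as authoritative.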

Also, Google recently posted an article on ways to handle legitimate cross-domain content duplication. They announced support for a link element and offered other tips for handling the problem. Basically, Google recognizes there are some legitimate uses for duplicate content and wants to help site owners with solutions.

As I mentioned earlier, this is a basics article, so I won't get into any technical details. You may, however, need some technical help to dig further into this topic and plan a solution.

Please feel free to share any lessons learned or other best practices for avoiding duplicate content issues.


Biography
Ron is President/CEO of Symetri Internet Marketing, which provides strategic SEM consulting and training. Ron is actively involved in the SEM community and speaks at conferences and seminars, as well as hosting regional SEM events where he provides participants SEM training and education best practices. Ron also serves on the Board of Directors for SEMPO and is also one of the authors for the SEMPO Institute Fundamentals and Advanced courses.

