26 comments »

Duplicate Content Filters Explained: Why Your Personal Blog and ActiveRain Blog Can Have Duplicate Content

One of the most misunderstood concepts in the real estate blogosphere and SEO is what duplicate content filters are and how they affect your real estate blog or website.

One of the most misunderstood concepts in the real estate blogosphere and SEO is what duplicate content filters are and how they affect your real estate blog or website. The most often asked question at RSS Pieces is “Will I be penalized for posting the same post on Active Rain as on my personal blog?” The answer is complex, so let’s break it down into easy to understand chunks based on what SEO experts from SEOmoz, Search Engine Watch and WebConfs have to say on the matter of duplicate content.

 

 

Do duplicate content filters exist?

Yes and no. Don’t you hate that as an answer? They exist but not in the way that most real estate professionals think. If you think that every little page of duplicated content will be penalized and banned by Google, you are wrong. If you think pages that are pulling your RSS feed back through your competitors site will be penalized and banned, that’s pretty unlikely. Search engines only penalize sites in very specific circumstances and where there is gross negligence. The short explanation of duplicate content filters is they are out there scouring the Net for spammers and mirror sites gaming the system to increase advertising revenues and search engine positions for specific terms. They are NOT out there heavily weighing your personal site against your ActiveRain blog. They are basically looking for EXACT or NEARLY EXACT replicas of your site on the Net. 

According to WebConfs, “Search engine spam is any deceitful attempts to deliberately trick the search engine into returning inappropriate, redundant, or poor-quality search results. Many times this behavior is seen in pages that are exact replicas of other pages which are created to receive better results in the search engine. Many people assume that creating multiple or similar copies of the same page will either increase their chances of getting listed in search engines or help them get multiple listings, due to the presence of more keywords.”

 Read also: WebConfs article on Duplicate Content Filters

Wow, hold up! So, let’s undress that slippery little statement in relation to duplicating your content across your personal blog and ActiveRain:

  1. For the most part, personal blogs generally have the “read more” feature where you break up the text into smaller excerpted blocks for the home page while your AR blog shows the full article on the blog’s home page along with a number of other posts. Therefore, the overall content on those home pages can vary substantially to a search engine.

  2. Take an individual page on AR and a single post on your personal blog and compare the total content. Most of your AR posts will have substantially more comments than those on your personal blog, therefore, search engines see the content as noticeably different.

  3. Your AR blog and your personal blog will vary considerably due to the very natures of the platforms selected for the blog. For example, a WordPress, Diachronics and many other blog platforms allow you to add various widgets that further alter the page content that can be indexed. Each page of your personal blog may contain more customizable information like menuing systems and categories along with static text that, again, alter the overall indexable content of the site.

  4. Both your AR and personal blog should be completely relevant to the search results returned by the engines and not likely to be seen as spam or as mirror sites.

What is duplicate content?

According to SE Roundtable, “What is dupe content: Multiple URL’s with same content…identical homepages w/same content. Why is it a problem? Because “they” say so. Recommends looking at the webmaster guidelines at G, Open Directory, and Yahoo. The real reason that this is a problem uis that you wind up confusing the SE robots.”

Read also: Duplicate Content Issues by Search Engine Roundtable

What is a Duplicate Content Filter

Duplicate Content Filters are simply algorithms designed to compare one page against another. If the filter considers two or more pages to be substantially similar, they simply keep the most trusted one (in Google’s case, the one with the most backlinks and PageRank) in the primary index while moving the others to the supplemental index. That doesn’t mean the site gets banned, it simply means that single page is put into the Google’s supplemental index which in the overall scheme of things isn’t really that bad.

What kind of content duplication causes Google to penalize your site?

According to RandFish of SEOmoz, “Penalties require a good bit of abuse to go into effect, but I've seen it happen, even on domains from respectable brands. The penalties really arise when you start copying hundreds or thousands of pages from other domains and don't have a considerable amount of unique content of your own.” 

Read also: SEOmoz’s illustrated guide to Duplicate Content Filters

So, unless your blog or website has grown to the hundreds of thousands of pages and is duplicating each and every page, you are considered safe. You may also want to consider that search engines have gotten increasingly more complex in the ways they implement duplicate content filters, as previous incarnations of the filters wrecked some havoc in the SERPs. Search Engines, for the most part, are only looking for the really bad eggs that are copying large volumes of pages and creating exact replicas of existing sites.

Related Posts
Guide to a successful real estate blog launch
Drive local traffic with reviews and interviews
PODCAST: Damion Flynn, MS Realtor
ActiveRain Talks Social Networking
Teach your old blog new tricks: 5 simple SEO tricks that can have your blog ranking better in 24 hours


http://www.rsspieces.com/000AE8
http://www.rsspieces.com/duplicate-content-filters-explained-why-your-personal-blog-and-activerain-blog-can-have-duplicate-content
Posted on June 19, 2007 11:11:02
Comment from: Loren Nason [Visitor] Email · http://www.futureofrealestatetechnology.com

Thats great information Mary.


I have put 1 or 2 of my articles on active rain but i have been concerned about doing it too much and getting penalized by the dupe content filter.


From what you are saying the chances of that happening are so slim i will have better luck at the craps table.


:-)



Thanks


Time to start putting my stuff on AR


Loren

PermalinkPermalink June 19, 2007 11:59:11
Comment from: teresa boardman [Visitor] Email · http://www.stpaulrealestateblog.com

Nice post.  I am a bad blogger and don't follow the "they say" rules. :)

PermalinkPermalink June 19, 2007 12:44:17
Comment from: Matt McGee [Visitor] Email · http://www.smallbusinesssem.com/

Good post, Mary. Nice to see this kind of information being shared with R.E. agents.



Couple points if I may:



1) Duplicate content filters don't only hit spammers and mirror sites. An agent who puts the same article/post on her own blog and on Active Rain runs the risk of having one of those pages filtered. But as you said, it's no big deal if one page goes into supplemental index. Plus, there may be more overall benefits to getting the AR exposure even if it means the post on your own blog gets dinged as dupe content.



Example: I share some of the blog posts I write with Search Engine Guide (SEG). I do some slight rewriting of the post and wait about 6-8 weeks before giving it to the other site. This way, if the version on my site becomes popular (via links, comments, traffic, etc.), it has a chance to gain some trust/authority. I know the version that SEG posts will have trust/authority, so I hope that my own version can at least compete, and possibly win if there's a dupe content issue. But even if the SEG version wins and gets better search engine visibility, that's okay because I still get exposure from it -- it brings lots of traffic to my blog. So the dupe content risk is well worth it to me. The same might very well apply with individual RE blogs and AR.



2) SEs do block-level content analysis, so even if the AR version has more comments than your own version, the text that is duplicated can still be identified as such.



In the same way, the fact that your own blog has different widgets, links, and other sidebar/header/footer content doesn't really matter. The engines ignore that content for the sake of measuring dupe content, and good thing -- if they didn't ignore it, every blog post you write could be a dupe of the last one. :-)



The "meat" of a page or post is what matters where dupe content is concerned.



3) I think you have a typo in your closing comments: Rand's quote says "hundreds or thousands", but you quoted it as "hundreds OF thousands."



Thanks!

PermalinkPermalink June 19, 2007 19:18:07
Comment from: Athol Kay [Visitor] Email · http://www.reagentinct.com

Great post Mary. I'd been considering AR on and off for a while, but didn't want to jinx myself before Google. Made my Feed Bag today.

PermalinkPermalink June 19, 2007 19:36:57
Comment from: Kermit Johnson [Visitor] Email · http://www.realestatetwincities.net

I previously had a website hosted by Advanced Access.  A large number of my pages were sent to supplemental results (aka Google Hell.)  Advanced Access offered no explanation.  Rankling fell from page one for many important keywords, to total obscurity. 


The site did not have duplicated pages.  However, I am wondering if the spiders detected some duplicated keywords or terms?  Obviously, a site about Minneapolis real estate will have many pages with the words like "home, house for sale, real estate, real estate agent etc.  popping up over and over again.


What do you think?  Do you have any resources that discuss the "supplemental results" conundrum?

PermalinkPermalink June 19, 2007 19:59:01
Comment from: Austin Realtor's Wife [Visitor] Email · http://www.Realtorwives.com

You completely dispelled a myth that I think I created in my own mind.  I have avoided re-posting on AR in fear of getting wiped off the face of GoogleEarth!


Thanks for breaking it down so well nice work!

PermalinkPermalink June 19, 2007 20:39:35
Comment from: Mariana Wagner [Visitor] Email · http://SpringsRealtyScoop.com

Whew - That is a relief ... although I must say that I DO try to "mix it up a bit" between my BLOG and my AR blog.

PermalinkPermalink June 19, 2007 21:25:53
Comment from: Ines [Visitor] Email · http://www.Miamism.com

Mary, we've talked about this a couple of times but it so helps to see it again in writing  (for those of us that are not very technical)....thanks!

PermalinkPermalink June 19, 2007 22:29:20
Comment from: Jeff Dowler [Visitor] Email · http://jdowler.realtownblogs.com

Great article, as always, mary. I am very careful to not copy content across my various blogs...guess it's my paranoia. Glad to know I was behaving and don't have to worry.


Miss you on AR but pop by to read now and then.


Cheers,


Jeff

PermalinkPermalink June 19, 2007 23:24:21
Comment from: Debi Braulik [Visitor] Email · http://www.northslopehomes.com

Thank you for explaining this in great detail. Since i don't think I'll have hundreds of thousands of pages, I'm safe. :)

PermalinkPermalink June 20, 2007 00:20:12
Comment from: Ann Cummings [Visitor] Email · http://www.anncummings.com

Hi Mary,


Thanks for this clarification!  I have always heard to avoid duplicate content, but this post makes it clearer the why's and why not's about dupe content.  And I feel a bit better about using some AR posts on my nice new blog that you've got me going on...  ;-)


Thanks again!

Ann

PermalinkPermalink June 20, 2007 08:07:31
Comment from: Russell Rockefeller [Visitor] Email · http://www.Campaignstreet.com

So very true. Basically the only penalty that could happen is that your EXACT site link where the duplicate content exists could end up in a supplemental results index. Essentially, the rules for duplicate content are to prevent people from creating an advertisement and then registering 9000 domains seo'ing the content with black hat techniques, and finally submitting all 9000 pages to be indexed.


It's very unlikely that you would be harshly penalized for having the odd duplicate entry on your page.


Russell Rockefeller


http://www.Campaignstreet.com


 

PermalinkPermalink June 20, 2007 09:20:28
Comment from: Guest [Visitor] Email · http://www.rsspieces.com
Mary.McKnight

Matt,


Thanks for the comment and your thoughts.  However, I would never misquote SEOmoz, the quote in my article is as it is on Rand's original SEOmoz piece.  Perhaps he meant to say "hundreds of thousands" but it reads "hundreds or thousands."


 

PermalinkPermalink June 20, 2007 10:28:09
Comment from: Property Hogs [Visitor] Email · http://www.propertyhogs.com/hogblog

The best thing to do is just post part of your article on other places and then have a "read more' link on the bottom leading to your own site with the whole article on it. You can of course duplicate the same thing everywhere and the worst that can happen is that Google will not index some of the pages.

PermalinkPermalink June 20, 2007 13:35:20
Comment from: Guest [Visitor] Email · http://www.rsspieces.com
Mary.McKnight

Property Hogs,


The read more tag is only valuable in that it drives more traffic back to your primary site.  The duplication of content across AR and your personal blog is not a  likely scenario for getting dropped from Google's index as  fully explained on the referenced sources int his post.

PermalinkPermalink June 20, 2007 16:09:25
Comment from: Guido [Visitor] Email · http://www.helpusell-properties.com

“Penalties require a good bit of abuse to go into effect. The penalties really arise when you start copying hundreds or thousands of pages from other domains and don't have a considerable amount of unique content of your own.”


This is essentially the key to not get banned!

PermalinkPermalink June 21, 2007 03:37:00
Comment from: agentscoreboard [Visitor] Email · http://www.agentscoreboard.com/blog

Thanks... this is good stuff.  I'm glad someone is out there being helpful

PermalinkPermalink June 21, 2007 15:31:06
Comment from: Chris Lengquist [Visitor] Email · http://kansascityrealestateblog.blogspot.com

Thank you.  Sometimes I duplicate an entire post.  Sometimes I make people make the jump.  I'm not sure if I still will.   But this clears up the subject for me. 


AR gets noticed by google so much better than my other blog.  My blogger blog gets noticed by Yahoo!, MSN and Ask so much better than my AR blog. 


I just get confused and keep blogging.  I'm generating leads...so I'm happy.

PermalinkPermalink June 21, 2007 18:18:42
Comment from: Cyndee Haydon [Visitor] Email · http://realestateblog.clearwaterhomeconnection.com

Thanks for that info - I had asked that same question on AR and Ines told me you had a great explanation - thanks. Hearing nothing but great things about your services. :-)  Look forward to learning more.

PermalinkPermalink June 25, 2007 01:26:10
Comment from: Jennifer Kirby [Visitor] Email · http://minneapolisrealestateblog.blogspot.com

I was wondering about this very topic the other day. I am so glad I found your article on the Carnival of Real Estate. I wouldn't have though AR and my personal blog would be penalized, but it is good to be backed up for real results.

PermalinkPermalink July 03, 2007 10:45:16
Comment from: Christopher Myers [Visitor] Email · http://www.OrlandoPropertyGroup.com

Mary, thanks for the clarification!  I've been searching for an answer on this topic.  I should have thought to look here first!

PermalinkPermalink September 29, 2007 18:42:54
Comment from: Russell Rockefeller [Visitor] Email · http://www.Extravagantmedia.com

I don't intend on necroing this thread but I tripped over this while doing a Google search and saw my post from a few months ago. In case someone else comes here looking for answers on this topic, one thing I just thought of adding is that you can always use NoFollow tags on your duplicate or similar site content in order to be assured that you won't receive a penalty.


 

PermalinkPermalink December 29, 2007 19:43:23
Comment from: Guest [Visitor] Email · http://www.rsspieces.com
Mary.McKnight

Russell,


Thanks for dropping by and I agre no-follow tags would work, or you can always use the dreaded, no-cache/no-index tags.  Although, i don't much think that Google cares about the small time offenders- they are after the real crooks.

PermalinkPermalink January 20, 2008 16:52:42
Comment from: Patricia Beck [Visitor] Email · http://www.patricia-beck.com

I have been looking for an answer to this for a while now!!  Unfortunately, everyone has a different opinion.  I am so glad I found this article and it is in my favorites, I am going to read the other articles you linked in here.  I really appreciate you giving a clear answer to something so complicated! 

PermalinkPermalink February 29, 2008 11:53:57
Comment from: Dave Creaturo [Visitor] Email · http://www.Charlestonsouthernhomes.com
*****

Hi Mary,


My name is Dave Creaturo and I’m from Charleston SC. I’ve been reading your blogs for the last few months and have found your blog the most useful and user friendly that I’ve come across.


Here’s my Problem I started blogging a few months ago and found my website(page 30 in google)  www.Charlestonsouthernhomes.com reach #1 down to  #5 up to #2 down to #4 and so on. I then started copying my blog onto my AR blog and have seen my site drop to #12-15.    Keyword: Charleston SC real estate


What Happened???  Is it the Active Rain ghost?

PermalinkPermalink April 25, 2008 17:03:54
Comment from: Dan Simon [Visitor] Email · http://charlestonhomesforyou.com
****-

Great Post.  Very informative and to the point.  Im going to put some of these practices in place right away.

PermalinkPermalink July 23, 2008 12:46:16
Comment on this article


Your email address will not be displayed.


Your URL will be displayed.
Poor Excellent

Standard HTML is allowed in posts

Line breaks become <br />


Remember me


Allow users to contact you through a message form.
Captcha image.

Please enter the characters from the image above. (case insensitive)

This post has 7 feedbacks awaiting moderation...


real estate blogs

Like what you have been reading on this site?
Subscribe to our feed below.







Valid XHTML 1.0 Transitional