Out of over a million live articles, 13,915 duplicate article titles (only title is duplicated, content is unique) exist on 34,782 articles (3.5%) and we need your help to figure out what action to take.
- November 4th, 2005 we stopped accepting article titles that were exact-match duplicates of existing article titles.
- This was a defensive position prior to our current proprietary anti-duplicate content filters & routines that was receiving heavy development later that month in 2005.
- Early 2006-Today, we reversed our Nov 2005 position and began allowing exact-match duplicate article titles because we now had/have the capabilities to ensure algorithmically that the article body would not be duplicate or derivative of other works.
Why we’re thinking duplicate article titles are a liability?
(even in a world where we know for certain the article body is not a duplicate or derivative of any other article)?
Answer = We perceive that it’s better to have a site of 100% original article titles than to allow a small percentage of them to be duplicated. This is a perception not based on fact; but on belief that some traffic referral partners may give more weight to an article title than the article body and that ~21,000+ articles may be overlooked as not credible (therefore not indexed) even though they have quality original content in all of them.
How to solve this?
Here’s what we’ve ruled out so far:
- Simple strategy seems to be to append something to the article title itself so that only the 1st time an article title is used, all 2nd uses of the exact same title will have some additional related-value text auto-appended to it.
- What can not be auto-appended to duplicate article titles:
- Authors name
- Company Names/URL’s
- What should we append to duplicate article titles: _____________________
- Or take NO ACTION as we shouldn’t perceive this as a quality issue?
How do you think we should solve this problem?