Duplicate content or pages with similar content is a major problem on the web both for users and for web designers and SEO specialists.
There are many tools around the web to help alleviate the problem. These are my favorites:
Webconfs Similar Page Checker after entering the 2 urls outputs the results as a percentage.
Check a website for duplicate content, broken links, internal page rank and redirections. The online tool can also creates an XML sitemap. There is a free version limited to one check every month and a certain number of pages.
Duplicate Content Tool (no longer available) Shows some interesting data like Total HTML similarity, Standard text similarity and Smart text similarity which analyses the main content of the page stripping out the footer, navigation, etc.
Plagiarism checker tool which allows you to check duplication of text and also makew URL searches to rule out plagiarism in your text.
Another tool (aptly named “Similarity Analyzer” ) that I used to use along time ago is from an Italian site called tool.motoricerca.info
Copyscape (http://www.copyscape.com/) is another good site to use. It has both free and paid options.
Thanks Caroline I use that as well
Duplicate content is definitely something you should stay away from, apart of copyscape there are also few really good software that checks for duplicate content
Shame the Duplicate Content Tool is not working anymore, it was really handy.
It’s hard to get the balance right (in terms of how much unique content to have on a page)