For your first question, there is a tool for checking plagiarism. It's called CopyScape and can be found at: http://www.copyscape.com/
I don't understand your second question though. Sorry I couldn't be of more assistance.
If you are new to domains and looking to buy, sell and learn about domains then you have come to the right place. DNForum is the largest domain name community on the internet and continues to grow every day. There are over 105,000 domainers on DNForum doing everything from buying domains, selling domains, learning about domains and discussing domains. Take a minute and Register.
Register Today on DNForum IT'S FREE!I am looking for a tool that can compare two blocks of text
to see if they are the 'same' or different/unique content.
Is there such a tool? There must be.
Second question, how does google do this comparison?
How do they know when two pages have duplicate content?
Ideally, I want a tool that works equivalently to google's tool.
Thanks
For your first question, there is a tool for checking plagiarism. It's called CopyScape and can be found at: http://www.copyscape.com/
I don't understand your second question though. Sorry I couldn't be of more assistance.
I will try out copyscape. But copyscape is for text that is on a web page, I believe? What I want is to compare to blocks of text; maybe text I have just created in my word processor, for example.
To compare two blocks of text:
Try a diff tool, there are some for Windows:
http://www.google.com/search?q=diff+windows
But I am sure that's not how Google detects "duplicate content". Their algorithm must be more sophisticated.
I believe you need CopyScape Premium to search blocks of text only.
It can be found: http://www.copyscape.com/signup.php?pro=1&o=f
http://en.wikipedia.org/wiki/Latent_semantic_analysis
http://www.unl.edu/sbehrend/html/sbs...lAnalysis.html
Layman's language: you can build a tool like Google's algorithm (we're not talking about search here) for text comparison, but it will take a lot of time and effort.
There is always Copyscape.
You can use this free tool: http://www.plagium.com
when you put the text, it searches it and compares it to what found, to see its unique percentage (which is red circles). So, it's very useful if you purchased any content or articles, to see if it's spinned from other or original content.
Just go to Google and type in "duplicate content checker".
Here is one...
http://www.dupecop.com/compare-spun-articles.php
Bookmarks