Welcome to Welcome to DNF.com™ - Domain Sales, Domain Forum, Domain Appraisals, Domain Registrars

If you are new to domains and looking to buy, sell and learn about domains then you have come to the right place. DNForum is the largest domain name community on the internet and continues to grow every day. There are over 105,000 domainers on DNForum doing everything from buying domains, selling domains, learning about domains and discussing domains. Take a minute and Register.

Register Today on DNForum IT'S FREE!

Results 1 to 12 of 12
  1. #1
    Exclusive Lifetime Member

    Join Date
    Nov 2007
    Location
    A, A
    Posts
    2,313
    DNF$
    4,383
    Bank
    0
    Total DNF$
    4,383
    Donate  

    How to compare two blocks of text, to Test for copied content?

    I am looking for a tool that can compare two blocks of text
    to see if they are the 'same' or different/unique content.
    Is there such a tool? There must be.

    Second question, how does google do this comparison?
    How do they know when two pages have duplicate content?
    Ideally, I want a tool that works equivalently to google's tool.

    Thanks
    Best PPC? PM FOR DETAILS! ‹(•¿•)›
    Best Hosting: HOSTGATOR

    For Sale: DevDomains.com $499

  2. #2
    Platinum Lifetime Member
    amplify's Avatar
    Join Date
    Sep 2009
    Location
    Okinawa, Japan
    Posts
    699
    DNF$
    522
    Bank
    3,868
    Total DNF$
    4,390
    Donate  
    For your first question, there is a tool for checking plagiarism. It's called CopyScape and can be found at: http://www.copyscape.com/

    I don't understand your second question though. Sorry I couldn't be of more assistance.

  3. #3
    Exclusive Lifetime Member

    Join Date
    Nov 2007
    Location
    A, A
    Posts
    2,313
    DNF$
    4,383
    Bank
    0
    Total DNF$
    4,383
    Donate  
    I will try out copyscape. But copyscape is for text that is on a web page, I believe? What I want is to compare to blocks of text; maybe text I have just created in my word processor, for example.
    Best PPC? PM FOR DETAILS! ‹(•¿•)›
    Best Hosting: HOSTGATOR

    For Sale: DevDomains.com $499

  4. #4
    Platinum Lifetime Member
    hina's Avatar
    Join Date
    May 2008
    Location
    DomainLand
    Posts
    364
    DNF$
    881
    Bank
    0
    Total DNF$
    881
    Donate  
    To compare two blocks of text:
    Try a diff tool, there are some for Windows:
    http://www.google.com/search?q=diff+windows

    But I am sure that's not how Google detects "duplicate content". Their algorithm must be more sophisticated.


  5. #5
    Platinum Lifetime Member
    amplify's Avatar
    Join Date
    Sep 2009
    Location
    Okinawa, Japan
    Posts
    699
    DNF$
    522
    Bank
    3,868
    Total DNF$
    4,390
    Donate  
    I believe you need CopyScape Premium to search blocks of text only.

    It can be found: http://www.copyscape.com/signup.php?pro=1&o=f

  6. #6
    Exclusive Lifetime Member

    Join Date
    Nov 2009
    Location
    USA
    Posts
    60
    DNF$
    355
    Bank
    0
    Total DNF$
    355
    Donate  
    Quote Originally Posted by ksinclair View Post
    I am looking for a tool that can compare two blocks of text
    to see if they are the 'same' or different/unique content.
    Is there such a tool? There must be.
    Copyscape is there, like amplify said. You can always build your own tool.

    Quote Originally Posted by ksinclair View Post
    Second question, how does google do this comparison?
    How do they know when two pages have duplicate content?
    Ideally, I want a tool that works equivalently to google's tool.

    Thanks
    Google does much more than mere textual comparison. They analyze the sets for related words/phrases (LSA). They also do contextual analysis of the sets.

  7. #7
    Platinum Lifetime Member
    amplify's Avatar
    Join Date
    Sep 2009
    Location
    Okinawa, Japan
    Posts
    699
    DNF$
    522
    Bank
    3,868
    Total DNF$
    4,390
    Donate  
    Quote Originally Posted by domaindot View Post
    Google does much more than mere textual comparison. They analyze the sets for related words/phrases (LSA). They also do contextual analysis of the sets.
    Do you mind to elaborate on that? It's way over my head.

  8. #8
    Exclusive Lifetime Member

    Join Date
    Nov 2009
    Location
    USA
    Posts
    60
    DNF$
    355
    Bank
    0
    Total DNF$
    355
    Donate  
    http://en.wikipedia.org/wiki/Latent_semantic_analysis

    http://www.unl.edu/sbehrend/html/sbs...lAnalysis.html


    Layman's language: you can build a tool like Google's algorithm (we're not talking about search here) for text comparison, but it will take a lot of time and effort.

    There is always Copyscape.

  9. #9
    Platinum Lifetime Member

    Join Date
    Jan 2010
    Posts
    104
    DNF$
    747
    Bank
    0
    Total DNF$
    747
    Donate  
    You can use this free tool: http://www.plagium.com

  10. #10
    Exclusive Lifetime Member

    Join Date
    Nov 2007
    Location
    A, A
    Posts
    2,313
    DNF$
    4,383
    Bank
    0
    Total DNF$
    4,383
    Donate  

    re: plagium

    Quote Originally Posted by mannoo2005 View Post
    You can use this free tool: http://www.plagium.com

    i dont understand it. a lot of colored boxes. I did a test
    of content that is likely to be unique but it gave a lot of results.
    maybe its not unique or is it searching every word input,
    as 'or' for each word? it shoudl search it as a whole,
    as a set string.
    Best PPC? PM FOR DETAILS! ‹(•¿•)›
    Best Hosting: HOSTGATOR

    For Sale: DevDomains.com $499

  11. #11
    Platinum Lifetime Member

    Join Date
    Jan 2010
    Posts
    104
    DNF$
    747
    Bank
    0
    Total DNF$
    747
    Donate  
    when you put the text, it searches it and compares it to what found, to see its unique percentage (which is red circles). So, it's very useful if you purchased any content or articles, to see if it's spinned from other or original content.

  12. #12
    CrossLogix.com
    copper's Avatar
    Join Date
    Mar 2006
    Location
    Matthews, NC. U
    Posts
    2,560
    DNF$
    3,880
    Bank
    0
    Total DNF$
    3,880
    Donate  
    Just go to Google and type in "duplicate content checker".

    Here is one...
    http://www.dupecop.com/compare-spun-articles.php

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

Domain name forum recommended by Domaining.com