Similarity Texter

Weitere Informationen

DIE HOCHMÜTIGE DOHLE UND DER PFAU

Einst lebte eine Dohle, voll von eitlem Stolz, die stahl sich Federn, die dem Pfau entfallen warn und putzte sich damit. Das eigne Dohlenvolk ver- achtend trat sie in der schönen Pfauen Reihn. Der Unver- schämten reißt man hier die Federn aus, jagt sie mit Schnäbeln. Und die Dohle, bös verbleit will wieder nun betrübt zu ihrem Volk zurück. Die aber stoßen sie von sich, mit herbem Schimpf. Und eine derer, die zuvor verachtet, sprach zu ihr “Hätt’ unsre Lebensart dir vormals conveniert, hätt’st du, was die Natur dir schenkte, akzeptiert, dann wär dir weder jene Schande widerfahrn noch müsstest du zum Unglück jetzt verstoßen sein.”

Diese Version von Aesops Fabel ist aus Wilfried Strohs Sammlung von Übersetzungen von Jan Novák: “Aesopia”, die auf Geschichten von Phaedrus basieren.

C12-07 SIM_TEXT

Software Profile | Testoverview | Summary| Screenshots | Business Promotion | Links

Software Profile

ID C12-07
Product SIM_TEXT
Company University of Amsterdam
Web Site http://dickgrune.com/Programs/similarity_tester/
Software Type Desktop
Costs Free
Test Date 2012-03-27

Testoverview

Ranking for all tests: 3
Ranking for Source Code Test: 3
Ranking for Text Test: 6
Summary: useful

Summary

SIM_TEXT [http://dickgrune.com/Programs/similarity_tester/] is installed locally on a computer system by downloading the C source code files from the home page and starting it from a terminal [Alternatively, at http://de.vroniplag.wikia.com/wiki/Quelle:Textvergleichthere is a version implemented in JavaScript that can be used in any browser without installation.]. Despite the excellent technical documentation, the program is not easy to understand for non-computer scientists. The results are, however, excellent.

The reports are difficult to read at first, as they are only in text with no markings. But one can easily use the reported line numbers to quickly identify and mark the similar passages in an editor or a printed version of the offending documents. The similar passages are not sorted, so one must at times jump around between the files that were compared. At one point it was discovered that the results were only the differences between the files, not the similarities, although it was unclear what was changed in order for this to happen.

This system also does not work with German umlauts [Although it is possible, an online implementation at the VroniPlag Wiki, for example, does deal with umlauts properly.]. If individual words are changed too often in the text, the system will drop to a 0 % similarity measure. But the system in general does an excellent job of finding similarities and can certainly be integrated in other systems.

In summary, the system is difficult in install and use, but produces good results for text collusion and useful results for program collusion and is thus one of the systems that is useful for detecting collusion.


Screenshots


Screenshot 1: Overview


Business Promotion

SIMtests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp, Miranda, and natural language. It is used

  • to detect potentially duplicated code fragments in large software projects, in program text, in shell scripts and in documentation
  • to detect plagiarism in software projects, educational and otherwise“

Links

official website http://dickgrune.com/Programs/similarity_tester/