Welcome to SimDocSin

What is SimDocSin ?



SimDocSin is a cross-lingual document similarity checking tool for Sinhala and English.

This system can be used to find similar documents or parts of documents of Sinhala (English) language to a given document of English (Sinhala) language. System consists of two parts.

Full Matching

Here an user can submit a source document to get any matching complete document that exists in the system database in target language. Here the user has to set 3 input fields. Those are:-


Partial Matching

Here an user can submit a source document to get any matching partials of documents that exist in the system database in target language. Here also the user has to set the 3 inputs fields mentioned in the previous section. Apart from those there is another input field called Min Length. Min Length is the minimum number of sentences the user expected to have in a document partial. It can be 1, greater than 1, greater than 2, greater than 5 or greater than 10.

Creators:-
Udhan Isuranga (udhanisuranga.16@cse.mrt.ac.lk)
Janaka Sandaruwan (janakasadaruwan.16@cse.mrt.ac.lk)
Udesh Athukorala (udeshathukorala.16@cse.mrt.ac.lk)