Kalebu / Plagiarism-checker-Python

Licence: other
A Python project that checks documents for plagiarism using cosine similarity

Programming Languages

python

Projects that are alternatives of or similar to Plagiarism-checker-Python

Simple-Plagiarism-Checker
Web Application for checking the similarity between query and document using the concept of Cosine Similarity.
Stars: ✭ 47 (-58.77%)
Mutual labels:  cosine-similarity, python-project
solr-vector-scoring
Vector Plugin for Solr: calculate dot product / cosine similarity on documents
Stars: ✭ 28 (-75.44%)
Mutual labels:  cosine-similarity
Dr0pFi
Using this tool, You can capture all WiFi Passwords stored on Windows Computer and mail them to your email account!
Stars: ✭ 38 (-66.67%)
Mutual labels:  python-programming
cisip-FIRe
Fast Image Retrieval (FIRe) is an open source project to promote image retrieval research. It implements most of the major binary hashing methods to date, together with different popular backbone networks and public datasets.
Stars: ✭ 40 (-64.91%)
Mutual labels:  cosine-similarity
koolsla
Food recommendation tool with Machine learning.
Stars: ✭ 21 (-81.58%)
Mutual labels:  cosine-similarity
auto coding
A basic and simple tool for code auto completion
Stars: ✭ 42 (-63.16%)
Mutual labels:  python-programming
90 Python Examples
The best way to learn Python is by practicing examples. The repository contains examples of basic concepts of Python. You are advised to take the references from these examples and try them on your own.
Stars: ✭ 190 (+66.67%)
Mutual labels:  python-programming
stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Stars: ✭ 60 (-47.37%)
Mutual labels:  cosine-similarity
typed python
An llvm-based framework for generating and calling into high-performance native code from Python.
Stars: ✭ 178 (+56.14%)
Mutual labels:  python-programming
python-client
A Python library to ease the integration with the Beem Africa (SMS, AIRTIME, OTP, 2WAY-SMS, BPAY, USSD)
Stars: ✭ 24 (-78.95%)
Mutual labels:  tanzania
Content-based-Recommender-System
It is a content-based recommender system that uses tf-idf and cosine similarity to find the N most similar items in a dataset
Stars: ✭ 64 (-43.86%)
Mutual labels:  cosine-similarity
bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Stars: ✭ 24 (-78.95%)
Mutual labels:  cosine-similarity
lsh-rs
Locality Sensitive Hashing in Rust with Python bindings
Stars: ✭ 64 (-43.86%)
Mutual labels:  cosine-similarity
Awesome Python Books
📚 Directory of Python books
Stars: ✭ 3,154 (+2666.67%)
Mutual labels:  python-programming
atm-project
file processing project
Stars: ✭ 34 (-70.18%)
Mutual labels:  python-project
NPTEL-The-Joy-of-Computing-using-Python
Study materials related to this course.
Stars: ✭ 29 (-74.56%)
Mutual labels:  python-programming
qxresearch-event-1
10+ Python Application 🦾 | 10- lines of code 👽
Stars: ✭ 194 (+70.18%)
Mutual labels:  python-project
php-pesa
Open payment integration toolkit for PHP
Stars: ✭ 26 (-77.19%)
Mutual labels:  tanzania
Img2VecCosSim-Django-Pytorch
Extract a feature vector for any image and find the cosine similarity for comparison using Pytorch. I have used ResNet-18 to extract the feature vector of images. Finally a Django app is developed to input two images and to find the cosine similarity.
Stars: ✭ 20 (-82.46%)
Mutual labels:  cosine-similarity
Face recognition based attendance system
A python GUI integrated attendance system using face recognition to take attendance.
Stars: ✭ 70 (-38.6%)
Mutual labels:  python-project

Plagiarism-checker-Python

This repo contains the source code of a Python script that detects plagiarism in text documents using cosine similarity.

How is it done?

You might be wondering how plagiarism detection on textual data is done. It isn't as complicated as you may think.

We all know that computers are good with numbers. So, to compute the similarity between two text documents, the raw text is first transformed into vectors (arrays of numbers), and then basic vector math, here cosine similarity, is used to compute how similar they are.

This repo contains a basic example of how to do that.
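To make the idea concrete, here is a minimal sketch that uses only the Python standard library. It assumes plain word counts as the vector representation; the function names `vectorize` and `cosine_similarity` are illustrative and are not taken from this repo's `app.py`, which may vectorize the text differently.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Turn raw text into a sparse term-count vector (word -> count).

    Assumption: simple lowercase word counts; other weightings are possible.
    """
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse count vectors."""
    # Only words present in both documents contribute to the dot product.
    dot = sum(count * vec_b[word] for word, count in vec_a.items())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as sharing nothing
    return dot / (norm_a * norm_b)


doc_a = "the cat sat on the mat"
doc_b = "the cat lay on the rug"
score = cosine_similarity(vectorize(doc_a), vectorize(doc_b))
print(round(score, 3))  # prints 0.75
```

The two sentences share "the" (twice), "cat" and "on", so the dot product is 6, and each vector has length sqrt(8), which gives 6 / 8 = 0.75. A score near 1 means the documents use almost the same words in the same proportions; a score of 0 means they share no words.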

Getting started

To get started with the code in this repo, clone or download it to your machine as shown below:

git clone https://github.com/Kalebu/Plagiarism-checker-Python

Dependencies

Before you begin playing with the source code, you might need to install the dependencies as shown below:

pip3 install -r requirements.txt

Running the App

To run this code, place your text documents, with the .txt extension, in the project directory. When you run the script, it automatically loads all documents with that extension and computes the similarity between each pair, as shown below:

$-> cd Plagiarism-checker-Python
$ Plagiarism-checker-Python-> python3 app.py
('john.txt', 'juma.txt', 0.5465972177348937)
('fatma.txt', 'john.txt', 0.14806887549598566)
('fatma.txt', 'juma.txt', 0.18643448370323362)
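The pairwise comparison in the output above can be sketched as follows. This is an illustrative standard-library version, not the repo's actual `app.py`: the helper names (`term_counts`, `cosine`, `compare_documents`) are hypothetical, and it assumes word-count vectors and UTF-8 encoded files, so its scores will not necessarily match the numbers shown above.

```python
import math
import re
from collections import Counter
from itertools import combinations
from pathlib import Path


def term_counts(text):
    """Word -> count mapping for one document (assumed representation)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine(a, b):
    """Cosine similarity of two sparse count vectors; 0.0 if either is empty."""
    dot = sum(n * b[w] for w, n in a.items())
    norm = math.sqrt(sum(n * n for n in a.values())) * math.sqrt(
        sum(n * n for n in b.values())
    )
    return dot / norm if norm else 0.0


def compare_documents(directory="."):
    """Return (file_a, file_b, score) for every pair of .txt files."""
    # Load and vectorize every .txt file once, in sorted filename order.
    vectors = {
        path.name: term_counts(path.read_text(encoding="utf-8"))
        for path in sorted(Path(directory).glob("*.txt"))
    }
    # combinations() yields each unordered pair exactly once.
    return [
        (a, b, cosine(vectors[a], vectors[b]))
        for a, b in combinations(vectors, 2)
    ]
```

Calling `compare_documents()` from the project directory returns one tuple per pair of files, in the same shape as the output above. Note that this sketch picks up every `.txt` file in the directory, so a `requirements.txt` stored there would be compared as well; filter it out if that is not wanted.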

A Python library?

Would you rather use a Python library to compare strings and documents without spending time writing the vectorizers yourself? Then take a look at Pysimilar.

Explore it

Explore it and adapt it to your own use case. If you have any questions, feel free to reach out to me directly at isaackeinstein(at)gmail.com.

Issues

In case you have any difficulties or issues while trying to run the script, you can raise an issue.

Pull Requests

If you have something to add, I welcome pull requests with improvements. Your helpful contribution will be merged as soon as possible.

Give it a Star

If you find this repo useful, give it a star so that more people can get to know it.

Credits

All credit goes to kalebu