All Projects → emareg → paper-checker

emareg / paper-checker

Licence: other
Find simple grammar mistakes in scientific documents.

Programming Languages

python
139335 projects - #7 most used programming language
TeX
3793 projects

Projects that are alternatives of or similar to paper-checker

wowchemy-hugo-themes
🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, easily build with blocks! 创建在线课程,学术简历或初创网站。#OpenScience
Stars: ✭ 6,891 (+42968.75%)
Mutual labels:  academic
cloudref
CloudRef - collaboratively managing references in the cloud
Stars: ✭ 53 (+231.25%)
Mutual labels:  academic
grammarify
Grammarify is a npm package that safely cleans up text that has mispellings, improper capitalization, lexical illusions, among other things.
Stars: ✭ 43 (+168.75%)
Mutual labels:  grammar-checker
awesome-academic-writing
MERGED with https://github.com/writing-resources/awesome-scientific-writing
Stars: ✭ 39 (+143.75%)
Mutual labels:  academic
checkyoself
Markdown Grammar Checker for blog posts, etc.
Stars: ✭ 38 (+137.5%)
Mutual labels:  grammar-checker
ml-ai
ML-AI Community | Open Source | Built in Bharat for the World | Data science problem statements and solutions
Stars: ✭ 32 (+100%)
Mutual labels:  academic
tips-for-scientific-research
A collection of tips for scientific research
Stars: ✭ 21 (+31.25%)
Mutual labels:  academic
awesome-AI-academia
A curated list of academic AI resources and people. Partly an aspirational list, partly a memory aide.
Stars: ✭ 41 (+156.25%)
Mutual labels:  academic
hugo-prose
A Hugo theme derived from the XMin theme, and inspired by Wowchemy, Distill, and tufte.css
Stars: ✭ 76 (+375%)
Mutual labels:  academic
mutschler.eu
Source code for my personal website https://mutschler.eu
Stars: ✭ 59 (+268.75%)
Mutual labels:  academic
academic-bloggers-toolkit
WordPress plugin providing an all-in-one solution for effective academic blogging.
Stars: ✭ 117 (+631.25%)
Mutual labels:  academic
yart
RMarkdown Template for writing PDF reports
Stars: ✭ 81 (+406.25%)
Mutual labels:  academic
hacktoberfest
Contribua com os projetos de código aberto do curso e ganhe prêmios por isso!
Stars: ✭ 25 (+56.25%)
Mutual labels:  academic
WordSegmentationDP
Word Segmentation with Dynamic Programming
Stars: ✭ 18 (+12.5%)
Mutual labels:  spell-checker
cppcombinator
parser combinator and AST generator in c++17
Stars: ✭ 20 (+25%)
Mutual labels:  grammar-checker
cv
[LaTeX] My academic CV
Stars: ✭ 31 (+93.75%)
Mutual labels:  academic
dslib
🌿 A library of "connected" data structures
Stars: ✭ 122 (+662.5%)
Mutual labels:  academic
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+32118.75%)
Mutual labels:  academic
fla.sh
💥 flashcards in your terminal
Stars: ✭ 99 (+518.75%)
Mutual labels:  academic
evalsp20.classes.andrewheiss.com
🎓 GSU MPA/MPP course on program evaluation and causal inference
Stars: ✭ 22 (+37.5%)
Mutual labels:  academic

PaperCheck

Actions Status Actions Status Code style: black

PaperCheck is a python script that searches for simple grammar mistakes in scientific english texts. Unlike other grammar checkers it is free and tailored for scientific texts, such as papers. It might find words that pass a spell check but are most likely not intended in a scientific context, such as "angel" vs. "angle".

Getting Started

git clone https://github.com/emareg/paper-checker.git
cd paper-checker
make setup

Afterwards, you can use the script in two ways:

1. Run the python file

python3 papercheck.py -sgy example/testfile.tex

2. Compile as a stand-alone executable (Unix only)

make
./papercheck -sgy example/testfile.tex

Supported file types: .tex .txt .md .pdf

The found issues are displayed in the terminal and also written into papercheck_report.html

System wide installation

make install

This will copy the stand-alone executable to ~/.local/bin

Install as a Python package

pip3 install .
cd example
python3 -m papercheck -sgy testfile.tex

Features

Spell Checker (-s option)

Will highlight spelling errors. The script uses a small basic dictionary plus some additional self-made dictionaries for terms such as

  • technical: “microcontroller”, “superframe”, “bitmask”
  • mathematical: “eigenvector”, “linearization”
  • chemical: todo

The larger standard dictionaries are unsuitable because they

  • contain errors such as “longitudianl” or “schemati”
  • mask informal plural forms such as “vertexes” which should be “vertices”
  • include obsolete forms such as “latence” which should be “latency”

Grammar Checker (-g option)

Will highlight simple grammar mistakes such as

  • misuse of “a” or “an”
  • doubled auxiliary verbs (e.g. “is are”)
  • doubled determiners (e.g. “this the”)
  • confused “then” vs. “than”
  • confused “to” vs. “too”
  • wrong person-verb combination (e.g. “This were”)

Style Checker (-y option)

Will highlight language that could be improved such as

  • wrong words in scientific context (e.g. “angle” vs. “angel”)
  • non explained acronyms
  • improve less formal words (e.g. use “entire” instead of “whole”)

Plagiarism Checker (-p option)

experimental!

The script will try to find significant sentences, which are then compared to Google search results. This is a very poor approach but useful as a minimal effort with zero cost.

TeX checker

When you run the script on .tex files, it will also check for certain TeX problems such as

  • unused labels
  • missing periods in figure/table captions
  • unused math operators in math mode, e.g. $sin$ instead of $\sin$

Related Work

  • LanguageTool: Grammar, Style and Spell Checker written in Java
  • textidote: uses LanguageTool on .tex files

So why not use LanguageTool? It is large, slow and not tailored for scientific/technical texts. However, I recommend to use LanguageTool in addition.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].