
Ovsyanka83 / autograder

License: GPL-3.0
Automatic assignment grading for instructor use in programming courses

Programming Languages

  • python
  • c

Projects that are alternatives of or similar to autograder

markbot
An application that automatically tests and marks student code assignments in Algonquin College Graphic Design’s Web Dev courses.
Stars: ✭ 23 (+64.29%)
Mutual labels:  grading
AndroidOMRHelper
An android application for validating images of OMR sheets before they are sent for processing.
Stars: ✭ 38 (+171.43%)
Mutual labels:  grading
pedal
A collection of tools to analyze student's Python source code
Stars: ✭ 22 (+57.14%)
Mutual labels:  grading
Manual Testing
This repository contains the General Test Cases for performing Manual Testing on the Web/Mobile application. It also has Test cases related to API Testing. Templates related to Test Plan and BugBash are also updated.
Stars: ✭ 134 (+857.14%)
Mutual labels:  testcases
freetron
Open-source scantron software implementation
Stars: ✭ 14 (+0%)
Mutual labels:  grading
judgels
Competitive programming platform
Stars: ✭ 77 (+450%)
Mutual labels:  grading-system

This utility aims to provide a simple, yet secure and highly configurable way to autograde programming assignments.

I consider it feature-complete. Autograder has been tested in a real university class with hundreds of students and has proven to be error-free (in terms of grades), fast, and resistant to cheating.

Note

If you wish to use autograder as a professor or student, configuring and running it through the GUI is recommended -- it is a lot simpler, just as fast, and just as versatile.

The command-line utility is intended for advanced use cases (extending autograder, grading on a server, or integrating it as part of a larger utility/app).

Table of Contents

Features
Platform Support
Installation
Supported Programming Languages
Quickstart
Usage
Writing testcases
Helper functions
Limitations
Anti Cheating
Adding Programming Languages

Features

  • Blazingly fast (can grade hundreds of submissions using dozens of testcases in a few minutes, or in seconds if grading Python)
  • Easy to grade
  • Easy-to-write testcases
  • Testcase grade can be based on student's stdout
  • Can grade C, C++, Java, and Python code in regular mode
  • Can grade any programming language in stdout-only mode
  • A file with testcase grades and details can be generated for each student
  • You can customize the total points for the assignment, the maximum running time of a student's program, the file names considered for grading, the formatters used to check student stdout, and much more
  • Anti-cheating capabilities that make it nearly impossible for students to cheat
  • Grading submissions in multiple programming languages at once
  • JSON result output supported if autograder needs to be integrated as a part of a larger utility
  • Can check submissions for similarity (plagiarism)
  • Can detect and report memory leaks in C/C++ code

Platform Support

  • Linux is fully supported
  • OS X is fully supported
  • Windows is partially supported:
    • Stdout-testcases that require shebang lines are not and cannot be supported

Installation

  • Python >= 3.7 is currently required.
  • Run pip install assignment-autograder
  • If you want to update to a newer version, run pip install -U --no-cache-dir assignment-autograder
  • To grade various programming languages, you will need to install:
    • gcc/clang for C/C++ support
    • a Java JDK for Java support
    • make for compiled stdout-only testcase support
    • any interpreter/compiler necessary to run stdout-only testcases; for example, testcases with ruby in their shebang lines will require the Ruby interpreter

Supported Programming Languages

  • Java
  • C
  • C++
  • CPython (3.6-3.9)
  • Any programming language if stdout-only grading is used

Quickstart

  • Run autograder guide path/to/directory/you'd/like/to/grade. The guide will create all of the necessary configurations and directories for grading and will explain how to grade.
  • Read Usage section

Usage

  1. Create a tests directory in the same directory as the student submissions. Its structure is shown in examples (it can be created automatically using --guide).
  2. Optional files that can be created automatically by the --guide CLI option and whose use is demonstrated in examples:
    1. Input (stdin) and expected output (stdout) text files in their respective directories for each testcase. If a testcase does not require input and/or stdout, the respective text file is not required either.
    2. Create config.ini and change the configuration to fit your needs (if you omit some fields, autograder will use the respective fields from default_config.ini).
    3. Create stdout_formatters.py and edit it to fit your needs. These formatters normalize the student's stdout so that you can give credit even when it is not exactly the same as expected (see the sketch after this list).
  3. Write testcases as described below, using examples as reference.
  4. Run autograder run path/to/submissions/dir from the command line.
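The sketch below illustrates the kind of normalization a stdout formatter might perform. It is only an illustration: the exact function names and signatures autograder expects in stdout_formatters.py are generated by the guide, so treat lenient_formatter and its signature as placeholders.

    # Illustrative only: the name lenient_formatter and its signature are
    # placeholders, not autograder's required API. The assumption here is that
    # a formatter maps the raw stdout string to a normalized string.
    def lenient_formatter(stdout: str) -> str:
        # Ignore case, surrounding whitespace, and blank lines so that minor
        # formatting differences do not cost students credit.
        lines = (line.strip().lower() for line in stdout.splitlines())
        return "\n".join(line for line in lines if line)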

Writing testcases

  • Write a main that follows the same structure as one of the examples in your programming language. The main should usually call the student's code, check its result, and call one of the helper functions; when working with stdout, you do not check the result and simply let autograder handle grading by calling CHECK_STDOUT(). A minimal sketch follows this list.
  • Assume that the student's code is available in your namespace. Examples demonstrate exactly how to call students' functions.
  • Assume that the helper functions (described below) are predefined, and use them to return student scores to the grader.
  • Each helper function prints the student's score and the validation string, terminates the program, and returns an exit code that tells autograder whether the testcase produced a result, detected a cheating attempt, or requires stdout checking.
  • Each testcase is graded out of 100%, and each grade is a 64-bit double-precision floating-point number, which means that you have full control over how much partial credit is given in tests that do not rely on stdout checking.
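For example, a minimal Python testcase could look like the following sketch. The student function add and the expected value are assumptions made purely for illustration; PASS() and FAIL() are the predefined helpers described in the next section.

    # Hypothetical Python testcase. The student submission is assumed to define
    # add(a, b); PASS() and FAIL() are the predefined helper functions.
    if add(2, 3) == 5:
        PASS()   # full credit, equivalent to RESULT(100)
    else:
        FAIL()   # no credit, equivalent to RESULT(0)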

Helper functions

  • CHECK_STDOUT() indicates that we do not check the student's return values for the testcase and only care about their output (stdout), which autograder checks automatically against the output files with the same name stem as the testcase (beware: printing anything within your testcase can break this functionality)
  • RESULT(double r) returns the student's score r (0-100) back to the grader (see the example after this list)
  • PASS() returns a score of 100% back to the grader and is equivalent to RESULT(100)
  • FAIL() returns a score of 0% back to the grader and is equivalent to RESULT(0)
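Because each testcase grade is a floating-point percentage, partial credit is easy to express. A hedged sketch, assuming the student submission defines a hypothetical sort_numbers function:

    # Hypothetical partial-credit testcase: sort_numbers() is assumed to be
    # defined by the student submission.
    if sort_numbers([3, 1, 2]) != [1, 2, 3]:
        FAIL()           # basic case fails: 0%
    elif sort_numbers([]) == []:
        PASS()           # both cases pass: 100%
    else:
        RESULT(50.0)     # only the basic case passes: 50%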

Limitations

  • At the time of writing, stdout checking is a PASS-or-FAIL process (i.e. no partial credit is possible). The reason is that allowing 'partial similarity' of outputs is too error-prone and could award too many points to students who did not actually complete the task properly. If you want to increase the chances of students' stdout matching, use the stdout formatters described above.
  • If you do not prototype the student functions you want to test in your C/C++ testcases, you will run into undefined behavior because of how C and C++ handle linking.
  • Students' main functions ARE NOT meant to be accessed because the testcase must be the starting point of the program. They are, however, accessible in C/C++ as __student_main__ if necessary.

Anti Cheating

One of the main weaknesses of automatic grading is how prone it is to cheating. Autograder tries to solve this problem with the methods described in this section. Currently (as far as I have read and tested), it is impossible to cheat autograder. Java might still leave some obscure loopholes, but there are protections against all of the most popular scenarios (decompiling and parsing testcases, calling System.exit, trying to read the security key from environment variables, and using reflection to access private members of the test helper).

  • To prevent students from exiting the process themselves and printing a grade of their choice, I validate testcase stdout using a pseudorandom key called the validation string. Autograder passes the string to the testcase as an environment variable, which is erased right after the testcase saves it; the string is then automatically printed on the last line of stdout before the testcase exits. Autograder then pops it from stdout and verifies that it is the same string it sent. If it is not, the student gets the respective error message and a 0 on the testcase. (A conceptual sketch follows this list.)
  • To prevent students from simply importing the string from the testcase file, the test helper files (described above) all have some way of disallowing imports. For C/C++, it is the static keyword; for Java, it is private method modifiers and an automatic testcase failure if reflection is detected; for Python, it is throwing an error and deleting the validation string if __name__ != "__main__". I assume that similar precautions can be implemented in almost any language.
  • Simply parsing the validation string out of the testcase file is impossible because it is passed at runtime.
  • As an additional (and maybe unnecessary) security measure, autograder precompiles testcases without linking for all languages except Java, which further decreases the chance that a student can simply parse the testcase file and figure out the correct return values if the measures above somehow fail.
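The following conceptual sketch pictures the validation-string handshake described above. Every name in it (the environment-variable name, _finish, _verify) is an illustrative placeholder, not one of autograder's actual identifiers.

    # Conceptual sketch of the validation-string handshake; all names below are
    # illustrative placeholders, not autograder's real identifiers.
    import os

    ENV_VAR = "VALIDATION_STRING"  # hypothetical environment variable name

    # Test-helper side (runs inside the testcase process): save the key, then
    # erase it so the student's code cannot read it back from the environment.
    _validation_string = os.environ.pop(ENV_VAR, "")

    def _finish(score: float) -> None:
        print(score)
        print(_validation_string)   # printed as the very last line of stdout
        raise SystemExit(0)

    # Grader side (runs inside autograder's process): pop the last line of the
    # captured stdout and compare it with the string that was sent.
    def _verify(captured_stdout: str, sent_string: str) -> bool:
        *_, last_line = captured_stdout.rstrip("\n").split("\n")
        return last_line == sent_string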

Adding Programming Languages

  • If you want to add a new language for grading, you have to:

    1. Create a new directory in autograder/testcase_types/
    2. Create a python module in that directory that contains a subclass of TestCase (from autograder/testcase_utils/abstract_testcase.py)
    3. Create a helpers directory and write your test helper
    4. Optionally, add the extra (extra files made available to each testcase) and templates (example testcases written in the new language) directories; see the layout sketch at the end of this section
  • Use the other testcase subclasses and test helpers as reference

  • This point is optional, but if you want full anti-cheating capabilities for your new language, you will need to consider three things:

    • Does your language support getting and unsetting environment variables? This is required to save the validation string in your code without leaking it to students.
    • Does your language support private-to-file functions/classes/methods/variables? This is required to prevent students from simply importing the helper functions and/or the validation string.
    • Does your language support precompilation (conversion to bytecode without linking)? This is not as important as the other points, but it could speed up grading and hide testcase code from students.
  • You can extend many other capabilities of autograder using new testcase types. For example, the C testcase type adds memory-leak detection on its own for both C and C++ testcases.
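For orientation, a new testcase type's directory could be laid out roughly as follows. The language name mylang and the module file name are placeholders; only testcase_types/, helpers/, extra/, and templates/ come from the steps above.

    autograder/testcase_types/mylang/
        mylang_testcase.py   # a Python module containing your TestCase subclass (name it as you like)
        helpers/             # your test helper for the new language
        extra/               # optional: extra files made available to each testcase
        templates/           # optional: example testcases written in the new language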
