cmu-sei / sa-bAbI

sa-bAbI is a software assurance dataset generator, modeled on the bAbI natural language dataset generator.

This repository contains the code for the sa-bAbI code generator.

Contributors:

  • Carson Sestili
  • Will Snavely
  • Ben Cohen
  • Nathan VanHoudnos (nmvanhoudnos NEAR cert FULL STOP org)

Quickstart

A minimal example that takes you through an end-to-end analysis of generated code.

Clone the repository or unzip the archive, then change into the working directory.

$ unzip SA-bAbI-<version>.zip
$ cd sa-babi-<version>

Build the Docker images and set up the workspace.

$ docker-compose build
$ mkdir working

Generate two datasets: a training set and a test set. The SA_SEED variable is optional but is used here for reproducibility.

$ SA_SEED=0 ./sa_e2e.sh working/sa-train-1000 1000
$ SA_SEED=1 ./sa_e2e.sh working/sa-test-100 100

Within the output directory you will find the following (a short inspection sketch of the confusion-matrix CSVs follows this list):

  • alerts/ containing CSV files of the alerts each static analyzer found per file
  • clang_sa/ containing XML output from clang
  • cppcheck/ containing XML output from cppcheck
  • frama-c/ containing text output from frama-c
  • src/ containing the generated sa-bAbI .c files
  • tokens/ containing the tokenized .c files
  • tool_confusion_matrix.csv reporting results on the whole dataset
  • tool_confusion_matrix_sound.csv reporting results on the sound subsample of the dataset
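
The confusion-matrix files can be inspected directly with a few lines of Python. The sketch below assumes only that they are ordinary CSV files and makes no assumptions about their column names.

import csv
from pathlib import Path

# Path to a dataset produced by sa_e2e.sh; adjust to your own working directory.
out_dir = Path("working/sa-train-1000")

# Print both confusion-matrix CSVs row by row. The exact columns are
# determined by the pipeline, so nothing beyond printing is attempted here.
for name in ("tool_confusion_matrix.csv", "tool_confusion_matrix_sound.csv"):
    path = out_dir / name
    print(f"== {name} ==")
    with path.open(newline="") as f:
        for row in csv.reader(f):
            print(", ".join(row))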

Deep learning quickstart

Build and activate the conda environment for the deep learning component.

$ cd pipeline
$ conda env create -f environment.yml
$ source activate sa_babi

Train the deep learning model on the training data. Note that the pipeline/constants.py file is already set up for working/sa-train-1000. If you wish to use different data, modify pipeline/constants.py appropriately before running the following command.

$ python train.py
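
Which values to change depends on the paths defined in pipeline/constants.py. If you are unsure which constants those are, the short sketch below (run from the pipeline directory, with the sa_babi environment active) simply lists the string-valued constants the module defines; these are typically the dataset paths to adjust.

# List the string constants defined in pipeline/constants.py, which are
# typically the dataset paths to point at a different working directory.
import constants

for name in dir(constants):
    value = getattr(constants, name)
    if not name.startswith("_") and isinstance(value, str):
        print(f"{name} = {value}")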

The current validation script only serves the needs of the arXiv.org paper. TODO: Develop a more general validation script.

Docker Infrastructure

This repository, in part, contains an automated system for:

  1. Generating source code
  2. Generating tokens from the source
  3. Running open-source static analysis tools on the source
  4. Scoring the tool outputs against ground truth (a simplified sketch of this step appears below)

This system is built with Docker and docker-compose.
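
Conceptually, the scoring step compares the source lines each tool flags with the ground-truth labels recorded for each generated file. The following is a simplified, self-contained sketch of that comparison, not the repository's actual scoring code.

# Simplified illustration of scoring tool alerts against ground truth.
# Both inputs map a file name to a set of line numbers: lines flagged by
# the tool, and lines known to contain a fault.
def confusion_counts(tool_alerts, ground_truth):
    tp = fp = fn = 0
    for fname, true_lines in ground_truth.items():
        flagged = tool_alerts.get(fname, set())
        tp += len(flagged & true_lines)   # flagged and actually faulty
        fp += len(flagged - true_lines)   # flagged but not faulty
        fn += len(true_lines - flagged)   # faulty but missed
    return {"true_pos": tp, "false_pos": fp, "false_neg": fn}

# Toy example: one file with a fault on line 12 that the tool flags,
# plus a spurious alert on line 20.
print(confusion_counts({"ex_001.c": {12, 20}}, {"ex_001.c": {12}}))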

Prerequisites

This system has primarily been run on Linux (Ubuntu 16.04), but it should work on any Unix-like system (including macOS), assuming the following are present (a quick check is sketched after the list):

  1. docker
  2. docker-compose
  3. bash
  4. realpath (available via the coreutils Homebrew package on macOS)
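
The quick check mentioned above can be a few lines of Python (used here for consistency with the rest of the pipeline), verifying that the required executables are on your PATH:

# Verify that the executables required by the pipeline are on PATH.
import shutil

for tool in ("docker", "docker-compose", "bash", "realpath"):
    status = "found" if shutil.which(tool) else "MISSING"
    print(f"{tool}: {status}")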

Building

The necessary Docker images can be built with:

docker-compose build

Running the SA Pipeline

The sa_e2e.sh script runs the tool pipeline end-to-end. The usage for this script is:

bash sa_e2e.sh <working_dir> <num_instances>

Where <working_dir> is the path to a local directory where outputs will be stored (it will be created if it does not exist), and <num_instances> is the number of testcases that will be generated.

After this script is run, <working_dir> will contain:

  1. Generated source files in the src directory
  2. Tokenized source files in the tokens directory
  3. Raw tool outputs in directories named with tool names (e.g. cppcheck)
  4. Aggregated tool alerts in the alerts directory, in a common CSV format (see the sketch after this list)
  5. Confusion matrices for tools in tool_confusion_matrix.csv and tool_confusion_matrix_sound.csv
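
The exact columns of the aggregated alert CSVs are determined by the pipeline; the sketch below (referenced in item 4) assumes only a single header line per file and counts how many alerts each CSV contains.

# Count the alerts reported in each aggregated CSV under alerts/.
import csv
from pathlib import Path

alerts_dir = Path("working/sa-train-1000/alerts")   # adjust to your working directory
for csv_path in sorted(alerts_dir.glob("*.csv")):
    with csv_path.open(newline="") as f:
        rows = list(csv.reader(f))
    n_alerts = max(len(rows) - 1, 0)   # assumes one header line per file
    print(f"{csv_path.name}: {n_alerts} alerts")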

Setting the RNG seed

The testcases are randomly generated based on a seed. By default, this seed is set to a random value, but you can set it to a specific value by setting the SA_SEED environment variable, e.g.

SA_SEED=10 bash sa_e2e.sh <working_dir> <num_instances>
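
If you prefer to drive seeded runs from Python, for example to generate several reproducible datasets in one go, the same environment variable can be passed through subprocess. This is only a convenience sketch, equivalent to the shell command above.

# Equivalent to: SA_SEED=<seed> bash sa_e2e.sh <working_dir> <num_instances>
import os
import subprocess

datasets = [("working/sa-train-1000", "1000"), ("working/sa-test-100", "100")]
for seed, (out_dir, n_instances) in enumerate(datasets):
    env = dict(os.environ, SA_SEED=str(seed))
    subprocess.run(["bash", "sa_e2e.sh", out_dir, n_instances], env=env, check=True)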

Running the Juliet Pipeline

Tools can be run against a subset of the Juliet test suite using the juliet_run_tools.sh script. Usage:

bash juliet_run_tools.sh <working_dir>

Where <working_dir> is a local directory where outputs will be stored. The outputs in this directory are very similar to those for the SA pipeline, except that no "sound" confusion matrix is generated.

Changelog

Version 0.1

Initial release

Document markings

# sa-bAbI: An automated software assurance code dataset generator
# 
# Copyright 2018 Carnegie Mellon University. All Rights Reserved.
#
# NO WARRANTY. THIS CARNEGIE MELLON UNIVERSITY AND SOFTWARE
# ENGINEERING INSTITUTE MATERIAL IS FURNISHED ON AN "AS-IS" BASIS.
# CARNEGIE MELLON UNIVERSITY MAKES NO WARRANTIES OF ANY KIND, EITHER
# EXPRESSED OR IMPLIED, AS TO ANY MATTER INCLUDING, BUT NOT LIMITED
# TO, WARRANTY OF FITNESS FOR PURPOSE OR MERCHANTABILITY, EXCLUSIVITY,
# OR RESULTS OBTAINED FROM USE OF THE MATERIAL. CARNEGIE MELLON
# UNIVERSITY DOES NOT MAKE ANY WARRANTY OF ANY KIND WITH RESPECT TO
# FREEDOM FROM PATENT, TRADEMARK, OR COPYRIGHT INFRINGEMENT.
#
# Released under a MIT (SEI)-style license, please see license.txt or
# contact [email protected] for full terms.
#
# [DISTRIBUTION STATEMENT A] This material has been approved for
# public release and unlimited distribution. Please see Copyright
# notice for non-US Government use and distribution.
# 
# Carnegie Mellon (R) and CERT (R) are registered in the U.S. Patent
# and Trademark Office by Carnegie Mellon University.
#
# This Software includes and/or makes use of the following Third-Party
# Software subject to its own license:
# 1. clang (http://llvm.org/docs/DeveloperPolicy.html#license)
#     Copyright 2018 University of Illinois at Urbana-Champaign.
# 2. frama-c (https://frama-c.com/download.html) Copyright 2018
#     frama-c team.
# 3. Docker (https://www.apache.org/licenses/LICENSE-2.0.html)
#     Copyright 2004 Apache Software Foundation.
# 4. cppcheck (http://cppcheck.sourceforge.net/) Copyright 2018
#     cppcheck team.
# 5. Python 3.6 (https://docs.python.org/3/license.html) Copyright
#     2018 Python Software Foundation.
# 
# DM18-0995
# 