All Projects → wesleyraptor → Streamingphish

wesleyraptor / Streamingphish

Licence: apache-2.0
Python-based utility that uses supervised machine learning to detect phishing domains from the Certificate Transparency log network.

Projects that are alternatives of or similar to Streamingphish

Docker Nginx Gunicorn Flask Letsencrypt
Boilerplate code for setting up Nginx + Gunicorn + Flask + automated LetsEncrypt certificates (https) using docker-compose.
Stars: ✭ 117 (-56.83%)
Mutual labels:  ssl-certificates, docker-compose
Sci Pype
A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.
Stars: ✭ 90 (-66.79%)
Mutual labels:  jupyter-notebook, docker-compose
Phishing catcher
Phishing catcher using Certstream
Stars: ✭ 1,232 (+354.61%)
Mutual labels:  phishing, certificate-transparency
Url Classification
Machine learning to classify Malicious (Spam)/Benign URL's
Stars: ✭ 95 (-64.94%)
Mutual labels:  jupyter-notebook, phishing
Pico
Object Detection and Analysis Made easy using Raspberry Pi, Apache Kafka, AWS Rekognition & Docker
Stars: ✭ 63 (-76.75%)
Mutual labels:  jupyter-notebook, docker-compose
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-50.18%)
Mutual labels:  jupyter-notebook, docker-compose
Pytorch tiramisu
FC-DenseNet in PyTorch for Semantic Segmentation
Stars: ✭ 267 (-1.48%)
Mutual labels:  jupyter-notebook
Bgp Dashboard
BGP Dashboard and Monitoring Web Application
Stars: ✭ 268 (-1.11%)
Mutual labels:  docker-compose
Deeplearning.ai Assignments
Stars: ✭ 268 (-1.11%)
Mutual labels:  jupyter-notebook
Lstm pose machines
Code repo for "LSTM Pose Machines" (CVPR'18)
Stars: ✭ 268 (-1.11%)
Mutual labels:  jupyter-notebook
Home Cloud
The "cloud" at home
Stars: ✭ 269 (-0.74%)
Mutual labels:  docker-compose
Deeplearningwithtf2.0
Practical Exercises in TensorFlow 2.0 for Ian Goodfellows Deep Learning Book
Stars: ✭ 270 (-0.37%)
Mutual labels:  jupyter-notebook
Graph nn
Graph Classification with Graph Convolutional Networks in PyTorch (NeurIPS 2018 Workshop)
Stars: ✭ 268 (-1.11%)
Mutual labels:  jupyter-notebook
Parallel Tutorial
Parallel computing in Python tutorial materials
Stars: ✭ 268 (-1.11%)
Mutual labels:  jupyter-notebook
Deep Learning
No description, website, or topics provided.
Stars: ✭ 3,058 (+1028.41%)
Mutual labels:  jupyter-notebook
Tutorial
A tutorial for widgets
Stars: ✭ 267 (-1.48%)
Mutual labels:  jupyter-notebook
Jenkins Bootstrap Shared
Jenkins as immutable infrastructure made easy. A repository of shared scripts meant to be used as a git submodule. Packing Jenkins, plugins, and scripts into immutable packages and images.
Stars: ✭ 270 (-0.37%)
Mutual labels:  docker-compose
Cookiecutter Docker Science
Cookiecutter template for data scientists working with Docker containers
Stars: ✭ 267 (-1.48%)
Mutual labels:  jupyter-notebook
Noah Research
Noah Research
Stars: ✭ 265 (-2.21%)
Mutual labels:  jupyter-notebook
Cutblur
Rethinking Data Augmentation for Image Super-resolution (CVPR 2020)
Stars: ✭ 269 (-0.74%)
Mutual labels:  jupyter-notebook

StreamingPhish

This is a utility that uses supervised machine learning to detect phishing domains from the Certificate Transparency log network. The firehose of domain names and SSL certificates are made available thanks to the certstream network (certstream.calidog.io). All of the data required for training the initial predictive model is included in this project as well.

Also included is a Jupyter notebook to help explain each step of the supervised machine learning lifecycle (as it pertains to this project).

Overview

StreamingPhish Diagram

This application consists of three main components:

  • Jupyter notebook
    • Demonstrates how to train a phishing classifier from start to finish.
  • CLI utility
    • Trains classifiers and evaluates domains in manual mode or against the Certificate Transparency log network (via certstream).
  • Database
    • Stores trained classifiers, performance metrics, and code for feature extraction.

Each segment has been functionally decomposed into its own Docker container. The application is designed to be built and operated via Docker Compose.

Install and Operational Instructions

Components

  • Docker - Containers that run the application.
  • Docker Compose - Fabric for orchestrating containers and their respective services.
  • Python3 - Programming language.
  • Scikit-learn - Open source library for training classifiers using Python.

Author

  • Wes Connell

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for further details.

Resources/Acknowledgments

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].