All Projects β†’ axenhammer β†’ Codar

axenhammer / Codar

Licence: mit
βœ… CODAR is a Framework built using PyTorch to analyze post (Text+Media) and predict Cyber Bullying and offensive content. πŸ’¬πŸ“·

Programming Languages

python3
1442 projects

Projects that are alternatives of or similar to Codar

Protest Detection Violence Estimation
Implementation of the model used in the paper Protest Activity Detection and Perceived Violence Estimation from Social Media Images (ACM Multimedia 2017)
Stars: ✭ 114 (+119.23%)
Mutual labels:  dataset, image-classification, social-media
Multidigitmnist
Combine multiple MNIST digits to create datasets with 100/1000 classes for few-shot learning/meta-learning
Stars: ✭ 48 (-7.69%)
Mutual labels:  dataset, image-classification
Awesome Project Ideas
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Stars: ✭ 6,114 (+11657.69%)
Mutual labels:  dataset, image-classification
Images Web Crawler
This package is a complete tool for creating a large dataset of images (specially designed -but not only- for machine learning enthusiasts). It can crawl the web, download images, rename / resize / covert the images and merge folders..
Stars: ✭ 51 (-1.92%)
Mutual labels:  dataset, image-classification
Intrinsic Image Popularity
The pytorch code of the paper "Intrinsic Image Popularity Assessment"
Stars: ✭ 179 (+244.23%)
Mutual labels:  dataset, social-media
Trashnet
Dataset of images of trash; Torch-based CNN for garbage image classification
Stars: ✭ 368 (+607.69%)
Mutual labels:  dataset, image-classification
Cvat
Powerful and efficient Computer Vision Annotation Tool (CVAT)
Stars: ✭ 6,557 (+12509.62%)
Mutual labels:  dataset, image-classification
Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+13869.23%)
Mutual labels:  dataset, image-classification
Deep learning projects
Stars: ✭ 28 (-46.15%)
Mutual labels:  dataset, image-classification
Mtnt
Code for the collection and analysis of the MTNT dataset
Stars: ✭ 48 (-7.69%)
Mutual labels:  dataset
Grafana Zabbix Dashboards
Grafana dashboards for Zabbix
Stars: ✭ 50 (-3.85%)
Mutual labels:  grafana
Rules grafana
Bazel rules for building Grafana dashboards
Stars: ✭ 46 (-11.54%)
Mutual labels:  grafana
Go Web Backend
Dockerized backend services for web application
Stars: ✭ 50 (-3.85%)
Mutual labels:  grafana
Grafana Prometheus Node Js Example
Step-by-step tutorial on creating beautiful dashboards for your Node JS application
Stars: ✭ 47 (-9.62%)
Mutual labels:  grafana
Bss distillation
Knowledge Distillation with Adversarial Samples Supporting Decision Boundary (AAAI 2019)
Stars: ✭ 51 (-1.92%)
Mutual labels:  image-classification
Social Media Depression Detector
πŸ˜” 😞 😣 πŸ˜– 😩 Detect depression on social media using the ssToT method introduced in our ASONAM 2017 paper titled "Semi-Supervised Approach to Monitoring Clinical Depressive Symptoms in Social Media"
Stars: ✭ 45 (-13.46%)
Mutual labels:  social-media
Watermarkreco
Pytorch implementation of the paper "Large-Scale Historical Watermark Recognition: dataset and a new consistency-based approach"
Stars: ✭ 45 (-13.46%)
Mutual labels:  dataset
Personal Influxdb
Import data from various APIs into InfluxDB
Stars: ✭ 51 (-1.92%)
Mutual labels:  grafana
Distil
πŸ’§ In memory dataset filtering, inspired by snikch/aggro
Stars: ✭ 49 (-5.77%)
Mutual labels:  dataset
Chinesetrafficpolicepose
Detects Chinese traffic police commanding poses ζ£€ζ΅‹δΈ­ε›½δΊ€θ­¦ζŒ‡ζŒ₯ζ‰‹εŠΏ
Stars: ✭ 49 (-5.77%)
Mutual labels:  dataset

Problem Statement

  • Cyber bullying involves posting, sharing wrong, private, negative, harmful information about victim. In today's digital world we see many such instances where a particular person is targeted. We are looking for the software solution to curb such bullying/harassment in cyber space. Such solution is expected to
    1. Work on social media such as twitter, facebook,etc.;.
    2. Facility to flag and report such incidents to authority.

Getting Started

The Software solution that we propose is Cyber Offense Detecting and Reporting (CODAR) Framework, A system that semi-automates the Internet Moderation Process.

What did we use?

Key Features ⭐️

  • [X] Finds the NSFW composition of a given YouTube video
  • [X] Perform Text Toxicity Prediction on public Facebook Posts/Comments using BeautifulSoup and Facebook API.
  • [X] Structures and Perform Text Toxicity Prediction on WhatsApp Chat Export Documents.
  • [X] Visualise Realtime Toxicity Scored on Tweets using Grafana.
  • [X] Chrome Extension to automatically block offensive content
  • [X] Reporting Portal for the public to report content.
  • [X] A custom Social Media Platform to test the capablities of this system.

Obscene Image Classification πŸ“·

⭐️⭐️⭐️ We have made our NSFW Image Classification Dataset accessible through Kaggle Link-Sharing and we have used the same. Our classification model for Content Moderation in Social Media Platforms are trained over 330,000 images on a pretrained RESNET50 in five β€œloosely defined” categories:

  1. pornography - Nudes and pornography images
  2. hentai - Hentai images, but also includes pornographic drawings
  3. sexually_provocative - Sexually explicit images, but not pornography. Think semi-nude photos, playboy, bikini, beach volleyball, etc. Considered acceptable by most public social media platforms.
  4. neutral - Safe for work neutral images of everyday things and people.
  5. drawing - Safe for work drawings (including anime, safe-manga)

Text Toxicity Prediction πŸ’¬

Our text classification BERT model is trained on the Jigsaw Toxic Comment Classification Dataset to predict the toxicity of texts to pre-emptively prevent any occurrence of cyberbullying and harassment before they tend to occur. We're chose BERT as to overcome challenges including understanding the context of text so as to detect sarcasm and cultural references, as it uses Stacked Transformer Encoders and Attention Mechanism to understand the relationship between words and sentences, the context from a given sentence.

Text_Input: I want to drug and rape her 
======================
Toxic: 0.987 
Severe_Toxic: 0.053 
Obscene: 0.100 
Threat 0.745 
Insult: 0.124 
Identity_Hate: 0.019 
======================
Result: Extremely Toxic as classified as Threat, Toxic 
Action: Text has been blocked. 


Screenshots (Click images for Full Resolution 🎯)

Confusion Matrix (91% Overall Accuracy) Finding the NSFW composition of a YouTube video (πŸ“·)
f f
Tested on more than 23,000 unseen images. Downloads the video, Find the NSFW composition using it's Keyframes
Realtime Tweet Toxicity prediction (πŸ’¬) Testing the models by integerating with own Social Media Platform (πŸ“·+πŸ’¬)
f f
We love Grafana Automatically hides NSFW content also shows a disclaimer
Reporting Portal for the public to report content (πŸ“·+πŸ’¬) Chrome Extension to automatically block offensive content (πŸ“·+πŸ’¬)
f f
The reporting portal with a dashboard to semi-automate the moderation process

Prerequisites

  • Expand for running CODAR on Raspberry Pi or other SBCs
  • Python Compiler (3.7 Recommended)
    • sudo apt update
      sudo apt install -y software-properties-common
      sudo apt install -y python3 python3-pip
      
  • Necessary Python3 Libraries for CODAR can be installed by running the following command:
    • sudo apt install -y python3-opencv
      pip install -r Social_Media_Platform/requirements.txt
      pip install -r Content_Moderation/requirements.txt
      pip install -r Reporting_Platform/requirements.txt
      
    • For installation of PyTorch, refer their official website.
  • A MongoDB Server, Grafana Sever, MySQL Server, and access to Twitter API
  • If you have Docker, you can use the below commands to quickly start a clean MySQL, Grafana and MongoDB Server
    • docker run -d -t -p 27017:27017 --name mongodb mongo
      docker run --name grafana -d -p 3000:3000 grafana/grafana
      # Runs MySQL server with port 3306 exposed and root password '0000' 
      docker run --name mysql -e MYSQL_ROOT_PASSWORD="0000" -p 3306:3306 -d mysql
      
  • Add credentails for your MySQL, Twitter API and MongoDB into the Flask Apps. Also, Import our Dashboard JSON into your Grafana Server and configure your data sources accordingly.

Contributors

Krishnakanth Alagiri Mahalakshumi V Vignesh S Nivetha MK
f f f f
@bearlike @mahavisvanathan @Vignesh0404 @nivethaakm99

Acknowledgments

License

MIT Β© Axenhammer



Made with ❀️ by Axemhammer

wave

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].