All Projects → tharidu → breakingcaptcha

tharidu / breakingcaptcha

Licence: other
Breaking captchas!!

Programming Languages

java
68154 projects - #9 most used programming language
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to breakingcaptcha

captcha-recognition
End-to-end captcha image recognition using PyTorch and CTC loss binding.
Stars: ✭ 29 (-23.68%)
Mutual labels:  captcha
eros-plugin-ios-TencentCaptcha
腾讯防水墙、滑动验证、类似bilibili滑动验证码
Stars: ✭ 21 (-44.74%)
Mutual labels:  captcha
hcaptcha-bundle
A Symfony 4+ bundle to bring hCaptcha into your forms
Stars: ✭ 15 (-60.53%)
Mutual labels:  captcha
captcha-generator
An NPM package to generate captcha images that can be used in Discord bots or various other projects
Stars: ✭ 45 (+18.42%)
Mutual labels:  captcha
Captcha-Tools
All-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (-39.47%)
Mutual labels:  captcha
captcha-breaking-library
Neural network, contour analysis, bitmap vector subtraction CAPTCHA solving library and scripting language with perceptive color space segmentation
Stars: ✭ 76 (+100%)
Mutual labels:  captcha
Captcha
.net core Captcha Service
Stars: ✭ 38 (+0%)
Mutual labels:  captcha
adonis-recaptcha2
Google reCAPTCHA for AdonisJS
Stars: ✭ 24 (-36.84%)
Mutual labels:  captcha
imagetyperz-api-python3
imagetyperz-api-python3 - is a super easy to use bypass captcha API wrapper for imagetyperz.com captcha service
Stars: ✭ 25 (-34.21%)
Mutual labels:  captcha
hcaptcha-solver-python-selenium
hCaptcha solver and bypasser for Python Selenium. Simple website to try to solve hCaptcha.
Stars: ✭ 32 (-15.79%)
Mutual labels:  captcha
captcha-ios
iOS Captcha Solver
Stars: ✭ 33 (-13.16%)
Mutual labels:  captcha
django-rest-captcha
No description or website provided.
Stars: ✭ 25 (-34.21%)
Mutual labels:  captcha
Z-Spider
一些爬虫开发的技巧和案例
Stars: ✭ 33 (-13.16%)
Mutual labels:  captcha
shellfirm
Intercept any risky patterns (default or defined by you) and prompt you a small challenge for double verification
Stars: ✭ 159 (+318.42%)
Mutual labels:  captcha
phone-captcha
📱 Block robocalls with captcha for phone calls
Stars: ✭ 32 (-15.79%)
Mutual labels:  captcha
CAPTCHA Reader
🐝 PHP 验证码识别与训练 脚手架
Stars: ✭ 142 (+273.68%)
Mutual labels:  captcha
2captcha-go
Golang Module for easy integration with the API of 2captcha captcha solving service to bypass recaptcha, hcaptcha, funcaptcha, geetest and solve any other captchas.
Stars: ✭ 31 (-18.42%)
Mutual labels:  captcha
Slider Captcha Crack
🌈Slider_Captcha_Crack某教育网站滑动验证码破解(识别率100%)
Stars: ✭ 49 (+28.95%)
Mutual labels:  captcha
League-of-Legends-Accounts-Creator
Accounts Creator for League of Legends.
Stars: ✭ 46 (+21.05%)
Mutual labels:  captcha
go-captcha
Go Captcha is a behavioral captcha, which implements the generation of random verification text and the verification of click position information.
Stars: ✭ 86 (+126.32%)
Mutual labels:  captcha

Breaking CAPTCHA repository

This repository contains the report and code for our project in Scalable Machine Learning and Deep Learning (ID2223) at KTH.

The aim of the project is to break CAPTCHAs using deep learning technologies. Initially we focus on simple CAPTCHAs to evaluate the performance and move into more complex CAPTCHAs. The training dataset is generated from an open source CAPTCHA generation software. Tensorflow is used to create, train and test the network.

Generating the dataset

The datasets is generated using a Java based CAPTCHA generator. The easiest way is to import the Java project into IntelliJ, compile, and execute breakingCAPTCHA/data_gen/src/kth/id2223/Main.java. We have generated the following datasets.

Description Size Training samples Test samples
Digits only 38 MB 9502 100
Digits and characters 197 MB 49796 100
Digits and characters with rotation 39 MB 10000 100
Digits and characters with rotation 198 MB 49782 500
Digits and characters with rotation 777 MB 196926 500

Each dataset contains jpeg images containing a CAPTCHA with five characters. The characters are lowercase letters (a-z) or numbers (0-9). We used the fonts "Arial" and "Courier" with and without rotation. An example of the created CAPTCHAs is displayed below. Our intention was to mimic the CAPTCHAs created by Microsoft.

CAPTCHA1 Simple digit-only CAPTCHA
CAPTCHA2 Characters and digits without rotation
CAPTCHA3 Characters and digits with rotation

Executing the CNN

We have developed the code using Tensorflow 8.0 with Python 2.7.6 and Nvidia GPUs with CUDA 8.0. The CNN can be executed from the root folder of the project with cd simple_CNN and PYTHONPATH=".." python CAPTCHA_cnn.py.

After a successful training, the learned model is saved as model.ckpt. Then you can use PYTHONPATH=".." python restore_CAPTCHA_cnn.py filename.jpg to predict an image file using the learned model.

Results

First unsuccessful tries

DigitsOnly660M
CNN with three conv. layers and two fully connected layers accuracy of CAPTCHAs with five digits or lowercase letters without rotation. Training in 100 batches and 10000 training samples.

Digit-only CAPTCHAs

DigitsOnly
CNN with three conv. layers and two fully connected layers accuracy of CAPTCHAs with five digits without rotation. Training in 157 batches, 39250 training samples, and testing with 100 CAPTCHAs.

Digit and letter CAPTCHAs

DigitsChar
CNN with three conv. layers and two fully connected layers accuracy of CAPTCHAs with five digits or lowercase letters without rotation. Training in 199 batches, 49750 training samples, and testing with 500 CAPTCHAs.

Digit and letter CAPTCHAs with rotation

DigitsCharRot
CNN with three conv. layers and two fully connected layers accuracy of CAPTCHAs with five digits or lowercase letters with rotation. Training in 787 batches, 196926 training samples, and testing with 500 CAPTCHAs.

Examples of correct and false predictions

Correct
correct1 prediction: 54563
correct2 prediction: grh56
correct3 prediction: fb2x4

False
false1 prediction: 82298
false2 prediction: k76ap
false3 prediction: fffgr

Report

We have compiled a full blog-style report with more details. Selected results can be found in the results folder.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].