
GregaVrbancic / Phishing-Dataset

Licence: other
Phishing dataset with more than 88,000 instances and 111 features. Web application available at https://gregavrbancic.github.io/Phishing-Dataset/

Datasets for Phishing Websites Detection

This repository presents two variants of the phishing dataset.

Web application

To preview the dataset interactively or tailor it to your needs, visit the dedicated web application at https://gregavrbancic.github.io/Phishing-Dataset/.

dataset_full.csv

Short description of the full variant dataset:

  • Total number of instances: 88,647
    • Number of legitimate website instances (labeled as 0): 58,000
    • Number of phishing website instances (labeled as 1): 30,647
  • Total number of features: 111 (without target)

dataset_small.csv

Short description of the small variant dataset:

  • Total number of instances: 58,645
    • Number of legitimate website instances (labeled as 0): 27,998
    • Number of phishing website instances (labeled as 1): 30,647
  • Total number of features: 111 (without target)
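As a quick sanity check on the counts listed for both variants, the following minimal sketch tallies the class labels and the number of non-target columns in a CSV file. It assumes the file has a header row and that the target column is named `phishing`, as in the feature table; the function name `summarize_labels` is hypothetical and not part of this repository.

```python
import csv
from collections import Counter


def summarize_labels(lines, target="phishing"):
    """Count rows per class label and the number of non-target columns.

    `lines` is any iterable of CSV text lines, e.g. an open file object.
    The column name "phishing" is taken from the feature table; adjust it
    if your copy of the dataset uses a different header.
    """
    reader = csv.DictReader(lines)
    counts = Counter(row[target] for row in reader)
    fieldnames = reader.fieldnames or []
    return {
        "instances": sum(counts.values()),
        "per_label": dict(counts),
        # Every column except the target counts as a feature.
        "features": max(len(fieldnames) - 1, 0),
    }


# Usage, assuming a local copy of the file exists:
# with open("dataset_full.csv", newline="") as f:
#     print(summarize_labels(f))
```

Labels are returned as strings ("0" and "1") because the `csv` module does not convert types. If the file matches the description above, the totals should agree with the listed instance and feature counts.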

Extracted Features

Feature Description
qty_dot_url count (.) in URL
qty_hyphen_url count (-) in URL
qty_underline_url count (_) in URL
qty_slash_url count (/) in URL
qty_questionmark_url count (?) in URL
qty_equal_url count (=) in URL
qty_at_url count (@) in URL
qty_and_url count (&) in URL
qty_exclamation_url count (!) in URL
qty_space_url count ( ) in URL
qty_tilde_url count (~) in URL
qty_comma_url count (,) in URL
qty_plus_url count (+) in URL
qty_asterisk_url count (*) in URL
qty_hashtag_url count (#) in URL
qty_dollar_url count ($) in URL
qty_percent_url count (%) in URL
qty_tld_url top-level-domain length
length_url URL length
qty_dot_domain count (.) in domain
qty_hyphen_domain count (-) in domain
qty_underline_domain count (_) in domain
qty_slash_domain count (/) in domain
qty_questionmark_domain count (?) in domain
qty_equal_domain count (=) in domain
qty_at_domain count (@) in domain
qty_and_domain count (&) in domain
qty_exclamation_domain count (!) in domain
qty_space_domain count ( ) in domain
qty_tilde_domain count (~) in domain
qty_comma_domain count (,) in domain
qty_plus_domain count (+) in domain
qty_asterisk_domain count (*) in domain
qty_hashtag_domain count (#) in domain
qty_dollar_domain count ($) in domain
qty_percent_domain count (%) in domain
qty_vowels_domain count vowels in domain
domain_length domain length
domain_in_ip URL domain in IP address format
server_client_domain domain contains the keywords "server" or "client"
qty_dot_directory count (.) in directory
qty_hyphen_directory count (-) in directory
qty_underline_directory count (_) in directory
qty_slash_directory count (/) in directory
qty_questionmark_directory count (?) in directory
qty_equal_directory count (=) in directory
qty_at_directory count (@) in directory
qty_and_directory count (&) in directory
qty_exclamation_directory count (!) in directory
qty_space_directory count ( ) in directory
qty_tilde_directory count (~) in directory
qty_comma_directory count (,) in directory
qty_plus_directory count (+) in directory
qty_asterisk_directory count (*) in directory
qty_hashtag_directory count (#) in directory
qty_dollar_directory count ($) in directory
qty_percent_directory count (%) in directory
directory_length directory length
qty_dot_file count (.) in file
qty_hyphen_file count (-) in file
qty_underline_file count (_) in file
qty_slash_file count (/) in file
qty_questionmark_file count (?) in file
qty_equal_file count (=) in file
qty_at_file count (@) in file
qty_and_file count (&) in file
qty_exclamation_file count (!) in file
qty_space_file count ( ) in file
qty_tilde_file count (~) in file
qty_comma_file count (,) in file
qty_plus_file count (+) in file
qty_asterisk_file count (*) in file
qty_hashtag_file count (#) in file
qty_dollar_file count ($) in file
qty_percent_file count (%) in file
file_length file length
qty_dot_params count (.) in parameters
qty_hyphen_params count (-) in parameters
qty_underline_params count (_) in parameters
qty_slash_params count (/) in parameters
qty_questionmark_params count (?) in parameters
qty_equal_params count (=) in parameters
qty_at_params count (@) in parameters
qty_and_params count (&) in parameters
qty_exclamation_params count (!) in parameters
qty_space_params count ( ) in parameters
qty_tilde_params count (~) in parameters
qty_comma_params count (,) in parameters
qty_plus_params count (+) in parameters
qty_asterisk_params count (*) in parameters
qty_hashtag_params count (#) in parameters
qty_dollar_params count ($) in parameters
qty_percent_params count (%) in parameters
params_length parameters length
tld_present_params TLD present in parameters
qty_params number of parameters
email_in_url email present in URL
time_response domain lookup response time
domain_spf domain has SPF
asn_ip AS Number (or ASN)
time_domain_activation time (in days) of domain activation
time_domain_expiration time (in days) of domain expiration
qty_ip_resolved number of resolved IPs
qty_nameservers number of resolved name servers (NS)
qty_mx_servers number of MX Servers
ttl_hostname time-to-live (TTL) value associated with hostname
tls_ssl_certificate valid TLS / SSL Certificate
qty_redirects number of redirects
url_google_index check if URL is indexed on Google
domain_google_index check if domain is indexed on Google
url_shortened check if URL is shortened
phishing target label (1 for a phishing website, 0 for a legitimate one)
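To illustrate how the character-count and length features in the table could be derived from a raw URL, here is a minimal sketch. It is not the authors' extraction code. The split into domain, directory, file and parameters is an assumption based on Python's `urllib.parse.urlsplit` (domain = network location, directory = path up to and including the last slash, file = the remainder of the path, parameters = query string). The names `SYMBOLS`, `split_url` and `extract_lexical_features` are hypothetical. Features that need network lookups (DNS, TLS, redirects, search-engine indexing) are not covered.

```python
from urllib.parse import parse_qsl, urlsplit

# Symbol -> name fragment, following the qty_<name>_<part> pattern in the table.
SYMBOLS = {
    ".": "dot", "-": "hyphen", "_": "underline", "/": "slash",
    "?": "questionmark", "=": "equal", "@": "at", "&": "and",
    "!": "exclamation", " ": "space", "~": "tilde", ",": "comma",
    "+": "plus", "*": "asterisk", "#": "hashtag", "$": "dollar",
    "%": "percent",
}


def split_url(url):
    """Split a URL into the parts named in the table (assumed split rule)."""
    parts = urlsplit(url)
    # Everything up to and including the last "/" is treated as the directory.
    head, sep, file_name = parts.path.rpartition("/")
    return {
        "url": url,
        "domain": parts.netloc,
        "directory": head + sep,
        "file": file_name,
        "params": parts.query,
    }


def extract_lexical_features(url):
    """Compute symbol counts and lengths for each URL part."""
    pieces = split_url(url)
    features = {}
    for part, text in pieces.items():
        for symbol, name in SYMBOLS.items():
            features[f"qty_{name}_{part}"] = text.count(symbol)
    features["length_url"] = len(url)
    features["domain_length"] = len(pieces["domain"])
    features["directory_length"] = len(pieces["directory"])
    features["file_length"] = len(pieces["file"])
    features["params_length"] = len(pieces["params"])
    features["qty_vowels_domain"] = sum(
        ch in "aeiou" for ch in pieces["domain"].lower()
    )
    features["qty_params"] = len(
        parse_qsl(pieces["params"], keep_blank_values=True)
    )
    return features


# Usage:
# feats = extract_lexical_features("http://example.com/a/b/login.php?id=1&x=2")
# print(feats["qty_dot_url"], feats["qty_params"])
```

In this sketch a missing directory, file or query string simply yields counts and lengths of 0. The published CSV files may encode absent parts differently, so compare against a few rows of the dataset before relying on the values.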

Cite this dataset

G. Vrbančič, I. Fister Jr., V. Podgorelec. Datasets for Phishing Websites Detection. Data in Brief, Vol. 33, 2020, DOI: 10.1016/j.dib.2020.106438
