All Projects → Josue87 → roboxtractor

Josue87 / roboxtractor

Licence: GPL-3.0 license
Extract endpoints marked as disallow in robots files to generate wordlists.

Programming Languages

go
31211 projects - #10 most used programming language

Projects that are alternatives of or similar to roboxtractor

Dirsearch
Web path scanner
Stars: ✭ 7,246 (+18015%)
Mutual labels:  wordlist, enumeration, bug-bounty, fuzzing, bugbounty
SourceWolf
Amazingly fast response crawler to find juicy stuff in the source code! 😎🔥
Stars: ✭ 132 (+230%)
Mutual labels:  wordlist, fuzzing, bugbounty
vaf
Vaf is a cross-platform very advanced and fast web fuzzer written in nim
Stars: ✭ 294 (+635%)
Mutual labels:  bug-bounty, fuzzing, bugbounty
Onelistforall
Rockyou for web fuzzing
Stars: ✭ 213 (+432.5%)
Mutual labels:  wordlist, fuzzing, bugbounty
ronin-support
A support library for Ronin. Like activesupport, but for hacking!
Stars: ✭ 23 (-42.5%)
Mutual labels:  wordlist, fuzzing
tugarecon
Pentest: Subdomains enumeration tool for penetration testers.
Stars: ✭ 142 (+255%)
Mutual labels:  bug-bounty, bugbounty
brutas
Wordlists and passwords handcrafted with ♥
Stars: ✭ 32 (-20%)
Mutual labels:  wordlist, enumeration
Raccoon
A high performance offensive security tool for reconnaissance and vulnerability scanning
Stars: ✭ 2,312 (+5680%)
Mutual labels:  enumeration, fuzzing
UltimateCMSWordlists
📚 An ultimate collection wordlists of the best-known CMS
Stars: ✭ 54 (+35%)
Mutual labels:  wordlist, fuzzing
Dirsearch
A Go implementation of dirsearch.
Stars: ✭ 164 (+310%)
Mutual labels:  wordlist, enumeration
ras-fuzzer
RAS(RAndom Subdomain) Fuzzer
Stars: ✭ 42 (+5%)
Mutual labels:  fuzzing, bugbounty
fuzzmost
all manner of wordlists
Stars: ✭ 23 (-42.5%)
Mutual labels:  wordlist, bugbounty
wordlists
Aggregated wordlist pulled from commonly used tools for discovery, enumeration, fuzzing, and exploitation.
Stars: ✭ 94 (+135%)
Mutual labels:  enumeration, fuzzing
Fdsploit
File Inclusion & Directory Traversal fuzzing, enumeration & exploitation tool.
Stars: ✭ 199 (+397.5%)
Mutual labels:  enumeration, fuzzing
Crithit
Takes a single wordlist item and tests it one by one over a large collection of websites before moving onto the next. Create signatures to cross-check vulnerabilities over multiple hosts.
Stars: ✭ 182 (+355%)
Mutual labels:  enumeration, bugbounty
fleex
Fleex makes it easy to create multiple VPS on cloud providers and use them to distribute workloads.
Stars: ✭ 181 (+352.5%)
Mutual labels:  bug-bounty, bugbounty
Rescope
Rescope is a tool geared towards pentesters and bugbounty researchers, that aims to make life easier when defining scopes for Burp Suite and OWASP ZAP.
Stars: ✭ 156 (+290%)
Mutual labels:  enumeration, bugbounty
Asnlookup
Leverage ASN to look up IP addresses (IPv4 & IPv6) owned by a specific organization for reconnaissance purposes, then run port scanning on it.
Stars: ✭ 163 (+307.5%)
Mutual labels:  enumeration, bugbounty
wordlist generator
Unique wordlist generator of unique wordlists.
Stars: ✭ 41 (+2.5%)
Mutual labels:  wordlist, bugbounty
credcheck
Credentials Checking Framework
Stars: ✭ 50 (+25%)
Mutual labels:  bug-bounty, bugbounty

roboXtractor

This tool has been developed to extract endpoints marked as disallow in robots.txt file. It crawls the file directly on the web and has a wayback machine query mode (1 query for each of the previous 5 years).

Possible uses of roboXtractor:

  • Generate a customized wordlist of endpoints for later use in a fuzzing tool (-m 1).
  • Generate a list of URLs to visit (-m 0).


🛠️ Installation

If you want to make modifications locally and compile it, follow the instructions below:

> git clone https://github.com/Josue87/roboxtractor.git
> cd roboxtractor
> go build

If you are only interested in using the program:

> go get -u github.com/Josue87/roboxtractor

Note If you are using version 1.16 or higher and you have any errors, run the following command:

> go env -w GO111MODULE="auto"

🗒 Options

The flags that can be used to launch the tool:

Flag Type Description Example
u string URL to extract endpoints marked as disallow in robots.txt file. -u https://example.com
m uint Extract URLs (0) // Extract endpoints to generate a wordlist (>1 default) -m 1
wb bool Check Wayback Machine. Check 5 years (Slow mode) -wb
v bool Verbose mode. Displays additional information at each step -v
s bool Silen mode doesn't show banner -s

You can ignore the -u flag and pass a file directly as follows:

cat urls.txt | roboxtractor -m 1 -v

Only the results are written to the standard output. The banner and information messages with the -v flag are redirected to the error output,

👾 Usage

The following are some examples of use:

roboxtractor --help
cat urls.txt | roboxtractor -m 0 -v
roboxtractor -u https://www.example.com -m 1 -wb
cat urls.txt | roboxtractor -m 1 -s > ./customwordlist.txt
cat urls.txt | roboxtractor -s -v | uniq > ./uniquewordlist.txt
echo http://example.com | roboxtractor -v
echo http://example.com | roboxtractor -v -wb

🚀 Examples

Let's take a look at some examples. We have the following file:

image

Extracting endpoints:

image

Extracting URLs:

image

Checking Wayback Machine:

image

Github had many entries in the file, which were not useful, a cleaning process is done to avoid duplicates or entries with *. Check the following image:

image

For example:

  • /gist/*/*/* is transformed as gist.
  • /*/tarball is trasformed as tarball.
  • /, /* or similar entries are removed.

🤗 Thanks to

The idea comes from a tweet written by @remonsec that did something similar in a bash script. Check the tweet.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].