All Projects → lspahija → torchestrator

lspahija / torchestrator

Licence: Apache-2.0 license
Spin up Tor containers and then proxy HTTP requests via these Tor instances

Programming Languages

kotlin
9241 projects
python
139335 projects - #7 most used programming language
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to torchestrator

Proxybroker
Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭
Stars: ✭ 2,767 (+8546.88%)
Mutual labels:  proxy-server, proxies, proxypool, proxy-list, proxy-checker
proxi
Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.
Stars: ✭ 32 (+0%)
Mutual labels:  scraping, scrapy, proxypool, proxy-list
Proxy
A simple tool for fetching usable proxies from several websites.
Stars: ✭ 124 (+287.5%)
Mutual labels:  proxies, proxypool, proxy-list
Free-Proxy
Hi there will be a lot of proxies here.
Stars: ✭ 135 (+321.88%)
Mutual labels:  proxy-server, proxy-list, proxy-checker
ProxyChecker
proxy checker to check the status of the ip-port proxy list
Stars: ✭ 24 (-25%)
Mutual labels:  proxy-server, proxy-list, proxy-checker
proxy fetcher
💪 Ruby / JRuby / TrufflleRuby gem & CLI for dealing with proxy lists from various sources
Stars: ✭ 119 (+271.88%)
Mutual labels:  proxies, proxy-list, proxy-checker
ProxyGrab
Asynchronous Library made using Python and aiohttp to get proxies from multiple services!
Stars: ✭ 17 (-46.87%)
Mutual labels:  proxies, proxy-list, proxy-scraper
socks5 list
Auto-updated SOCKS5 proxy list + proxies for Telegram
Stars: ✭ 210 (+556.25%)
Mutual labels:  proxy-server, proxies, proxy-list
double-agent
A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+284.38%)
Mutual labels:  scraping, scrapy
ip2proxy-java
IP2Proxy Java Component.It can be used to find the IP addresses which are used as VPN anonymizer, open proxies, web proxies and Tor exits.
Stars: ✭ 16 (-50%)
Mutual labels:  tor, proxies
RARBG-scraper
With Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (+18.75%)
Mutual labels:  scraping, scrapy
ProxyChecker
An easy to use open-source, multithreaded Proxy Checker. Allows you to quickly check HTTP and SOCKS proxies in a user friendly GUI, for Windows, Mac OS, Linux.
Stars: ✭ 113 (+253.13%)
Mutual labels:  proxies, proxy-checker
RandomProxyRuby
Tiny Library for get random proxy (free).
Stars: ✭ 16 (-50%)
Mutual labels:  proxy-server, proxy-list
ipChecker
Check if a IP is from tor or is a malicious proxy
Stars: ✭ 50 (+56.25%)
Mutual labels:  tor, proxies
torpool
Containerized pool of multiple Tor instances with load balancing and HTTP proxy.
Stars: ✭ 42 (+31.25%)
Mutual labels:  tor, privoxy
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-46.87%)
Mutual labels:  scraping, scrapy
gochanges
**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-62.5%)
Mutual labels:  scraping, scraping-websites
LiveProxies
Asynchronous proxy checker
Stars: ✭ 17 (-46.87%)
Mutual labels:  proxy-server, proxy-list
scavenger
Scrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-56.25%)
Mutual labels:  scraping, scraping-websites
scrapman
Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (-34.37%)
Mutual labels:  scraping, scraping-websites

Torchestrator

Torchestrator can spin up Tor containers and expose ports for proxying HTTP requests via these Tor instances.

The IP address of the exit node of each Tor instance will vary. This is useful for IP address rotation.

To run:

  • set number of Tor containers you wish to spin up with config property tor.containerQuantity in torchestrator/src/main/resources/application.properties (default is 10, but can be a much higher number depending on RAM)
  • cd into torchestrator directory and execute ./gradlew bootRun (with JDK 15) or run in an IDE like IntelliJ IDEA
  • get next proxy port with GET localhost:8080/port

Examples

Scraping with Python using Torchestrator as a proxy provider:

import requests

torchestrator_host = "localhost"
tochestrator_port = 8080
torchestrator_api_path = "port"


def fetch_proxy_port():
    response = requests.get(f'http://{torchestrator_host}:{tochestrator_port}/{torchestrator_api_path}')
    return response.text


def get_using_proxy(url, proxy):
    return requests.get(url, proxies={"http": proxy, "https": proxy})


def scrape_using_proxy(url):
    proxy_port = fetch_proxy_port()
    response = get_using_proxy(url, f'{torchestrator_host}:{proxy_port}')
    print(response.text)
  
    
scrape_using_proxy("http://icanhazip.com/")

Similarly, a sample Kotlin HTTP client proxying requests via Tor instances can be found in torchestrator/src/main/kotlin/com/alealogic/torchestrator/client/ExampleClient.kt

Find this useful?

Please star this repository! It helps contributors gauge the popularity of the repo and determine how much time to allot to development.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].