All Projects → FlyersWeb → dhtbay

FlyersWeb / dhtbay

Licence: Unlicense license
A DHT crawler and torrent indexer

Programming Languages

javascript
184084 projects - #8 most used programming language
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to dhtbay

torrent-webseed-creator
Webseeded torrent creator using GitHub Actions
Stars: ✭ 54 (-42.55%)
Mutual labels:  torrent
simple-torrent-android
A torrent client library for Android that utilizes frostwire-jlibtorrent. It supports sequential and simultaneous downloads.
Stars: ✭ 85 (-9.57%)
Mutual labels:  torrent
Nepali-News-Classifier
Text Classification of Nepali Language Document. This Mini Project was done for the partial fulfillment of NLP Course : COMP 473.
Stars: ✭ 13 (-86.17%)
Mutual labels:  classifier
aria-switch-control
ARIA Switch control component
Stars: ✭ 38 (-59.57%)
Mutual labels:  aria
UEHToR
UEHToR, #1 Torrent And File Leecher For Heroku, Based on Node.js. Simple to Install with full SEO Support
Stars: ✭ 28 (-70.21%)
Mutual labels:  torrent
pyAudioProcessing
Audio feature extraction and classification
Stars: ✭ 165 (+75.53%)
Mutual labels:  classifier
IMDb-Scout-Mod
Auto search for movie/series on torrent, usenet, ddl, subtitles, streaming, predb and other sites. Adds links to IMDb pages from hundreds various sites. Adds movies/series to Radarr/Sonarr. Adds external ratings from Metacritic, Rotten Tomatoes, Letterboxd, Douban, Allocine. Media Server indicators for Plex, Jellyfin, Emby. Dark theme/style for …
Stars: ✭ 177 (+88.3%)
Mutual labels:  torrent
bencode
PHP Bencode (BitTorrent) Encoder/Decoder
Stars: ✭ 19 (-79.79%)
Mutual labels:  torrent
scrapeer
Essential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.
Stars: ✭ 81 (-13.83%)
Mutual labels:  torrent
flixctl
A toolkit for controlling the infrastructure necessary for a true MaSaS (Movies and Shows as a Service) architecture.
Stars: ✭ 43 (-54.26%)
Mutual labels:  torrent
exatorrent
Easy to Use Torrent Client. Can be hosted in Cloud. Files can be streamed in Browser/Media Player.
Stars: ✭ 1,557 (+1556.38%)
Mutual labels:  torrent
ocr-machine-learning
OCR Machine Learning in python
Stars: ✭ 42 (-55.32%)
Mutual labels:  classifier
torrenttools
Commandline tool for inspecting, creating and editing BitTorrent metafiles.
Stars: ✭ 125 (+32.98%)
Mutual labels:  torrent
tailwindcss-aria-attributes
TailwindCSS variants for aria-* attributes
Stars: ✭ 92 (-2.13%)
Mutual labels:  aria
train-classifier-from-scratch
Machine Learning: Collect data online and train a classifier from scratch
Stars: ✭ 59 (-37.23%)
Mutual labels:  classifier
website-fingerprinting
Deanonymizing Tor or VPN users with website fingerprinting and machine learning.
Stars: ✭ 59 (-37.23%)
Mutual labels:  classifier
nyaascraper
An application to scrape and open magnet links for fansub groups from nyaa.si where batches don't already exist.
Stars: ✭ 34 (-63.83%)
Mutual labels:  torrent
Tanglestash
IOTA meets BitTorrent: An algorithm to persist any file onto the tangle of IOTA
Stars: ✭ 46 (-51.06%)
Mutual labels:  torrent
pytorch hand classifier
Simple hand classifier by Pytorch and ResNet
Stars: ✭ 91 (-3.19%)
Mutual labels:  classifier
Face-Recognition-FaceNet
A python script label faces in group photos using Facenet. 🎉
Stars: ✭ 21 (-77.66%)
Mutual labels:  classifier

dht-bay

A DHT crawler, torrent indexer and search portal all in nodeJS

DEPENDENCIES

This project works fine with :

  • node 6.3.1
  • npm 3.10.3
  • redis-server 2.8.17
  • mongod 2.4.10
  • aria2 1.33.1

INSTALL

Good to know

You should open your 6881/udp port to allow the crawler to have access to DHT network.

Use docker

Or you can just use the docker project version and run it using :

docker-compose up -d

It will automatically launch redis, mongo and aria2 then start crawling and categorizing for you. You'll have your DHT Crawler up and running. Crawling may take some time so be patient.

ARCHITECTURE

This diagram presents an overview of the project architecture and how each piece communicate with each other.

DHTBay Architecture

CONTENT

The project is composed of 4 modules as presented. Each module is independant and can be used separately without problem.

  • crawlDHT.js is responsible for crawling hashs from the DHT network. It will push hashes on a redis list called DHTS. It also provides a routing table backup system saving it each 10 minutes in a mongo collection called table.
  • loadDHT.js is responsible of loading hashes from the redis list DHTS and to download torrent metadat for indexation. It rely intensely on aria2 tool and tray to download it from torcache, torrage and through DHT.
  • loadTorrent.js is responsible of saving metadatas into our mongo instance in collection torrents. This will be our basis data.
  • categorize.js will try to categorize crawled torrent depending on file extensions. Because module only takes a limited number of extensions in account you can use classifier too.
  • classifier.js a bayesian classifier that will classify torrent that couldn't be classed by previous one. In order to work you need to train the classifier.
  • trainer.js the bayesian classifier trainer, based on categorize script classification it helps unknown torrent classification.

You could use the bayesian classifier when you've already had a bunch of torrent indexed. The more samples you'll have the more accurate it will be.

Please fork it, and use it everywhere you can.

IMPROVEMENTS

  • Improve categorization to support more extensions. Use a categorized API.

Have fun.

@flyersweb

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].