zackw / tbbscraper
Licence: other
Automated website scraping over Tor
Stars: ✭ 23
Programming Languages
python
139335 projects - #7 most used programming language
AngelScript
46 projects
C++
36643 projects - #6 most used programming language
c
50402 projects - #5 most used programming language
PLpgSQL
1095 projects
javascript
184084 projects - #8 most used programming language
# Automated website scraping (not actually using TBB) This software collects webpages, using a headless browser (PhantomJS), from many different network locations, via proxy servers. It could in principle use Tor for the proxy but right now it does not. There is also some software for analyzing the contents of the collected webpages. The management cannot guarantee that this is of any use to anyone or indeed that it works at all outside the context where it is used. License labeling is pretty spotty, but the intent is to use the Apache license for everything ( http://www.apache.org/licenses/LICENSE-2.0 )
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].