LeetCodeAt present contains scraped data from around 1500 problems present on the site. More to follow....
Stars: ✭ 45 (+21.62%)
arachnodHigh performance crawler for Nodejs
Stars: ✭ 17 (-54.05%)
scraperA web scraper starter project
Stars: ✭ 18 (-51.35%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+12972.97%)
CheerioFast, flexible, and lean implementation of core jQuery designed specifically for the server.
Stars: ✭ 24,616 (+66429.73%)
evineInteractive CLI Web Crawler
Stars: ✭ 140 (+278.38%)
fanslySimply scrape / download all the media from an fansly account
Stars: ✭ 351 (+848.65%)
roseAnalyse all kinds of data for a TV series
Stars: ✭ 34 (-8.11%)
scrapersscrapers for building your own image databases
Stars: ✭ 46 (+24.32%)
pyitauUnofficial client to access your Itaú bank data
Stars: ✭ 28 (-24.32%)
PyScholarA 'supervised' parser for Google Scholar
Stars: ✭ 74 (+100%)
xforestA super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.
Stars: ✭ 20 (-45.95%)
cat-messageFinds cat images/videos/gifs on reddit, sends them to my mom via applescript
Stars: ✭ 35 (-5.41%)
gHarvesterProof of concept for a security issue (in my opinion) that I found in accounts.google.com
Stars: ✭ 20 (-45.95%)
barclayscrapeA small app to programmatically mainpulate Barclays online banking
Stars: ✭ 57 (+54.05%)
TextClassification基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
Stars: ✭ 86 (+132.43%)
ColegaDondeEstaMiTFMUn bot de Twitter que comparte cada hora un TFM hasta que Cristina Cifuentes enseñe el suyo.
Stars: ✭ 14 (-62.16%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+43.24%)
Federal-Parliament-ScraperA scraper for obtaining information on the workings of the Belgian Federal Parliament.
Stars: ✭ 18 (-51.35%)
buptclassA nodejs-spider that gets the infomation of empty classrooms in BUPT
Stars: ✭ 29 (-21.62%)
gochanges**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-67.57%)
trawlerscraper for facebook, gab, google and tiktok
Stars: ✭ 20 (-45.95%)
wordpress-scraperSimple, easy-to-use scraper to scrape data from WordPress JSON API
Stars: ✭ 22 (-40.54%)
scrapeerEssential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.
Stars: ✭ 81 (+118.92%)
scriptsA collection of random scripts I coded up
Stars: ✭ 17 (-54.05%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+145.95%)
conferencias matutinas amloCSVs de las versiones estenográficas de las conferencias matutinas del Presidente Andres Manuel López Obrador ( Mañaneras AMLO )
Stars: ✭ 25 (-32.43%)
scibloxsciblox - Easier Data Science and Machine Learning
Stars: ✭ 48 (+29.73%)
INMET-API-temperatureCrawler dos dados metereológicos de estações convencionais do INMET (BDMEP)
Stars: ✭ 32 (-13.51%)
GChanScrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.
Stars: ✭ 31 (-16.22%)
dh-coreFunctional data science
Stars: ✭ 123 (+232.43%)
twpyTwitter High level scraper for humans.
Stars: ✭ 58 (+56.76%)
MetQyRepository for R package MetQy (read related publication here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6247936/)
Stars: ✭ 17 (-54.05%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+59.46%)
stweetAdvanced python library to scrap Twitter (tweets, users) from unofficial API
Stars: ✭ 287 (+675.68%)
doffya web auto run lib base on chrome headless
Stars: ✭ 13 (-64.86%)
scrapy-LBCAraignée LeBonCoin avec Scrapy et ElasticSearch
Stars: ✭ 14 (-62.16%)
KaliIntelligenceSuiteKali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.
Stars: ✭ 58 (+56.76%)
Medium-Stats-AnalysisExploring data and analyzing metrics for user-specific Medium Stats
Stars: ✭ 27 (-27.03%)
web-crawlerPython Web Crawler with Selenium and PhantomJS
Stars: ✭ 19 (-48.65%)
Apriori-and-Eclat-Frequent-Itemset-MiningImplementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.
Stars: ✭ 36 (-2.7%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-56.76%)
hierarchical-clusteringA Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+67.57%)
Website-downloader💡 Download the complete source code of any website (including all assets). [ Javascripts, Stylesheets, Images ] using Node.js
Stars: ✭ 615 (+1562.16%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+62.16%)
tinyPornManagerMade for pornhub. Fork from tinyMediaManager v3
Stars: ✭ 57 (+54.05%)
antA web crawler for Go
Stars: ✭ 264 (+613.51%)
PTTmineRParallel Searching and Crawling Data from PTT 🚀
Stars: ✭ 31 (-16.22%)