All Projects → custom-crawler → Similar Projects or Alternatives

93 Open source projects that are alternatives of or similar to custom-crawler

crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-33.33%)
Mutual labels:  crawling, crawling-framework
scrape-github-trending
Tutorial for web scraping / crawling with Node.js.
Stars: ✭ 42 (+27.27%)
Mutual labels:  crawling
Holiday Cn
📅🇨🇳 中国法定节假日数据 自动每日抓取国务院公告
Stars: ✭ 157 (+375.76%)
Mutual labels:  crawling
Awesome Puppeteer
A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+5136.36%)
Mutual labels:  crawling
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+500%)
Mutual labels:  crawling
mal-analysis
github repo for MyAnimeList analysis. Also links to the MAL dataset.
Stars: ✭ 31 (-6.06%)
Mutual labels:  crawling
Newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+34884.85%)
Mutual labels:  crawling
TeeChart-for-.NET-CSharp-WPF-samples
Assorted WPF examples
Stars: ✭ 18 (-45.45%)
Mutual labels:  wpf-application
RobotArmHelix
3D Simulation, forward and inverse kinematics of a robotic arm in C# using WPF and helix-toolkit
Stars: ✭ 84 (+154.55%)
Mutual labels:  wpf-application
Dig Etl Engine
Download DIG to run on your laptop or server.
Stars: ✭ 81 (+145.45%)
Mutual labels:  crawling
Pdf downloader
A Scrapy Spider for downloading PDF files from a webpage.
Stars: ✭ 18 (-45.45%)
Mutual labels:  crawling
Cdp4j
cdp4j - Chrome DevTools Protocol for Java
Stars: ✭ 232 (+603.03%)
Mutual labels:  crawling
socials
👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (+12.12%)
Mutual labels:  crawling
N2h4
네이버 뉴스 수집을 위한 도구
Stars: ✭ 177 (+436.36%)
Mutual labels:  crawling
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+418.18%)
Mutual labels:  crawling
Massivedl
Download a large list of files concurrently
Stars: ✭ 141 (+327.27%)
Mutual labels:  crawling
double-agent
A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+272.73%)
Mutual labels:  crawling
Corpuscrawler
Crawler for linguistic corpora
Stars: ✭ 127 (+284.85%)
Mutual labels:  crawling
Larkator
ARK dino locator that uses your saved .ark
Stars: ✭ 42 (+27.27%)
Mutual labels:  wpf-application
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (+203.03%)
Mutual labels:  crawling
crawlzone
Crawlzone is a fast asynchronous internet crawling framework for PHP.
Stars: ✭ 70 (+112.12%)
Mutual labels:  crawling-framework
Python Crawling Tutorial
Python crawling tutorial
Stars: ✭ 57 (+72.73%)
Mutual labels:  crawling
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+60.61%)
Mutual labels:  crawling
MoalemYar
A personal project for class management, using various technologies like WPF, Entityframwork, CodeFirst, Sqlite, Migration and more
Stars: ✭ 53 (+60.61%)
Mutual labels:  wpf-application
Scrapyrt
HTTP API for Scrapy spiders
Stars: ✭ 637 (+1830.3%)
Mutual labels:  crawling
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+15442.42%)
Mutual labels:  crawling
Memorious
Distributed crawling framework for documents and structured data.
Stars: ✭ 248 (+651.52%)
Mutual labels:  crawling
xXx dead xXx
b̶̡̪̬͒l̸̰̗̝̀ỏ̷̡̩g̴͇̑g̶̲̱̽͐i̵̹͗n̶̤̥͂̅̆g̴̮̾̅͜ ̷̧͎͆i̷̛͒͜͠n̸̥̺͒ ̶͚͚͊̿͜t̸̺͙̭̆̊̈́ḧ̶̟́̐e̸̱͔̟̓̓͝ ̶̨͔̾͛̑d̵̥̣̏ȧ̷̼̊r̷̰̝̥̅̌͝k̵̟̥̞̉̍͛
Stars: ✭ 19 (-42.42%)
Mutual labels:  crawling
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+46975.76%)
Mutual labels:  crawling
telegram-crawler
🕷 Automatically detect changes made to the official Telegram sites, clients and servers.
Stars: ✭ 84 (+154.55%)
Mutual labels:  crawling
Nutch
Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+6800%)
Mutual labels:  crawling
tech-seo-crawler
Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.
Stars: ✭ 57 (+72.73%)
Mutual labels:  crawling
pumba
Fetch, store and access user agent strings for different browsers
Stars: ✭ 12 (-63.64%)
Mutual labels:  crawling
Scrapy Selenium
Scrapy middleware to handle javascript pages using selenium
Stars: ✭ 550 (+1566.67%)
Mutual labels:  crawling
Crawler
Go process used to crawl websites
Stars: ✭ 147 (+345.45%)
Mutual labels:  crawling
TimeRecorder
工数管理アプリ
Stars: ✭ 51 (+54.55%)
Mutual labels:  wpf-application
Instagram Bot
An Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (+318.18%)
Mutual labels:  crawling
Simple.Wpf.DataGrid
An experiment to build a data grid (blotter) in WPF without using any third party libaries
Stars: ✭ 64 (+93.94%)
Mutual labels:  wpf-application
Bhban rpa
6개월 치 업무를 하루 만에 끝내는 업무 자동화(생능출판사, 2020)의 예제 코드입니다. 파이썬을 한 번도 배워본 적 없는 분들을 위한 예제이며, 엑셀부터 디자인, 매크로, 크롤링까지 업무 자동화와 관련된 다양한 분야 예제가 제공됩니다.
Stars: ✭ 124 (+275.76%)
Mutual labels:  crawling
core
The complete web scraping toolkit for PHP.
Stars: ✭ 1,110 (+3263.64%)
Mutual labels:  crawling
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (+278.79%)
Mutual labels:  crawling
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (+57.58%)
Mutual labels:  crawling
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+128212.12%)
Mutual labels:  crawling
podcastcrawler
PHP library to find podcasts
Stars: ✭ 40 (+21.21%)
Mutual labels:  crawling
Grawler
Grawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them in a file.
Stars: ✭ 98 (+196.97%)
Mutual labels:  crawling
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-48.48%)
Mutual labels:  crawling
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (+106.06%)
Mutual labels:  crawling
puppet-master
Puppeteer as a service hosted on Saasify.
Stars: ✭ 25 (-24.24%)
Mutual labels:  crawling
Crawling Projects
Web scraping and automation using python
Stars: ✭ 49 (+48.48%)
Mutual labels:  crawling
proxycrawl-python
ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+54.55%)
Mutual labels:  crawling
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+2290.91%)
Mutual labels:  crawling
pdf-crawler
SimFin's open source PDF crawler
Stars: ✭ 100 (+203.03%)
Mutual labels:  crawling
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (+1666.67%)
Mutual labels:  crawling
auctus
Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index
Stars: ✭ 34 (+3.03%)
Mutual labels:  crawling
BaiduSpider
项目已经移动至:https://github.com/BaiduSpider/BaiduSpider !! 一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 29 (-12.12%)
Mutual labels:  crawling
go-scrapy
Web crawling and scraping framework for Golang
Stars: ✭ 17 (-48.48%)
Mutual labels:  crawling
CaliburnMicro-Calculator
A simple Calculator using Caliburn.Micro (WPF with MVVM)
Stars: ✭ 19 (-42.42%)
Mutual labels:  wpf-application
zcrawl
An open source web crawling platform
Stars: ✭ 21 (-36.36%)
Mutual labels:  crawling
the-seinfeld-chronicles
A dataset for textual analysis on arguably the best written comedy television show ever.
Stars: ✭ 14 (-57.58%)
Mutual labels:  crawling
ioSender
A GCode Sender for Grbl and grblHAL written in C# (Windows only).
Stars: ✭ 142 (+330.3%)
Mutual labels:  wpf-application
1-60 of 93 similar projects