The Universal Device Detection library will parse any User Agent and detect the browser, operating system, device used (desktop, tablet, mobile, tv, cars, console, etc.), brand and model.

Stars: ✭ 2,106 (+17450%)

Mutual labels: user-agent

N2h4

네이버 뉴스 수집을 위한 도구

Stars: ✭ 177 (+1375%)

Mutual labels: crawling

useragent-generator

Easily generate correct user-agent strings for popular browsers

Stars: ✭ 62 (+416.67%)

Mutual labels: user-agent

Holiday Cn

📅🇨🇳 中国法定节假日数据自动每日抓取国务院公告

Stars: ✭ 157 (+1208.33%)

Mutual labels: crawling

Devicedetector.net

The Universal Device Detection library will parse any User Agent and detect the browser, operating system, device used (desktop, tablet, mobile, tv, cars, console, etc.), brand and model.

Stars: ✭ 144 (+1100%)

Mutual labels: user-agent

Massivedl

Download a large list of files concurrently

Stars: ✭ 141 (+1075%)

Mutual labels: crawling

dxram

A distributed in-memory key-value storage for billions of small objects.

Stars: ✭ 25 (+108.33%)

Mutual labels: in-memory-storage

Newspaper

News, full-text, and article metadata extraction in Python 3. Advanced docs:

Stars: ✭ 11,545 (+96108.33%)

Mutual labels: crawling

Parser Php

Browser sniffing gone too far — A useragent parser library for PHP

Stars: ✭ 1,626 (+13450%)

Mutual labels: user-agent

Corpuscrawler

Crawler for linguistic corpora

Stars: ✭ 127 (+958.33%)

Mutual labels: crawling

tech-seo-crawler

Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.

Stars: ✭ 57 (+375%)

Mutual labels: crawling

Awesome Puppeteer

A curated list of awesome puppeteer resources.

Stars: ✭ 1,728 (+14300%)

Mutual labels: crawling

Http request randomizer

Proxying Python Requests

Stars: ✭ 110 (+816.67%)

Mutual labels: user-agent

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (+733.33%)

Mutual labels: crawling

proxycrawl-python

ProxyCrawl Python library for scraping and crawling

Stars: ✭ 51 (+325%)

Mutual labels: crawling

Dig Etl Engine

Download DIG to run on your laptop or server.

Stars: ✭ 81 (+575%)

Mutual labels: crawling

Useragent.js

A User-agent analyze project.

Stars: ✭ 70 (+483.33%)

Mutual labels: user-agent

Python Crawling Tutorial

Python crawling tutorial

Stars: ✭ 57 (+375%)

Mutual labels: crawling

jsgraph

Deprecated: Use the @encapsule/arccore package that includes the graph library

Stars: ✭ 42 (+250%)

Mutual labels: in-memory-storage

Pdf downloader

A Scrapy Spider for downloading PDF files from a webpage.

Stars: ✭ 18 (+50%)

Mutual labels: crawling

Parser Javascript

Browser sniffing gone too far — A useragent parser library for JavaScript

Stars: ✭ 66 (+450%)

Mutual labels: user-agent

Scrapyrt

HTTP API for Scrapy spiders

Stars: ✭ 637 (+5208.33%)

Mutual labels: crawling

Vytal

Browser extension to spoof timezone, geolocation, locale and user agent.

Stars: ✭ 1,449 (+11975%)

Mutual labels: user-agent

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+42641.67%)

Mutual labels: crawling

Bugz

🐛 Composable User Agent Detection using Ramda

Stars: ✭ 15 (+25%)

Mutual labels: user-agent

Ferret

Declarative web scraping

Stars: ✭ 4,837 (+40208.33%)

Mutual labels: crawling

core

The complete web scraping toolkit for PHP.

Stars: ✭ 1,110 (+9150%)

Mutual labels: crawling

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+3566.67%)

Mutual labels: crawling

Kolpa

A fake data generator written in and for Go

Stars: ✭ 645 (+5275%)

Mutual labels: user-agent

Webster

a reliable high-level web crawling & scraping framework for Node.js.

Stars: ✭ 364 (+2933.33%)

Mutual labels: crawling

telegram-crawler

🕷 Automatically detect changes made to the official Telegram sites, clients and servers.

Stars: ✭ 84 (+600%)

Mutual labels: crawling

Sasila

一个灵活、友好的爬虫框架

Stars: ✭ 286 (+2283.33%)

Mutual labels: crawling

User agent

HTTP User Agent parser for the Go programming language.

Stars: ✭ 578 (+4716.67%)

Mutual labels: user-agent

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+2208.33%)

Mutual labels: crawling

podcastcrawler

PHP library to find podcasts

Stars: ✭ 40 (+233.33%)

Mutual labels: crawling

Spidy

The simple, easy to use command line web crawler.

Stars: ✭ 257 (+2041.67%)

Mutual labels: crawling

User Agents

A JavaScript library for generating random user agents with data that's updated daily.

Stars: ✭ 485 (+3941.67%)

Mutual labels: user-agent

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

Stars: ✭ 68 (+466.67%)

Mutual labels: crawling

crawlerdetect

Golang module to detect bots and crawlers via the user agent

Stars: ✭ 22 (+83.33%)

Mutual labels: user-agent

Curl

A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP, SCP, SFTP, SMB, SMBS, SMTP, SMTPS, TELNET and TFTP. libcurl offers a myriad of powerful features

Stars: ✭ 22,875 (+190525%)

Mutual labels: user-agent

haro

Haro is a modern immutable DataStore

Stars: ✭ 24 (+100%)

Mutual labels: in-memory-storage

react-ua

📱React User Agent Component, Hook, and HOC. SSR-ready, full UT, using new React Context and Hooks API

Stars: ✭ 18 (+50%)

Mutual labels: user-agent

user-agent

User-Agent parser for Clojure