All Projects → wusuopubupt → Go_spider

wusuopubupt / Go_spider

A golang spider

Programming Languages

go
31211 projects - #10 most used programming language
golang
3204 projects

Projects that are alternatives of or similar to Go spider

Gotools
create some tools use go lang.
Stars: ✭ 54 (+116%)
Mutual labels:  goroutine, spider
Zhihu Crawler
zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目
Stars: ✭ 890 (+3460%)
Mutual labels:  spider
Icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (+2416%)
Mutual labels:  spider
Crawler
A high performance web crawler in Elixir.
Stars: ✭ 781 (+3024%)
Mutual labels:  spider
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+2524%)
Mutual labels:  spider
Gospider
Gospider - Fast web spider written in Go
Stars: ✭ 785 (+3040%)
Mutual labels:  spider
Grpool
Lightweight Goroutine pool
Stars: ✭ 616 (+2364%)
Mutual labels:  goroutine
Mailinglistscraper
A python web scraper for public email lists.
Stars: ✭ 19 (-24%)
Mutual labels:  spider
Seeker
Seeker - another job board aggregator.
Stars: ✭ 16 (-36%)
Mutual labels:  spider
Creeper
🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (+2948%)
Mutual labels:  spider
Bilibili Api
哔哩哔哩的API调用模块
Stars: ✭ 704 (+2716%)
Mutual labels:  spider
Oneblog
👽 OneBlog,一个简洁美观、功能强大并且自适应的Java博客
Stars: ✭ 678 (+2612%)
Mutual labels:  spider
Torbot
Dark Web OSINT Tool
Stars: ✭ 821 (+3184%)
Mutual labels:  spider
Darknet chinesetrading
🚇暗网中文网监控爬虫(DEEPMIX)
Stars: ✭ 649 (+2496%)
Mutual labels:  spider
Javlibrary
Javlibrary spider
Stars: ✭ 17 (-32%)
Mutual labels:  spider
Istock
👉一个基于spring boot 实现的java股票爬虫(仅支持A股),如果你❤️请⭐️ . V2升级版正在开发中!
Stars: ✭ 622 (+2388%)
Mutual labels:  spider
Querido Diario
📰 Brazilian government gazettes, accessible to everyone.
Stars: ✭ 681 (+2624%)
Mutual labels:  spider
Funpyspidersearchengine
Word2vec 千人千面 个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索
Stars: ✭ 782 (+3028%)
Mutual labels:  spider
Scrapit
Scraping scripts for various websites.
Stars: ✭ 25 (+0%)
Mutual labels:  spider
Baiduyunspider
百度云网盘搜索引擎,包含爬虫 & 网站
Stars: ✭ 903 (+3512%)
Mutual labels:  spider

Build Status

go_spider

A golang spider created by Dash Wang, 2015-11-25

目录说明(cd $GOPATH/src/github.com/wusuopubupt/go_spider && tree):

├── conf(配置文件目录)
│   ├── example.xml
│   ├── logconf.xml
│   └── spider.conf
├── data(数据目录)
│   └── url.data
├── log(日志目录)
│   ├── mini_spider.log
│   └── mini_spider.wf.log
├── output(输出文件目录)
├── readme.txt
└── src(核心代码目录)
    ├── conf
    │   ├── config.go
    │   └── config_test.go
    ├── downloader
    │   ├── downloader.go
    │   └── downloader_test.go
    ├── main
    │   ├── main
    │   └── main.go
    ├── spider
    │   ├── spider.go
    │   └── spider_test.go
    ├── utils
    │   └── utils.go
    └── vendor(第三方包目录)
        ├── github.com
        │   └── alecthomas
        │       └── log4go
        │           ├── LICENSE
        │           ├── README
        │           ├── config.go
        │           ├── filelog.go
        │           ├── log4go.go
        │           ├── pattlog.go
        │           ├── socklog.go
        │           ├── termlog.go
        │           └── wrapper.go
        ├── golang.org
        │   └── x
        │       └── net
        │           ├── LICENSE
        │           ├── PATENTS
        │           └── html
        │               ├── atom
        │               │   ├── atom.go
        │               │   ├── gen.go
        │               │   └── table.go
        │               ├── const.go
        │               ├── doc.go
        │               ├── doctype.go
        │               ├── entity.go
        │               ├── escape.go
        │               ├── foreign.go
        │               ├── node.go
        │               ├── parse.go
        │               ├── render.go
        │               └── token.go
        ├── gopkg.in
        │   ├── gcfg.v1
        │   │   ├── LICENSE
        │   │   ├── README
        │   │   ├── doc.go
        │   │   ├── errors.go
        │   │   ├── go1_0.go
        │   │   ├── go1_2.go
        │   │   ├── read.go
        │   │   ├── scanner
        │   │   │   ├── errors.go
        │   │   │   └── scanner.go
        │   │   ├── set.go
        │   │   ├── token
        │   │   │   ├── position.go
        │   │   │   ├── serialize.go
        │   │   │   └── token.go
        │   │   └── types
        │   │       ├── bool.go
        │   │       ├── doc.go
        │   │       ├── enum.go
        │   │       ├── int.go
        │   │       └── scan.go
        │   └── warnings.v0
        │       ├── LICENSE
        │       ├── README
        │       └── warnings.go
        └── vendor.json
        

运行:

cd $GOPATH/src/github.com/wusuopubupt/go_spider/src/main && go run main.go
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].