Albert-W / crawler-project

Licence: other
Crawler project from the course "Google senior engineer explains the Go language in depth".

Programming Languages

Go, CSS, HTML, Makefile, Dockerfile, Assembly

Crawler-website

A crawler website written in Go.

[Image: main page]

[Image: home page]

Features

  • Go language
  • Docker
  • Elasticsearch
  • MVC pattern
  • Microservices
  • Three versions: single-task -> concurrent -> distributed
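
The step from the single-task crawler to the concurrent one can be illustrated with a small sketch. This is not the project's actual code: `fetchFunc`, `crawlSingle`, and `crawlConcurrent` are hypothetical names, and the fetcher is a fake that touches no network. It only shows the general pattern of fanning URLs out to a fixed number of goroutines over channels.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// fetchFunc is a hypothetical stand-in for "fetch and parse one page".
// It returns the items found on that page.
type fetchFunc func(url string) []string

// crawlSingle processes the URLs one at a time (single-task version).
func crawlSingle(urls []string, fetch fetchFunc) []string {
	var items []string
	for _, u := range urls {
		items = append(items, fetch(u)...)
	}
	return items
}

// crawlConcurrent hands the URLs to workerCount goroutines over a channel
// and gathers their results. It assumes workerCount >= 1.
func crawlConcurrent(urls []string, fetch fetchFunc, workerCount int) []string {
	in := make(chan string)
	out := make(chan []string)

	var wg sync.WaitGroup
	for i := 0; i < workerCount; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for u := range in {
				out <- fetch(u)
			}
		}()
	}

	// Feed the requests, then close the channel so the workers exit.
	go func() {
		for _, u := range urls {
			in <- u
		}
		close(in)
	}()

	// Close the result channel once every worker has finished.
	go func() {
		wg.Wait()
		close(out)
	}()

	var items []string
	for batch := range out {
		items = append(items, batch...)
	}
	// Completion order is nondeterministic, so sort for stable output.
	sort.Strings(items)
	return items
}

func main() {
	fake := func(url string) []string { return []string{"item from " + url} }
	urls := []string{"page-a", "page-b", "page-c"}
	fmt.Println(crawlSingle(urls, fake))
	fmt.Println(crawlConcurrent(urls, fake, 2))
}
```

In a distributed version, the call to `fetch` inside each worker would typically be replaced by a remote call to a worker process, while the channel structure stays the same.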

Installation and Go packages

  • Go
  • Docker
  • Elasticsearch
  • go get golang.org/x/text
  • go get -v github.com/gpmgo/gopm
  • gopm get -g -v golang.org/x/text
  • gopm get -g -v golang.org/x/net/html
  • go get gopkg.in/olivere/elastic.v5

Usage for Concurrent

  • Start Docker.
  • Run the script "docker run -d -p 9200:9200 elasticsearch".
  • Run "src/crawler/main.go" to start the crawler.
  • Run "src/crawler/frontend/starter.go" to view the results on the website.
  • Visit "http://localhost:8888/" in your browser.
  • Type in a query string in REST format, such as "女 && Age>20" (女 means "female").
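
Before starting the crawler, it can help to confirm that the Elasticsearch container is reachable on port 9200. The sketch below is an assumption-laden helper, not part of the project: `checkElastic` is a hypothetical name, and it assumes an Elasticsearch instance without authentication, whose root endpoint answers a plain GET with HTTP 200.

```go
package main

import (
	"fmt"
	"net/http"
	"time"
)

// checkElastic sends a GET request to the Elasticsearch root endpoint and
// reports an error if the server is unreachable or does not answer 200 OK.
// Hypothetical helper; assumes no authentication is configured.
func checkElastic(baseURL string) error {
	client := &http.Client{Timeout: 2 * time.Second}
	resp, err := client.Get(baseURL)
	if err != nil {
		return fmt.Errorf("elasticsearch not reachable: %w", err)
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("unexpected status from elasticsearch: %s", resp.Status)
	}
	return nil
}

func main() {
	// Port 9200 matches the "-p 9200:9200" mapping in the docker command.
	if err := checkElastic("http://localhost:9200/"); err != nil {
		fmt.Println("check failed:", err)
		return
	}
	fmt.Println("elasticsearch is up")
}
```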

Usage for Distributed

  • Start Docker.
  • Run the script "docker run -d -p 9200:9200 elasticsearch".
  • Open a terminal and execute: src\crawler_distributed\persist\server>go run ItemSaver.go --port=1234
  • Open a terminal and execute: src\crawler_distributed\worker\server>go run worker.go --port=9000
  • Open a terminal and execute: src\crawler_distributed\worker\server>go run worker.go --port=9001
  • Open a terminal and execute: src\crawler_distributed>go run main.go --itemsaver_host=":1234" --worker_hosts=":9000,:9001"
  • Run "src/crawler/frontend/starter.go" to view the results on the website.
  • Visit "http://localhost:8888/" in your browser.
  • Type in a query string in REST format, such as "男 && 已购车" (男 means "male", 已购车 means "owns a car").
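
The `--itemsaver_host` and `--worker_hosts` flags in the last command carry one address and a comma-separated list of addresses. How the project itself parses them is not shown here; the following is a minimal sketch with Go's standard `flag` package, where `config`, `parseArgs`, and `splitHosts` are hypothetical names.

```go
package main

import (
	"flag"
	"fmt"
	"strings"
)

// config holds the addresses passed on the command line (hypothetical type).
type config struct {
	itemSaverHost string
	workerHosts   []string
}

// splitHosts turns ":9000,:9001" into [":9000", ":9001"], skipping empty entries.
func splitHosts(list string) []string {
	var hosts []string
	for _, h := range strings.Split(list, ",") {
		if h = strings.TrimSpace(h); h != "" {
			hosts = append(hosts, h)
		}
	}
	return hosts
}

// parseArgs parses the two flags from args using its own FlagSet,
// so it can be called with any argument slice.
func parseArgs(args []string) (config, error) {
	fs := flag.NewFlagSet("crawler_distributed", flag.ContinueOnError)
	itemSaver := fs.String("itemsaver_host", "", "address of the item saver, e.g. :1234")
	workers := fs.String("worker_hosts", "", "comma-separated worker addresses, e.g. :9000,:9001")
	if err := fs.Parse(args); err != nil {
		return config{}, err
	}
	return config{itemSaverHost: *itemSaver, workerHosts: splitHosts(*workers)}, nil
}

func main() {
	cfg, err := parseArgs([]string{"--itemsaver_host=:1234", "--worker_hosts=:9000,:9001"})
	if err != nil {
		fmt.Println("bad flags:", err)
		return
	}
	fmt.Println("item saver:", cfg.itemSaverHost)
	fmt.Println("workers:", cfg.workerHosts)
}
```

Each worker address would then be used to open one client connection, so adding a third worker only means starting another `worker.go` process and appending its port to the list.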

Architecture

image

Framework

image

Algorithm

image
