
hudsonbay / google_scraper_live_view

Licence: other

Application for extracting large amounts of data from the Google search results page


Labels

phoenix webscraping liveview


GoogleScraper

Overview

This is an application that extracts large amounts of data from the Google search results page, stores this data, and reports it back to users. It also adds an authentication layer, so you need to create a user and be authenticated before you can use it.

Disclaimer: This project is for educational purposes only. I'd suggest not deploying a project like this to your preferred hosting provider, since most of them will ban you for using their services for web crawling. Please make sure you have authorization from your hosting provider first.

How does it work

Authenticated users can upload a CSV file of keywords. The file can contain anywhere from 1 to 100 keywords. There's an example CSV file in this project so you can see the simple structure the file should follow. Basically, don't add headers 😄.

Each keyword in the uploaded file is used to run a search on https://www.google.com. The searches start as soon as the file is uploaded.
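The upload rules above can be sketched in a few lines. The project itself is written in Elixir; this Python snippet only illustrates the logic and is not the project's code. It assumes one keyword per row in the first column (the example CSV is not reproduced here, so that layout is a guess), and `parse_keywords` is a hypothetical name.

```python
import csv
import io

MAX_KEYWORDS = 100  # upper bound stated in this README


def parse_keywords(csv_text: str) -> list[str]:
    """Return the keywords of a header-less CSV, one keyword per row (assumed layout)."""
    rows = csv.reader(io.StringIO(csv_text))
    # Take the first cell of each non-empty row and trim surrounding whitespace.
    keywords = [row[0].strip() for row in rows if row and row[0].strip()]
    if not 1 <= len(keywords) <= MAX_KEYWORDS:
        raise ValueError(f"expected 1 to {MAX_KEYWORDS} keywords, got {len(keywords)}")
    return keywords
```

Calling `parse_keywords("elixir\nphoenix liveview\n")` returns the two keywords as a list, while an empty file or a file with more than 100 rows raises `ValueError`.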

For each keyword's Google results page, the following information is stored in the database:

Total number of AdWords advertisers on the page.
Total number of links (all of them) on the page.
Total number of search results for the keyword, e.g. "About 21,600,000 results (0.42 seconds)".
HTML code of the page (a cache of the page).

The cached HTML of the page is not shown in the UI because it is too large.
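As a rough illustration of how two of these values could be derived from a stored page, here is a Python sketch (the project itself uses Elixir, and this is not its code). `count_links` and `parse_total_results` are hypothetical helpers. Counting advertisers is left out because it depends on Google's current markup, which this README does not describe.

```python
import re
from html.parser import HTMLParser


class LinkCounter(HTMLParser):
    """Counts every <a> tag that carries an href attribute."""

    def __init__(self):
        super().__init__()
        self.links = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a" and any(name == "href" for name, _ in attrs):
            self.links += 1


def count_links(html: str) -> int:
    """Return the number of links found in the given HTML."""
    counter = LinkCounter()
    counter.feed(html)
    return counter.links


def parse_total_results(stats_text: str):
    """Extract the integer from text such as 'About 21,600,000 results (0.42 seconds)'."""
    match = re.search(r"(\d[\d,]*)\s+results", stats_text)
    return int(match.group(1).replace(",", "")) if match else None
```

With the example string from the list above, `parse_total_results` yields the integer 21600000; it returns `None` when the text contains no result count.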

Users can view the list of keywords they have uploaded. For each keyword, they can also view the search result information stored in the database.

Tech used

This project was made using the following technologies:

Elixir
Phoenix LiveView
Tailwind CSS
PostgreSQL

Google search limitations

Google prevents mass-searching of keywords. To work around this limitation, the application waits a random amount of time between 0 and 5 seconds before sending each keyword's request to Google.
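The idea of a random per-keyword delay can be sketched as follows. This is an illustrative Python version, not the project's Elixir implementation; `run_with_random_delay` and its `search` callback are hypothetical names, and the `sleep` and `rng` parameters exist only so the pause can be swapped out in tests.

```python
import random
import time

MAX_DELAY_SECONDS = 5.0  # upper bound stated in this README


def run_with_random_delay(keywords, search, sleep=time.sleep, rng=random.uniform):
    """Call search(keyword) for each keyword after a random pause of 0 to 5 seconds."""
    results = []
    for keyword in keywords:
        sleep(rng(0.0, MAX_DELAY_SECONDS))  # wait before every request
        results.append(search(keyword))
    return results
```

A uniform delay keeps the requests from arriving at a fixed rhythm. Whether that is enough to avoid being blocked is not something this sketch can guarantee.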

Set up the project

To start your Phoenix server:

Install dependencies with mix deps.get
Create and migrate your database with mix ecto.setup
Start Phoenix endpoint with mix phx.server or inside IEx with iex -S mix phx.server

Now you can visit localhost:4000 from your browser.


Stars: ✭ 17
