eracle / Linkedin

License: GPL-3.0
LinkedIn scraper using Selenium WebDriver, headless Chromium, Docker, and Scrapy

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Linkedin

Seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (-62.14%)
Mutual labels:  scraper, scrapy, scraping, selenium-webdriver
Scrape Linkedin Selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (-22.65%)
Mutual labels:  linkedin, scraper, scraping, selenium-webdriver
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Stars: ✭ 1,024 (+231.39%)
Mutual labels:  scraper, scrapy, scraping
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-92.88%)
Mutual labels:  scraper, scraping, scrapy
Email Extractor
The main functionality is to extract all the emails from one or several URLs.
Stars: ✭ 81 (-73.79%)
Mutual labels:  scraper, scrapy, scraping
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-44.66%)
Mutual labels:  linkedin, scraper, scraping
linkedinscraper
LinkedinScraper is another information-gathering tool written in Python. You can scrape the employees of companies on Linkedin.com and collect their names, titles, and emails.
Stars: ✭ 22 (-92.88%)
Mutual labels:  scraper, linkedin
linky
Yet Another LinkedIn Scraper...
Stars: ✭ 44 (-85.76%)
Mutual labels:  scraper, linkedin
whatsapp-tracking
Scraping the status of WhatsApp contacts
Stars: ✭ 49 (-84.14%)
Mutual labels:  scraper, scraping
Zeiver
A Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-95.47%)
Mutual labels:  scraper, scraping
Scraper-Projects
🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-91.91%)
Mutual labels:  scraper, scraping
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-95.15%)
Mutual labels:  scraper, scraping
SocialInfo4J
fetch data from Facebook, Instagram and LinkedIn
Stars: ✭ 44 (-85.76%)
Mutual labels:  scraper, linkedin
scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
Stars: ✭ 317 (+2.59%)
Mutual labels:  scraping, scrapy
Socialmanagertools Gui
🤖 👻 Desktop application for Instagram Bot, Twitter Bot and Facebook Bot
Stars: ✭ 293 (-5.18%)
Mutual labels:  bot, scraper
kick-off-web-scraping-python-selenium-beautifulsoup
A tutorial-based introduction to web scraping with Python.
Stars: ✭ 18 (-94.17%)
Mutual labels:  scraper, selenium-webdriver
TorScrapper
A Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (-92.23%)
Mutual labels:  scraper, scraping
policy-data-analyzer
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (-92.88%)
Mutual labels:  scraping, scrapy
scraper
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
Stars: ✭ 37 (-88.03%)
Mutual labels:  scraper, scraping
bots-zoo
No description or website provided.
Stars: ✭ 59 (-80.91%)
Mutual labels:  scraper, scraping

Built with Python3 and Selenium.

Linkedin Automation:

Uses Scrapy, Selenium WebDriver, headless Chromium, Docker, and Python 3.

Linkedin spider:

The first spider aims to visit as many LinkedIn user pages as possible :-D. The objective is to gain visibility for your account, since LinkedIn notifies users when someone visits their page.

Companies spider:

This spider aims to collect all the users working for a given company on LinkedIn.

  1. It goes to the company's front page;
  2. Clicks on the "See all 1M employees" button;
  3. Starts collecting User-related Scrapy items.
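The collection step above can be sketched in plain Python. The helper name and the link filtering below are illustrative assumptions, not the project's actual code; in the real spider, a remote Selenium-driven browser supplies the anchors from the employees page:

```python
def collect_profile_urls(anchor_hrefs, base="https://www.linkedin.com"):
    """Keep only employee profile links ('/in/<slug>') from a page's anchors,
    dropping query strings and duplicates while preserving order."""
    seen = set()
    urls = []
    for href in anchor_hrefs:
        if href and href.startswith("/in/"):
            url = base + href.split("?")[0]
            if url not in seen:
                seen.add(url)
                urls.append(url)
    return urls

# Hypothetical anchors, as a Selenium spider might scrape them from the
# "See all employees" page.
hrefs = ["/in/jane-doe?miniProfile=1", "/feed/", "/in/jane-doe", "/in/john-roe"]
print(collect_profile_urls(hrefs))
```

Deduplicating at this stage keeps the spider from scheduling the same profile page twice when LinkedIn renders the same employee in multiple page sections.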

Install

Needed:

  • docker;
  • docker-compose;
  • VNC viewer, like vinagre (ubuntu);
  • python3.6;
  • virtualenv.
0. Prepare your environment:

Install Docker from the official website: https://www.docker.com/

Install a VNC viewer if you do not have one. On Ubuntu, go for vinagre:

    sudo apt-get update
    sudo apt-get install vinagre
1. Set up Linkedin login and password:

Copy conf_template.py to conf.py and fill in the quotes with your credentials.
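The resulting conf.py is expected to look roughly like this. The variable names are an assumption based on the description above; check conf_template.py in the repository for the actual ones:

```python
# conf.py -- copied from conf_template.py.
# Hypothetical field names; never commit real credentials to version control.
EMAIL = "your-linkedin-email@example.com"
PASSWORD = "your-linkedin-password"
```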

2. Run and build containers with docker-compose:

This runs only the random LinkedIn spider, not the companies spider. Open your terminal, move to the project folder, and type:

    docker-compose up -d --build
3. Take a look at the browser's activity:

Open vinagre and connect to address localhost, port 5900. The password is secret. Or, from the terminal:

    vinagre localhost:5900

or

    make view
4. Stop the scraper:

In the same terminal window, type:

    docker-compose down
Test & Development:

Set up your Python virtual environment (trivial but mandatory):

    virtualenv -p python3.6 .venv
    source .venv/bin/activate
    pip install -r requirements.txt

Create the Selenium server, open the VNC window, and launch the tests by typing these in three different terminals in the project folder:

    make dev
    make view
    make tests

For more details, have a look at the Makefile (here it is used as a shortcut, not for building).

  • Development:
    scrapy crawl companies -a selenium_hostname=localhost -o output.csv

or

    scrapy crawl random -a selenium_hostname=localhost -o output.csv

or

    scrapy crawl byname -a selenium_hostname=localhost -o output.csv
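Each crawl writes its scraped items to output.csv. A quick way to inspect the result with the standard library is sketched below; the field names and rows are hypothetical, since the actual columns depend on the project's Scrapy item definitions:

```python
import csv

# Hypothetical rows standing in for what a crawl might export.
rows = [
    {"name": "Jane Doe", "title": "Data Engineer"},
    {"name": "John Roe", "title": "Recruiter"},
]

# Write a CSV shaped like the spiders' -o output.csv...
with open("output.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["name", "title"])
    writer.writeheader()
    writer.writerows(rows)

# ...then read it back and list the scraped names.
with open("output.csv", newline="") as f:
    print([row["name"] for row in csv.DictReader(f)])
```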

Legal

This code is in no way affiliated with, authorized, maintained, sponsored or endorsed by Linkedin or any of its affiliates or subsidiaries. This is an independent and unofficial project. Use at your own risk.

This project violates LinkedIn's User Agreement (Section 8.2), and because of this, LinkedIn may (and will) temporarily or permanently ban your account. We are not responsible for your account being banned.