All Projects → 18520339 → facebook-data-extraction

18520339 / facebook-data-extraction

Licence: other
Experiences in extracting data from Facebook with these 3 methods: Facebook Graph API, Automation tools, DevTools Console

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to facebook-data-extraction

crawler
Crawler with Python 3.
Stars: ✭ 34 (-58.02%)
Mutual labels:  facebook, selenium
facebook-ruby-ads-sdk
The Facebook Marketing API in Ruby.
Stars: ✭ 59 (-27.16%)
Mutual labels:  facebook, facebook-graph-api
facebook-node-sdk
Modeled from the (Facebook Javascript SDK), now with the facebook-node-sdk you can now easily write the same code and share between your server (nodejs) and the client (Facebook Javascript SDK).
Stars: ✭ 519 (+540.74%)
Mutual labels:  facebook, facebook-graph-api
Tor Browser Selenium
Tor Browser automation with Selenium.
Stars: ✭ 267 (+229.63%)
Mutual labels:  selenium, tor
Ultimate Facebook Scraper
🤖 A Software that automates your social media interactions to collect posts, photos, videos, interests, friends, followers, and much more on Facebook.
Stars: ✭ 2,356 (+2808.64%)
Mutual labels:  facebook, facebook-crawler
facebook-go-sdk
A very simple and easy-to-use Facebook SDK for Golang.
Stars: ✭ 18 (-77.78%)
Mutual labels:  facebook, facebook-graph-api
Facebook-Auto-Pilot
Automate common Facebook activities such as posting to groups and pages walls. Effortlessly post to multiple groups or pages.
Stars: ✭ 126 (+55.56%)
Mutual labels:  facebook, facebook-graph-api
Spam Bot 3000
Social media research and promotion, semi-autonomous CLI bot
Stars: ✭ 79 (-2.47%)
Mutual labels:  facebook, selenium
facebook-py-sdk
Facebook Python SDK
Stars: ✭ 15 (-81.48%)
Mutual labels:  facebook, facebook-graph-api
FacebookGraphAPI-Examples
Examples for facebook graph api for python
Stars: ✭ 38 (-53.09%)
Mutual labels:  facebook, facebook-graph-api
Memedensity
CLI tool to let you know amount of memes in facebook feed.
Stars: ✭ 44 (-45.68%)
Mutual labels:  facebook, selenium
facebook-cleaner
It is almost spring, so time for a pre spring cleaning. This time: taking care of your Facebook. This script can safe you a lot of time if you would try to do that by hand.
Stars: ✭ 52 (-35.8%)
Mutual labels:  facebook, selenium
dnevnik mos ru
Python module to get easy machine access to dnevnik.mos.ru
Stars: ✭ 21 (-74.07%)
Mutual labels:  selenium
throughout
🎪 End-to-end testing made simple (using Jest and Puppeteer)
Stars: ✭ 16 (-80.25%)
Mutual labels:  selenium
burnermap
BurnerMap: Find your pals on the playa. Tracking down friends at Burning Man used to be harder than cleaning playa dust out of gold-sequined booty shorts.
Stars: ✭ 23 (-71.6%)
Mutual labels:  facebook
resgen
Keep track of jobs you've applied to, automate resume & cover letter creation; generate PDFs from .odt templates on the fly while scraping the job post and tracking employer status.
Stars: ✭ 31 (-61.73%)
Mutual labels:  selenium
python-crawler
爬虫学习仓库,适合零基础的人学习,对新手比较友好
Stars: ✭ 37 (-54.32%)
Mutual labels:  selenium
haveno
Decentralized P2P exchange built on Monero and Tor
Stars: ✭ 542 (+569.14%)
Mutual labels:  tor
giulius-selenium-tests
A test harness that allows Selenium tests to be run using JUnit and test fixtures to be created and injected by a WebDriver-aware Guice
Stars: ✭ 12 (-85.19%)
Mutual labels:  selenium
torgo
A Golang library for Tor.
Stars: ✭ 36 (-55.56%)
Mutual labels:  tor

Summary of Facebook data extraction methods

I. General Comparison

Method Sign-in required Risk when sign-in Risk when not sign-in Difficulty Speed
1️⃣  Personal account Access Token + Graph API Access Token leaked, Rate Limits Not working Easy Fast
2️⃣  Automation tools + IP hiding techniques Depend (*) Checkpoint but less loading more failure Safest Hard Slow (**)
3️⃣  Run JS code directly at the DevTools Console Depend (*) Checkpoint but less loading more failure Can be banned if abused Medium Slow (**)
4️⃣  Mbasic Facebook + IP hiding techniques Depend (*) - - Hard -

(*) Depend on the tasks that you need to sign in to perform. Example: Tasks that need to access private groups or private posts, ...

(**) Depend on how much data you want to extract, the more the number, the more times for scrolling down to load the contents.

II. My general conclusion after many tries with different methods

  • When run at not sign-in state, Facebook usually redirects to the login page or prevent you from loading more comments / replies.
  • No matter which method you use, any fast or irregular activity continuously in sign-in state for a long time can be likely to get blocked at any time.
  • If you want to use at sign-in state, for safety, I recommend create a fake account (you can use a Temporary Email Address to create one) and use it for the extraction.
  • With the sign-in state, there's also another technique to limit the Checkpoint is to sign in with different Cookies.

III. DISCLAIMER

All information provided in this repo and related articles are for educational purposes only. So use at your own risk, I will not guarantee & not be responsible for any situations including:

  • Whether your Facebook account may get Checkpoint due to repeatedly or rapid actions.
  • Problems that may occur or for any abuse of the information or the code provided.
  • Problems about your privacy while using IP hiding techniques or any malicious scripts.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].