167 open source projects that are alternatives to or similar to comp_thinking_social_science

actor-scraper
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Stars: ✭ 83 (+97.62%)
Mutual labels:  web-scraping
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1592.86%)
Mutual labels:  web-scraping
rreddit
𝐫⟋ Get Reddit data
Stars: ✭ 49 (+16.67%)
Mutual labels:  web-scraping
Neural-Scam-Artist
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
Stars: ✭ 18 (-57.14%)
Mutual labels:  web-scraping
sp-subway-scraper
🚆This web scraper builds a dataset for São Paulo subway operation status
Stars: ✭ 24 (-42.86%)
Mutual labels:  web-scraping
2018-2019
The GitHub repository containing all the material related to the Computational Thinking and Programming course of the Digital Humanities and Digital Knowledge degree at the University of Bologna (a.a. 2018/2019).
Stars: ✭ 29 (-30.95%)
Mutual labels:  digital-humanities
Data-Science-Resources
List 📋 of Books📚, Courses 💻 for Data Science 📊
Stars: ✭ 18 (-57.14%)
Mutual labels:  social-sciences
web-poet
Web scraping Page Objects core library
Stars: ✭ 67 (+59.52%)
Mutual labels:  web-scraping
heroshi
Heroshi – open source web crawler.
Stars: ✭ 51 (+21.43%)
Mutual labels:  web-scraping
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-21.43%)
Mutual labels:  social-sciences
iww
AI based web-wrapper for web-content-extraction
Stars: ✭ 61 (+45.24%)
Mutual labels:  web-scraping
2017-summer-workshop
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (-21.43%)
Mutual labels:  web-scraping
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-64.29%)
Mutual labels:  web-scraping
PythonScrapyBasicSetup
Basic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (+35.71%)
Mutual labels:  web-scraping
Linkedin-Client
Web scraper for grabbing data from LinkedIn profiles or company pages (personal project)
Stars: ✭ 42 (+0%)
Mutual labels:  web-scraping
Hi
A Programming language for Web Scraping
Stars: ✭ 14 (-66.67%)
Mutual labels:  web-scraping
tableau-scraping
Tableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (+116.67%)
Mutual labels:  web-scraping
npo classifier
Automated coding using machine-learning and remapping the U.S. nonprofit sector: A guide and benchmark
Stars: ✭ 18 (-57.14%)
grailer
web scraping tool for grailed.com
Stars: ✭ 30 (-28.57%)
Mutual labels:  web-scraping
UofT-Timetable-Generator
A web application that generates timetables for university students at the University of Toronto
Stars: ✭ 34 (-19.05%)
Mutual labels:  web-scraping
Intro-Cultural-Analytics
Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book
Stars: ✭ 137 (+226.19%)
Mutual labels:  digital-humanities
Scrape Linkedin Selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+469.05%)
Mutual labels:  web-scraping
cl-torrents
Searching torrents on popular trackers - CLI, readline, GUI, web client. Tutorial and binaries (issue tracker on https://gitlab.com/vindarel/cl-torrents/)
Stars: ✭ 83 (+97.62%)
Mutual labels:  web-scraping
Docbao
A tool for crawling and analyzing keywords from Vietnamese online news sites
Stars: ✭ 230 (+447.62%)
Mutual labels:  web-scraping
cummings.ee
A collection of the work of Edward Estlin Cummings, as it enters the public domain.
Stars: ✭ 32 (-23.81%)
Mutual labels:  digital-humanities
Selenium Python Helium
Selenium-python but lighter: Helium is the best Python library for web automation.
Stars: ✭ 2,732 (+6404.76%)
Mutual labels:  web-scraping
rymscraper
Python API to extract data from rateyourmusic.com.
Stars: ✭ 63 (+50%)
Mutual labels:  web-scraping
R Web Scraping Cheat Sheet
Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
Stars: ✭ 207 (+392.86%)
Mutual labels:  web-scraping
article-summary-deep-learning
📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!
Stars: ✭ 18 (-57.14%)
Mutual labels:  web-scraping
Bet On Sibyl
Machine Learning Model for Sport Predictions (Football, Basketball, Baseball, Hockey, Soccer & Tennis)
Stars: ✭ 190 (+352.38%)
Mutual labels:  web-scraping
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+26.19%)
Mutual labels:  web-scraping
scraping-ebay
Scraping Ebay's products using Scrapy Web Crawling Framework
Stars: ✭ 79 (+88.1%)
Mutual labels:  web-scraping
actor-content-checker
You can use this act to monitor any page's content and get a notification when content changes.
Stars: ✭ 16 (-61.9%)
Mutual labels:  web-scraping
Learnpythonforresearch
This repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (+288.1%)
Mutual labels:  web-scraping
TraduXio
A participative platform for cultural texts translators
Stars: ✭ 19 (-54.76%)
Mutual labels:  digital-humanities
Netflix Clone
Netflix like full-stack application with SPA client and backend implemented in service oriented architecture
Stars: ✭ 156 (+271.43%)
Mutual labels:  web-scraping
halfstaff
🇺🇸 Is the US flag at half-staff?
Stars: ✭ 22 (-47.62%)
Mutual labels:  web-scraping
Helena
A Chrome extension for writing custom web scraping programs and web automation programs. Just demonstrate how to collect the first row of data, then let the extension write the program for collecting all rows.
Stars: ✭ 151 (+259.52%)
Mutual labels:  web-scraping
TopicsExplorer
Explore your own text collection with a topic model – without prior knowledge.
Stars: ✭ 53 (+26.19%)
Mutual labels:  digital-humanities
Phpscraper
PHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (+252.38%)
Mutual labels:  web-scraping
India-WhatsAppFakeNews-Dataset
News articles about WhatsApp-related deaths, along with other articles from across India during that period
Stars: ✭ 41 (-2.38%)
Mutual labels:  web-scraping
Zillow
Zillow Scraper for Python using Selenium
Stars: ✭ 141 (+235.71%)
Mutual labels:  web-scraping
ham4corpus
Data from "Hamilton: An American Musical", formatted for reuse. See below for some interesting text analysis basic findings! I am not throwing away my stopword?
Stars: ✭ 53 (+26.19%)
Mutual labels:  digital-humanities
Actor Page Analyzer
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
Stars: ✭ 124 (+195.24%)
Mutual labels:  web-scraping
audiobooker
Audiobook scraper
Stars: ✭ 14 (-66.67%)
Mutual labels:  web-scraping
Ayakashi
⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (+178.57%)
Mutual labels:  web-scraping
TikTokDownloader PyWebIO
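🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin | TikTok data scraping tool that supports API calls and online batch parsing and downloading.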
Stars: ✭ 919 (+2088.1%)
Mutual labels:  web-scraping
Save For Offline
Android app for saving webpages for offline reading.
Stars: ✭ 114 (+171.43%)
Mutual labels:  web-scraping
twic
Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models
Stars: ✭ 51 (+21.43%)
Mutual labels:  digital-humanities
Rod
A Devtools driver for web automation and scraping
Stars: ✭ 1,392 (+3214.29%)
Mutual labels:  web-scraping
srqm
An introductory statistics course for social scientists, using Stata
Stars: ✭ 43 (+2.38%)
Mutual labels:  social-sciences
Sillynium
Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements
Stars: ✭ 100 (+138.1%)
Mutual labels:  web-scraping
investigation-amazon-brands
Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"
Stars: ✭ 56 (+33.33%)
Mutual labels:  web-scraping
saveddit
Bulk Downloader for Reddit
Stars: ✭ 130 (+209.52%)
Mutual labels:  web-scraping
Movie-Recommendation-System-with-Sentiment-Analysis
Content based movie recommendation system with sentiment analysis
Stars: ✭ 44 (+4.76%)
Mutual labels:  web-scraping
dvt
Distant Viewing Toolkit for the Analysis of Visual Culture
Stars: ✭ 57 (+35.71%)
Mutual labels:  digital-humanities
linkextractor
A Docker tutorial using a link extraction application example
Stars: ✭ 41 (-2.38%)
Mutual labels:  web-scraping
GSoC-Data-Analyser
Simple search for organisations participating/participated in the GSoC
Stars: ✭ 29 (-30.95%)
Mutual labels:  web-scraping
Node-js-functionalities
This repository contains useful RESTful APIs and functionality in Node.js, including tutorial code for mastering Node.js. All tutorials have been published on medium.com; the tutorials link is given below
Stars: ✭ 69 (+64.29%)
Mutual labels:  web-scraping
bechdel-test
Does your favorite film pass the test?
Stars: ✭ 25 (-40.48%)
Mutual labels:  digital-humanities