All Projects → comp_thinking_social_science → Similar Projects or Alternatives

167 Open source projects that are alternatives of or similar to comp_thinking_social_science

House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.

Stars: ✭ 83 (+97.62%)

Mutual labels: web-scraping

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars: ✭ 711 (+1592.86%)

Mutual labels: web-scraping

rreddit

𝐫⟋ Get Reddit data

Stars: ✭ 49 (+16.67%)

Mutual labels: web-scraping

Neural-Scam-Artist

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.

Stars: ✭ 18 (-57.14%)

Mutual labels: web-scraping

sp-subway-scraper

🚆This web scraper builds a dataset for São Paulo subway operation status

Stars: ✭ 24 (-42.86%)

Mutual labels: web-scraping

2018-2019

The GitHub repository containing all the material related to the Computational Thinking and Programming course of the Digital Humanities and Digital Knowledge degree at the University of Bologna (a.a. 2018/2019).

Stars: ✭ 29 (-30.95%)

Mutual labels: digital-humanities

Data-Science-Resources

List 📋 of Books📚, Courses 💻 for Data Science 📊

Stars: ✭ 18 (-57.14%)

Mutual labels: social-sciences

web-poet

Web scraping Page Objects core library

Stars: ✭ 67 (+59.52%)

Mutual labels: web-scraping

heroshi

Heroshi – open source web crawler.

Stars: ✭ 51 (+21.43%)

Mutual labels: web-scraping

wikirepo

Python based Wikidata framework for easy dataframe extraction

Stars: ✭ 33 (-21.43%)

Mutual labels: social-sciences

iww

AI based web-wrapper for web-content-extraction

Stars: ✭ 61 (+45.24%)

Mutual labels: web-scraping

2017-summer-workshop

Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)

Stars: ✭ 33 (-21.43%)

Mutual labels: web-scraping

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-64.29%)

Mutual labels: web-scraping

PythonScrapyBasicSetup

Basic setup with random user agents and IP addresses for Python Scrapy Framework.

Stars: ✭ 57 (+35.71%)

Mutual labels: web-scraping

Linkedin-Client

Web scraper for grabing data from Linkedin profiles or company pages (personal project)

Stars: ✭ 42 (+0%)

Mutual labels: web-scraping

A Programming language for Web Scraping

Stars: ✭ 14 (-66.67%)

Mutual labels: web-scraping

tableau-scraping

Tableau scraper python library. R and Python scripts to scrape data from Tableau viz

Stars: ✭ 91 (+116.67%)

Mutual labels: web-scraping

npo classifier

Automated coding using machine-learning and remapping the U.S. nonprofit sector: A guide and benchmark

Stars: ✭ 18 (-57.14%)

Mutual labels: computational-social-science

grailer

web scraping tool for grailed.com

Stars: ✭ 30 (-28.57%)

Mutual labels: web-scraping

UofT-Timetable-Generator

A web application that generates timetables for university students at the University of Toronto

Stars: ✭ 34 (-19.05%)

Mutual labels: web-scraping

Intro-Cultural-Analytics

Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book

Stars: ✭ 137 (+226.19%)

Mutual labels: digital-humanities

Scrape Linkedin Selenium

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

Stars: ✭ 239 (+469.05%)

Mutual labels: web-scraping

cl-torrents

Searching torrents on popular trackers - CLI, readline, GUI, web client. Tutorial and binaries (issue tracker on https://gitlab.com/vindarel/cl-torrents/)

Stars: ✭ 83 (+97.62%)

Mutual labels: web-scraping

Docbao

Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam

Stars: ✭ 230 (+447.62%)

Mutual labels: web-scraping

cummings.ee

A collection of the work of Edward Estlin Cummings, as it enters the public domain.

Stars: ✭ 32 (-23.81%)

Mutual labels: digital-humanities

Selenium Python Helium

Selenium-python but lighter: Helium is the best Python library for web automation.

Stars: ✭ 2,732 (+6404.76%)

Mutual labels: web-scraping

rymscraper

Python API to extract data from rateyourmusic.com.

Stars: ✭ 63 (+50%)

Mutual labels: web-scraping

R Web Scraping Cheat Sheet

Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.

Stars: ✭ 207 (+392.86%)

Mutual labels: web-scraping

article-summary-deep-learning

📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!

Stars: ✭ 18 (-57.14%)

Mutual labels: web-scraping

Bet On Sibyl

Machine Learning Model for Sport Predictions (Football, Basketball, Baseball, Hockey, Soccer & Tennis)

Stars: ✭ 190 (+352.38%)

Mutual labels: web-scraping

selectorlib

A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them

Stars: ✭ 53 (+26.19%)

Mutual labels: web-scraping

scraping-ebay

Scraping Ebay's products using Scrapy Web Crawling Framework

Stars: ✭ 79 (+88.1%)

Mutual labels: web-scraping

actor-content-checker

You can use this act to monitor any page's content and get a notification when content changes.

Stars: ✭ 16 (-61.9%)

Mutual labels: web-scraping

Learnpythonforresearch

This repository provides everything you need to get started with Python for (social science) research.

Stars: ✭ 163 (+288.1%)

Mutual labels: web-scraping

TraduXio

A participative platform for cultural texts translators

Stars: ✭ 19 (-54.76%)

Mutual labels: digital-humanities

Netflix Clone

Netflix like full-stack application with SPA client and backend implemented in service oriented architecture

Stars: ✭ 156 (+271.43%)

Mutual labels: web-scraping

halfstaff

🇺🇸 Is the US flag at half-staff?

Stars: ✭ 22 (-47.62%)

Mutual labels: web-scraping

Helena

A Chrome extension for writing custom web scraping programs and web automation programs. Just demonstrate how to collect the first row of data, then let the extension write the program for collecting all rows.

Stars: ✭ 151 (+259.52%)

Mutual labels: web-scraping

TopicsExplorer

Explore your own text collection with a topic model – without prior knowledge.

Stars: ✭ 53 (+26.19%)

Mutual labels: digital-humanities

Phpscraper

PHP Scraper - an highly opinionated web-interface for PHP

Stars: ✭ 148 (+252.38%)

Mutual labels: web-scraping

India-WhatsAppFakeNews-Dataset

WhatsApps related deaths News Articles along with other articles across India during that period

Stars: ✭ 41 (-2.38%)

Mutual labels: web-scraping

Zillow

Zillow Scraper for Python using Selenium

Stars: ✭ 141 (+235.71%)

Mutual labels: web-scraping

ham4corpus

Data from "Hamilton: An American Musical", formatted for reuse. See below for some interesting text analysis basic findings! I am not throwing away my stopword?

Stars: ✭ 53 (+26.19%)

Mutual labels: digital-humanities

Actor Page Analyzer

Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.

Stars: ✭ 124 (+195.24%)

Mutual labels: web-scraping

audiobooker

Audio Book scrapper

Stars: ✭ 14 (-66.67%)

Mutual labels: web-scraping

Ayakashi

⚡️ Ayakashi.io - The next generation web scraping framework

Stars: ✭ 117 (+178.57%)

Mutual labels: web-scraping

TikTokDownloader PyWebIO

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具，支持API调用，在线批量解析及下载。

Stars: ✭ 919 (+2088.1%)

Mutual labels: web-scraping

Save For Offline

Android app for saving webpages for offline reading.

Stars: ✭ 114 (+171.43%)

Mutual labels: web-scraping

twic

Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models

Stars: ✭ 51 (+21.43%)

Mutual labels: digital-humanities

Rod

A Devtools driver for web automation and scraping

Stars: ✭ 1,392 (+3214.29%)

Mutual labels: web-scraping

srqm

An introductory statistics course for social scientists, using Stata

Stars: ✭ 43 (+2.38%)

Mutual labels: social-sciences

Sillynium

Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements

Stars: ✭ 100 (+138.1%)

Mutual labels: web-scraping

investigation-amazon-brands

Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"

Stars: ✭ 56 (+33.33%)

Mutual labels: web-scraping

saveddit

Bulk Downloader for Reddit

Stars: ✭ 130 (+209.52%)

Mutual labels: web-scraping

Movie-Recommendation-System-with-Sentiment-Analysis

Content based movie recommendation system with sentiment analysis

Stars: ✭ 44 (+4.76%)

Mutual labels: web-scraping

dvt

Distant Viewing Toolkit for the Analysis of Visual Culture

Stars: ✭ 57 (+35.71%)

Mutual labels: digital-humanities

linkextractor

A Docker tutorial using a link extraction application example

Stars: ✭ 41 (-2.38%)

Mutual labels: web-scraping

GSoC-Data-Analyser

Simple search for organisations participating/participated in the GSoC

Stars: ✭ 29 (-30.95%)

Mutual labels: web-scraping

Node-js-functionalities

This repository contains very useful restful API's and functionalities in node-js containing many important tutorial code for mastering node-js, all tutorials have been published on medium.com, tutorials link is given below

Stars: ✭ 69 (+64.29%)

Mutual labels: web-scraping

bechdel-test

Does your favorite film pass the test?

Stars: ✭ 25 (-40.48%)

Mutual labels: digital-humanities

61-120 of 167 similar projects

‹

›