4m4n5 / the-seinfeld-chronicles

Licence: other
A dataset for textual analysis of arguably the best-written comedy television show ever.

Programming Languages

Jupyter Notebook

Projects that are alternatives to or similar to the-seinfeld-chronicles

AdflyUrlGrabber
A python script designed to grab the original url from an adfly url without opening it :D
Stars: ✭ 53 (+278.57%)
Mutual labels:  python-script
tech-seo-crawler
Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.
Stars: ✭ 57 (+307.14%)
Mutual labels:  crawling
videoslimmer
Utility to remove unwanted audio and subtitles from mkv files.
Stars: ✭ 23 (+64.29%)
Mutual labels:  python-script
scrape-github-trending
Tutorial for web scraping / crawling with Node.js.
Stars: ✭ 42 (+200%)
Mutual labels:  crawling
IpHack
Track Location With Live Address And City in Termux
Stars: ✭ 315 (+2150%)
Mutual labels:  python-script
dirbpy
This is the new version of dirb in python
Stars: ✭ 36 (+157.14%)
Mutual labels:  python-script
rpi3-wifi-conf
A simple Python script to configure wifi over bluetooth for a Raspberry Pi 3
Stars: ✭ 112 (+700%)
Mutual labels:  python-script
Github-Environment-Cleaner
An interactive script to clean up GitHub environments
Stars: ✭ 101 (+621.43%)
Mutual labels:  python-script
mal-analysis
github repo for MyAnimeList analysis. Also links to the MAL dataset.
Stars: ✭ 31 (+121.43%)
Mutual labels:  crawling
Smtp-cracker
[NEW] : Simple Mail Transfer Protocol (SMTP) CHECKER - CRACKER Tool V2
Stars: ✭ 67 (+378.57%)
Mutual labels:  python-script
core
The complete web scraping toolkit for PHP.
Stars: ✭ 1,110 (+7828.57%)
Mutual labels:  crawling
Airscript-ng
A python script to simplify the process of auditing wireless networks.
Stars: ✭ 83 (+492.86%)
Mutual labels:  python-script
socials
👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (+164.29%)
Mutual labels:  crawling
podcastcrawler
PHP library to find podcasts
Stars: ✭ 40 (+185.71%)
Mutual labels:  crawling
xXx dead xXx
b̶̡̪̬͒l̸̰̗̝̀ỏ̷̡̩g̴͇̑g̶̲̱̽͐i̵̹͗n̶̤̥͂̅̆g̴̮̾̅͜ ̷̧͎͆i̷̛͒͜͠n̸̥̺͒ ̶͚͚͊̿͜t̸̺͙̭̆̊̈́ḧ̶̟́̐e̸̱͔̟̓̓͝ ̶̨͔̾͛̑d̵̥̣̏ȧ̷̼̊r̷̰̝̥̅̌͝k̵̟̥̞̉̍͛
Stars: ✭ 19 (+35.71%)
Mutual labels:  crawling
GhostNET
GhostNET script that will help you be safer on the cyber
Stars: ✭ 45 (+221.43%)
Mutual labels:  python-script
OpenCVB
OpenCV .Net application supporting several RGBD cameras - Kinect, Intel RealSense, Luxonis Oak-D, Mynt Eye D 1000, and StereoLabs ZED 2
Stars: ✭ 60 (+328.57%)
Mutual labels:  python-script
Python-project-Scripts
This repositories contains a list of python scripts projects from beginner level advancing slowly. More code snippets to be added soon. feel free to clone this repo
Stars: ✭ 627 (+4378.57%)
Mutual labels:  python-script
Efficient-office
Alfred-Workflows,Vim,Script,Mac
Stars: ✭ 36 (+157.14%)
Mutual labels:  python-script
fa
Automation tool for locating symbols & structs in binary (primary IDA focused)
Stars: ✭ 58 (+314.29%)
Mutual labels:  python-script

Context

A dataset for people who love data science and Seinfeld.


Content

  • Details about all the episodes.
  • Attributes such as director, episode name, and air date.
  • Complete scripts of all the episodes.

An upcoming update will include:

  • Stage locations and cast
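
As a rough illustration of working with the episode details, here is a minimal Python sketch that groups episode titles by director. The column names (Title, Director, AirDate) and the sample rows are hypothetical placeholders, not taken from the dataset; adjust them to match the actual files.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample standing in for the episode-details file.
# The column names and rows are placeholders, not real dataset values.
SAMPLE_CSV = """Title,Director,AirDate
Placeholder Episode A,Director One,1990-01-01
Placeholder Episode B,Director Two,1990-01-08
Placeholder Episode C,Director One,1990-01-15
"""


def episodes_by_director(csv_text):
    """Group episode titles by the value of the (assumed) Director column."""
    grouped = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        grouped[row["Director"]].append(row["Title"])
    return dict(grouped)


if __name__ == "__main__":
    for director, titles in episodes_by_director(SAMPLE_CSV).items():
        print(director, len(titles), titles)
```

To run this against the real data, replace the in-memory sample with the contents of the episode-details file and rename the columns as needed.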

Data Source

The data is scraped from the fan website http://www.seinology.com/.


Possible Explorations

  • Train language models on the corpus.
  • Compare the vocabulary with other works of television, film, or literature.
  • Find correlations between language complexity and popularity.
  • Train models to generate scripts based on the data.
  • Analyze obscure words used in the vocabulary of the series.

These are just basic examples; the sky is the limit.
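
For the vocabulary ideas above, one possible starting point is sketched below in Python using only the standard library. It computes a type-token ratio (a crude proxy for language complexity) and lists words that appear only once. The helper names and the sample sentence are illustrative assumptions; the sample is placeholder text, not a line from the scripts.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def type_token_ratio(tokens):
    """Distinct words divided by total words; 0.0 for an empty list."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def rare_words(tokens, max_count=1):
    """Words occurring at most max_count times, sorted alphabetically."""
    counts = Counter(tokens)
    return sorted(word for word, n in counts.items() if n <= max_count)


if __name__ == "__main__":
    # Placeholder text standing in for a script excerpt.
    sample = "The cat saw the dog. The dog saw a bird."
    tokens = tokenize(sample)
    print("type-token ratio:", type_token_ratio(tokens))
    print("rare words:", rare_words(tokens))
```

The type-token ratio depends on text length, so when comparing episodes or other works it is safer to compute it over samples of equal size.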


Acknowledgements

The data has been crawled from the http://www.seinology.com/ website.


Contributing

Changes and suggestions for improvement are welcome. Feel free to comment with additions you think would be useful, or open a PR on the GitHub project.

Wanna buy me a coffee? paypal.me/AShrivastava961