All Projects → bradlindblad → schrutepy

bradlindblad / schrutepy

Licence: MIT license
The Entire Transcript from the Office in Tidy Format

Programming Languages

python
139335 projects - #7 most used programming language
r
7636 projects

Projects that are alternatives of or similar to schrutepy

OpenPrompt
An Open-Source Framework for Prompt-Learning.
Stars: ✭ 1,769 (+7940.91%)
Mutual labels:  nlp-library, nlp-machine-learning
Lingua
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+1450%)
Mutual labels:  nlp-library, nlp-machine-learning
NLP-Natural-Language-Processing
Projects and useful articles / links
Stars: ✭ 149 (+577.27%)
Mutual labels:  nlp-library, nlp-machine-learning
empythy
Automated NLP sentiment predictions- batteries included, or use your own data
Stars: ✭ 17 (-22.73%)
Mutual labels:  nlp-library, nlp-machine-learning
Naive-Bayes-Evening-Workshop
Companion code for Introduction to Python for Data Science: Coding the Naive Bayes Algorithm evening workshop
Stars: ✭ 23 (+4.55%)
Mutual labels:  datascience, nlp-machine-learning
TextFeatureSelection
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (+90.91%)
Mutual labels:  nlp-library, nlp-machine-learning
Contextualized Topic Models
A python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+1345.45%)
Mutual labels:  nlp-library, nlp-machine-learning
mlconjug3
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (+113.64%)
Mutual labels:  nlp-library, nlp-machine-learning
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+722.73%)
Mutual labels:  nlp-library, nlp-machine-learning
Lingo
package lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (+413.64%)
Mutual labels:  nlp-library, nlp-machine-learning
Nuts
自然语言处理常见任务(主要包括文本分类,序列标注,自动问答等)解决方案试验田
Stars: ✭ 21 (-4.55%)
Mutual labels:  nlp-library, nlp-machine-learning
Melusine
Melusine is a high-level library for emails classification and feature extraction "dédiée aux courriels français".
Stars: ✭ 222 (+909.09%)
Mutual labels:  datascience, nlp-machine-learning
Tika Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+4431.82%)
Mutual labels:  nlp-library, nlp-machine-learning
vlainic.github.io
My GitHub blog: things you might be interested, and probably not...
Stars: ✭ 26 (+18.18%)
Mutual labels:  datascience, nlp-machine-learning
Machine-learning
This repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (+22.73%)
Mutual labels:  datascience, nlp-machine-learning
Tech.ml.dataset
A Clojure high performance data processing system
Stars: ✭ 205 (+831.82%)
Mutual labels:  datascience
ml-book
Codice sorgente ed Errata Corrige del mio libro "A tu per tu col Machine Learning"
Stars: ✭ 16 (-27.27%)
Mutual labels:  datascience
Climate Change Data
🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (+786.36%)
Mutual labels:  datascience
Emotion Classification From Audio Files
Understanding emotions from audio files using neural networks and multiple datasets.
Stars: ✭ 189 (+759.09%)
Mutual labels:  datascience
spaczz
Fuzzy matching and more functionality for spaCy.
Stars: ✭ 215 (+877.27%)
Mutual labels:  nlp-library

logo


The Entire Transcript from the Office in Tidy Format

PyPI version Coverage Status PyUp Black Black

Also available in R: schrute package

schrute R package

What is it

The entire text transcripts from the American version of The Office TV show in pandas dataframe. Use this package to practice or learn NLP, text analysis or deep learning.

Getting started

You can install easily from PyPi

Install

pip install schrutepy

Usage

Pull the transcripts into a data frame with this library's only method:

from schrutepy import schrutepy

df = schrutepy.load_schrute()

df.head(5)

Demo

View the full demo on the website: technistema

Contributors

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].