cummings.eeA collection of the work of Edward Estlin Cummings, as it enters the public domain.
Stars: ✭ 32 (+146.15%)
twicTopic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models
Stars: ✭ 51 (+292.31%)
etymology-dbAn open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
Stars: ✭ 20 (+53.85%)
wiki从diy行为艺术到diy苏格拉底式对话,从diy一个仪式到diy一次旷课,各种活动指南的百科。diy💔是706孵化的一个非代码开源项目。
Stars: ✭ 49 (+276.92%)
booknlpBookNLP, a natural language processing pipeline for books
Stars: ✭ 636 (+4792.31%)
TraduXioA participative platform for cultural texts translators
Stars: ✭ 19 (+46.15%)
textboxText collections made available by the CLiGS group.
Stars: ✭ 19 (+46.15%)
TopicsExplorerExplore your own text collection with a topic model – without prior knowledge.
Stars: ✭ 53 (+307.69%)
ham4corpusData from "Hamilton: An American Musical", formatted for reuse. See below for some interesting text analysis basic findings! I am not throwing away my stopword?
Stars: ✭ 53 (+307.69%)
bechdel-testDoes your favorite film pass the test?
Stars: ✭ 25 (+92.31%)
scholarlyRetrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
Stars: ✭ 761 (+5753.85%)