All Projects → johnafish → duree

johnafish / duree

Licence: other
Durée: the longest book ever written.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to duree

linguistics problems
Natural language processing in examples and games
Stars: ✭ 23 (-65.67%)
Mutual labels:  linguistics
mlconjug3
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (-29.85%)
Mutual labels:  linguistics
linguisticsdown
Easy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-64.18%)
Mutual labels:  linguistics
ngramr
R package to query the Google Ngram Viewer
Stars: ✭ 46 (-31.34%)
Mutual labels:  linguistics
expletives
Expletives vomiting library...
Stars: ✭ 12 (-82.09%)
Mutual labels:  linguistics
eliza-rs
A rust implementation of ELIZA - a natural language processing program developed by Joseph Weizenbaum in 1966.
Stars: ✭ 48 (-28.36%)
Mutual labels:  linguistics
Onset
A language evolution simulator, using realistic phonetic changes.
Stars: ✭ 30 (-55.22%)
Mutual labels:  linguistics
clinical nlp elastic
Clinical NLP Analysis with Elasticsearch and Kibana
Stars: ✭ 32 (-52.24%)
Mutual labels:  linguistics
libpalaso
Palaso Library: A set of .Net libraries useful for developers of Language Software.
Stars: ✭ 36 (-46.27%)
Mutual labels:  linguistics
lameta
The Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-73.13%)
Mutual labels:  linguistics
KoParadigm
KoParadigm: Korean Inflectional Paradigm Generator
Stars: ✭ 48 (-28.36%)
Mutual labels:  linguistics
verbecc
Complete Conjugation of any Verb using Machine Learning for French, Spanish, Portuguese, Italian and Romanian
Stars: ✭ 45 (-32.84%)
Mutual labels:  linguistics
NatLang
NatLang is an English parser with an extensible grammar
Stars: ✭ 20 (-70.15%)
Mutual labels:  linguistics
dev
PHOIBLE data and development.
Stars: ✭ 90 (+34.33%)
Mutual labels:  linguistics
neural-net-linguistics
Papers about NN and linguistics
Stars: ✭ 14 (-79.1%)
Mutual labels:  linguistics
corpusexplorer2.0
Korpuslinguistik war noch nie so einfach...
Stars: ✭ 16 (-76.12%)
Mutual labels:  linguistics
LangPad
A word processor/dictionary/generally useful tool for linguistics.
Stars: ✭ 20 (-70.15%)
Mutual labels:  linguistics
folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (-16.42%)
Mutual labels:  linguistics
TextDatasetCleaner
🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-59.7%)
Mutual labels:  linguistics
lingvo--Ner-ru
Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-43.28%)
Mutual labels:  linguistics

durée

Durée is the longest book of all time. There are fifteen volumes published, each of which is approximately 800,000 words long.

The book is written by a Python script (which was in turn written by John Fish). The script is available online and is open-source. It uses a list of English words and sentence structure rules to create grammatically correct but generally nonsensical sentences.

Ultimately, the book was inspired by one phrase (from Noam Chomsky):

Colorless green ideas sleep furiously

This phrase was given by Chomsky as a grammatically correct but semantically nonsensical sentence. To me, it had a certain playfulness to it which appealed to me. So, I decided to see if I could come up with a program to generate these sentences for me automatically and durée is the result of this experiment.

I didn't go for the longest book of all time because I believe that it deserves the title. I went for the longest book of all time because it's good clickbait, and also because the former book of all time is "The Blah Story" and once I saw that, I had to one-up it.

Truly, the longest written book of all time is Devta, with 11,206,310 words written over 33 years. Durée is but a mere publicity stunt designed to educate people about linguistics and get views on YouTube.

Now, I did print a copy of each of the fifteen volumes which means that I am responsible for over 12,000 pieces of paper which is about one and a half trees. I felt guilt about this (as should you, if you were to whimsically waste such an enormous amount of paper) and so made a donation to the Eden Reforestation Projects (https://edenprojects.org) where trees are planted in deforested areas by local citizens who are paid a fair wage. The cost to plant a tree according to them is "anywhere from $0.10 to $0.35 cents a tree" including employment, administration costs, etc. My donation will thus be responsible for the planting of orders of magnitude more trees than my silly project cut down.

If you wish, you can purchase an abridged copy of durée on Amazon (link coming soon). It's a fun coffee-table book.

Cover of book

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].