All Projects → darius → Languagetoys

darius / Languagetoys

Licence: mit
Random fun with statistical language models.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Languagetoys

Nndial
NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 332 (+426.98%)
Mutual labels:  natural-language-processing, natural-language-generation
Nlg Rl
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Stars: ✭ 59 (-6.35%)
Mutual labels:  natural-language-processing, natural-language-generation
Nlp Conference Compendium
Compendium of the resources available from top NLP conferences.
Stars: ✭ 349 (+453.97%)
Mutual labels:  natural-language-processing, natural-language-generation
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+87.3%)
Mutual labels:  natural-language-processing, natural-language-generation
Nlg Eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+1204.76%)
Mutual labels:  natural-language-processing, natural-language-generation
Natural Language Processing Specialization
This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera
Stars: ✭ 151 (+139.68%)
Mutual labels:  natural-language-processing, natural-language-generation
Practical Pytorch
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
Stars: ✭ 4,329 (+6771.43%)
Mutual labels:  natural-language-processing, natural-language-generation
Question generation
Neural question generation using transformers
Stars: ✭ 356 (+465.08%)
Mutual labels:  natural-language-processing, natural-language-generation
Pplm
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
Stars: ✭ 674 (+969.84%)
Mutual labels:  natural-language-processing, natural-language-generation
This Word Does Not Exist
This Word Does Not Exist
Stars: ✭ 640 (+915.87%)
Mutual labels:  natural-language-processing, natural-language-generation
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+88379.37%)
Mutual labels:  natural-language-processing, natural-language-generation
Ludwig
Data-centric declarative deep learning framework
Stars: ✭ 8,018 (+12626.98%)
Mutual labels:  natural-language-processing, natural-language-generation
Textaugmentation Gpt2
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Stars: ✭ 104 (+65.08%)
Mutual labels:  natural-language-processing, natural-language-generation
Gluon Nlp
NLP made easy
Stars: ✭ 2,344 (+3620.63%)
Mutual labels:  natural-language-processing, natural-language-generation
Gpt2
PyTorch Implementation of OpenAI GPT-2
Stars: ✭ 64 (+1.59%)
Mutual labels:  natural-language-processing, natural-language-generation
Rnnlg
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 487 (+673.02%)
Mutual labels:  natural-language-processing, natural-language-generation
Pqg Pytorch
Paraphrase Generation model using pair-wise discriminator loss
Stars: ✭ 33 (-47.62%)
Mutual labels:  natural-language-processing, natural-language-generation
Convai Baseline
ConvAI baseline solution
Stars: ✭ 49 (-22.22%)
Mutual labels:  natural-language-processing, natural-language-generation
Mindspore Nlp Tutorial
Natural Language Processing Tutorial for MindSpore Users
Stars: ✭ 58 (-7.94%)
Mutual labels:  natural-language-processing
Textblob Ar
Arabic support for textblob
Stars: ✭ 60 (-4.76%)
Mutual labels:  natural-language-processing

Currently here: a Markov random verse generator. There's sample output at https://gist.github.com/darius/2371822

To generate it: $ python verse.py sonnet # or limerick or other verse form it knows about

Currently missing: the data it works from. You need two files:

I'd like to add I don't normally publish code in such a crap state.

NOTE

Three files checked in to this repo (pdist.py, contractionmodel.bigram, and contractionmodel.unigram) are NOT copyright by me, Darius Bacon. They're derived from files at http://norvig.com/ngrams/ (but not identical to any of them). pdist.py by Peter Norvig is distributed under the MIT license. The two data files weren't originally included here, and should probably be removed, but I'm including them for the moment to make it easier to actually try out the code.

The remaining files are by me, and distributed under the MIT license.

Some other hacks thrown in here:

  • anagram.py generates multiword anagrams

  • bestpermutation.py helps to sort anagrams by quality (using n-gram statistics and brute force)

  • bibleanalyze.py breaks down the Gutenberg Project's KJ Bible into raw material for other hacks here

  • companynames.py generate random Web2.0 company names, along with a plausibility rating for each.

  • emvowel.py reverses disemvoweling

  • mnemonify.py tries to invent mnemonics like pi's "How I wish I could enumerate pi easily..."

  • portmanteau.py finds pairs of words that blend nicely, like book + hookup --> bookup

  • summarize.py generates chapter 'summaries' for a book, like http://wry.me/blog/2010/04/08/quantitative-tolkien-studies.html

  • textanalyze.py is a super-crude sentence segmenter

  • tohtml.py writes HTML that highlights words with increasing intensity the more unlikely they are according to a language model

  • verse.py described above

See also https://github.com/darius/amphigory for verse-making rewritten in Javascript.

Finally, there's https://github.com/darius/versecop -- I'm sorry I don't remember if it has worthwhile differences from the verse code in this repo.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].