All Projects → sumehta → Question Generation

sumehta / Question Generation

Licence: mit
Given a sentence automatically generate reading comprehension style factual questions from that sentence, such that the sentence contains answers to those questions.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Question Generation

News push project
Real Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers
Stars: ✭ 44 (-56%)
Mutual labels:  nlp-machine-learning
Nlp Paper
自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-33%)
Mutual labels:  nlp-machine-learning
Doc2vec
📓 Long(er) text representation and classification using Doc2Vec embeddings
Stars: ✭ 92 (-8%)
Mutual labels:  nlp-machine-learning
Text Classification Keras
📚 Text classification library with Keras
Stars: ✭ 53 (-47%)
Mutual labels:  nlp-machine-learning
Aiops platform
An Artificial Intelligence Platform for IT Operations.
Stars: ✭ 63 (-37%)
Mutual labels:  nlp-machine-learning
Russian news corpus
Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-24%)
Mutual labels:  nlp-machine-learning
Predicting Myers Briggs Type Indicator With Recurrent Neural Networks
Stars: ✭ 43 (-57%)
Mutual labels:  nlp-machine-learning
Wiki Split
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stars: ✭ 95 (-5%)
Mutual labels:  nlp-machine-learning
Intent classifier
Stars: ✭ 67 (-33%)
Mutual labels:  nlp-machine-learning
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-9%)
Mutual labels:  nlp-machine-learning
Wongnai Corpus
Collection of Wongnai's datasets
Stars: ✭ 57 (-43%)
Mutual labels:  nlp-machine-learning
How To Mine Newsfeed Data And Extract Interactive Insights In Python
A practical guide to topic mining and interactive visualizations
Stars: ✭ 61 (-39%)
Mutual labels:  nlp-machine-learning
Summarus
Models for automatic abstractive summarization
Stars: ✭ 83 (-17%)
Mutual labels:  nlp-machine-learning
Gec Pseudodata
Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Stars: ✭ 49 (-51%)
Mutual labels:  nlp-machine-learning
Datascience
It consists of examples, assignments discussed in data science course taken at algorithmica.
Stars: ✭ 92 (-8%)
Mutual labels:  nlp-machine-learning
Mitie chinese wikipedia corpus
Pre-trained Wikipedia corpus by MITIE
Stars: ✭ 43 (-57%)
Mutual labels:  nlp-machine-learning
Cracking The Da Vinci Code With Google Interview Problems And Nlp In Python
A guide on how to crack combinatorics puzzles shown in The Da Vinci Code movie using CS fundamentals and NLP
Stars: ✭ 75 (-25%)
Mutual labels:  nlp-machine-learning
Monkeylearn
⛔️ ARCHIVED ⛔️ 🐒 R package for text analysis with Monkeylearn 🐒
Stars: ✭ 95 (-5%)
Mutual labels:  nlp-machine-learning
Writeup Frontend
Beat Writer's Block with AI
Stars: ✭ 94 (-6%)
Mutual labels:  nlp-machine-learning
Text classification
Text Classification Algorithms: A Survey
Stars: ✭ 1,276 (+1176%)
Mutual labels:  nlp-machine-learning

Question Generation from Declarative Sentences

Update: If you have a feature request please open an issue or feel free to submit a pull request.

Introduction

Given a statement of text such as,

Handheld devices find ways to bolster U.S. homeland defense and response.

The questions generated will be:

What do handheld devices find?

The original software is a product of PhD thesis of Michael Heilman. The original JAVA code and other instructions/resources/dependencies can be found here.

For all Pythonistas 😎 out there this repository provides a Python wrapper to simplify the execution of the code above. The hardest part of the whole project is setup. Yes you heard it right!

Setup

To run the original code you need to have a JAVA Runtime Environment installed. JAVA downloads can be found on the Oracle website here. Version 1.6.0_07 of JAVA was used in developing the original system. The code is packaged for use on UNIX systems or for use in the Eclipse IDE.

Dependencies

The following are the dependencies of the original code.

-Apache Commons Lang (http://commons.apache.org/lang/)

-Apahce Commons Logging (http://commons.apache.org/logging/)

-JUnit (http://www.junit.org/)

-JWNL (http://sourceforge.net/projects/jwordnet/)

-Stanford NLP tools (http://www-nlp.stanford.edu/software/)

-WordNet (http://wordnet.princeton.edu/)

-The sst-light-0.4 release of the SuperSenseTagger, from which we used the SemCor data for training the supersense tagger (http://sourceforge.net/projects/supersensetag/)

-The Semcor corpus, used for training the supersense tagger (http://www.cse.unt.edu/~rada/downloads.html#semcor)

-The WEKA toolkit, version 3.6.0 (http://www.cs.waikato.ac.nz/ml/weka/)

The good news is that you don't need to download each one of those individually 😀. Everything is neatly packed in the QuestionGeneration.zip bundled with the code.

Steps

  1. Clone this repository,

    git clone https://github.com/sumehta/question-generation.git

  2. Unzip the QuestionGeneration.zip file

    unzip QuestionGeneration.zip

  3. [Optional] Start two servers to speed up the script, Stanford Parser server and the SST servers in two separate terminals.

    cd QuestionGeneration

    bash runStanfordParserServer.sh

    bash runSSTServer.sh

  4. Finally, to get a list of questions for a statement, from the parent directory execute this command

    python question.py -s 'Handheld devices find ways to bolster U.S. homeland defense and response'

For other options exposed by the script type,

python question.py --help

Note: For developers, I have also included a QuestionGenerator class, that exposes other methods for processing large collections.

Cite this repository

If you find this repository helpful and use it in a technical report or a research paper, please cite this repository.

  author = {Sneha Mehta},
  title = {Question Generation - Python Wrapper},
  year = {2017},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/sumehta/question-generation}},
  commit = {master}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].