All Projects → kiankd → Events

kiankd / Events

Licence: gpl-3.0
Repository for *SEM Paper on Event Coreference Resolution in ECB+

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Events

Ml
A high-level machine learning and deep learning library for the PHP language.
Stars: ✭ 1,270 (+6250%)
Mutual labels:  natural-language-processing, clustering
Multilingual Latent Dirichlet Allocation Lda
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Stars: ✭ 64 (+220%)
Mutual labels:  natural-language-processing, clustering
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+5560%)
Mutual labels:  natural-language-processing, clustering
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (+355%)
Mutual labels:  natural-language-processing, clustering
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+7480%)
Mutual labels:  natural-language-processing, clustering
Newsrecommender
A news recommendation system tailored for user communities
Stars: ✭ 164 (+720%)
Mutual labels:  natural-language-processing, clustering
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+9240%)
Mutual labels:  natural-language-processing, clustering
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+2820%)
Mutual labels:  natural-language-processing, clustering
Covid 19 Bert Researchpapers Semantic Search
BERT semantic search engine for searching literature research papers for coronavirus covid-19 in google colab
Stars: ✭ 23 (+15%)
Mutual labels:  natural-language-processing
Spago
Self-contained Machine Learning and Natural Language Processing library in Go
Stars: ✭ 854 (+4170%)
Mutual labels:  natural-language-processing
Bagofconcepts
Python implementation of bag-of-concepts
Stars: ✭ 18 (-10%)
Mutual labels:  clustering
Spacy Transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stars: ✭ 919 (+4495%)
Mutual labels:  natural-language-processing
Pke
Python Keyphrase Extraction module
Stars: ✭ 855 (+4175%)
Mutual labels:  natural-language-processing
Nlp With Ruby
Curated List: Practical Natural Language Processing done in Ruby
Stars: ✭ 907 (+4435%)
Mutual labels:  natural-language-processing
Twitter Bot
👻 Markov chain-based Japanese twitter bot
Stars: ✭ 12 (-40%)
Mutual labels:  natural-language-processing
Kts linguistics
Spellcheck, phonetics, text processing and more
Stars: ✭ 18 (-10%)
Mutual labels:  natural-language-processing
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-10%)
Mutual labels:  natural-language-processing
Clustering
fast clustering algorithms
Stars: ✭ 14 (-30%)
Mutual labels:  clustering
Node Api.ai
[DEPRECATED] Ultimate Node.JS SDK for api.ai
Stars: ✭ 12 (-40%)
Mutual labels:  natural-language-processing
Syntree2vec
An algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-55%)
Mutual labels:  natural-language-processing

Events: Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization

Published in the conference STAR-SEM 2018. Link to paper is to be found here.

Startup

Run the startup script (bash startup.sh) to get started. It will pull and download all necessary repositories and datasets, including:

There are several Python package dependencies, including Theano, which is what we used for our experiments.

If you seek to implement our model, I would recommend a reimplementation in PyTorch or a more well-maintained deep learning library.

Relevant files

The predictions made by each model have been saved in results/, along with the gold standard coreference chains. After switching to the scripts directory (cd scripts/) you can do the following to replicate the results presented in the paper. For the within and cross-doc results:

bash get_scores.sh MODEL_NAME.response_conll

For just within-doc results:

bash get_scores.sh ecb_plus_events_test_mention_based_WITHINDOC_.key_conll  MODEL_NAME__within.response_conll

Python code

I do not currently have the time to document the Python code, but on request I can offer assistance over email. All of the code is found in python/. I would recommend reimplementation of the model if you seek to develop upon CORE. If you are interested primarily in the loss function and matrix derivation of CORE, check the file python/neural_cluster_model.py and the definition of the loss in the prepare_model function. Note that several of the files and functions are deprecated and were only used for preliminary experimentation.

Contact info

Contact Kian Kenyon-Dean at [email protected] (or, on github) for questions about this repository.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].