All Projects → imonban → RadiologyReportEmbedding

imonban / RadiologyReportEmbedding

Licence: other
Intelligent Word Embeddings of Free-Text Radiology Reports

Projects that are alternatives of or similar to RadiologyReportEmbedding

sentiment-analysis-of-tweets-in-russian
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
Stars: ✭ 51 (+131.82%)
Mutual labels:  embeddings, nlp-machine-learning
word2vec-tsne
Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Stars: ✭ 59 (+168.18%)
Mutual labels:  embeddings, nlp-machine-learning
ar-embeddings
Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec
Stars: ✭ 83 (+277.27%)
Mutual labels:  embeddings, word2vec-model
Entity Embedding
Reference implementation of the paper "Word Embeddings for Entity-annotated Texts"
Stars: ✭ 19 (-13.64%)
Mutual labels:  embeddings, nlp-machine-learning
Datastories Semeval2017 Task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (+736.36%)
Mutual labels:  embeddings, nlp-machine-learning
SentimentAnalysis
Sentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (+45.45%)
Mutual labels:  embeddings, nlp-machine-learning
datastories-semeval2017-task6
Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-9.09%)
Mutual labels:  embeddings, nlp-machine-learning
Contextualized Topic Models
A python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+1345.45%)
Mutual labels:  embeddings, nlp-machine-learning
Deepehr
Chronic Disease Prediction Using Medical Notes
Stars: ✭ 220 (+900%)
Mutual labels:  embeddings, nlp-machine-learning
DeepLearningReading
Deep Learning and Machine Learning mini-projects. Current Project: Deepmind Attentive Reader (rc-data)
Stars: ✭ 78 (+254.55%)
Mutual labels:  embeddings, nlp-machine-learning
CS224NHomeworks
CS224N 2019 Homeworks
Stars: ✭ 18 (-18.18%)
Mutual labels:  nlp-machine-learning
openmrs-module-radiology
🏥 OpenMRS Radiology Module
Stars: ✭ 37 (+68.18%)
Mutual labels:  radiology
vlainic.github.io
My GitHub blog: things you might be interested, and probably not...
Stars: ✭ 26 (+18.18%)
Mutual labels:  nlp-machine-learning
eye-tracker-setup
👀 Tobii Eye Tracker 4C Setup
Stars: ✭ 24 (+9.09%)
Mutual labels:  radiology
Question-Answering-based-on-SQuAD
Question Answering System using BiDAF Model on SQuAD v2.0
Stars: ✭ 20 (-9.09%)
Mutual labels:  nlp-machine-learning
mlconjug3
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (+113.64%)
Mutual labels:  nlp-machine-learning
CaRE
EMNLP 2019: CaRe: Open Knowledge Graph Embeddings
Stars: ✭ 34 (+54.55%)
Mutual labels:  embeddings
Conditional-SeqGAN-Tensorflow
Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow
Stars: ✭ 47 (+113.64%)
Mutual labels:  nlp-machine-learning
nlp classification workshop
NLP Classification Workshop
Stars: ✭ 22 (+0%)
Mutual labels:  nlp-machine-learning
Machine-learning
This repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (+22.73%)
Mutual labels:  nlp-machine-learning

RadiologyReportEmbedding

Paper in AMIA 2017: Banerjee, Imon, Sriraman Madhavan, Roger Eric Goldman, and Daniel L. Rubin. "Intelligent Word Embeddings of Free-Text Radiology Reports." In AMIA Annual Symposium Proceedings, vol. 2017, p. 411. American Medical Informatics Association, 2017. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5977573/

A hybrid strategy that combines semantic-dictionary mapping and word2vec modeling, has been applied to create the word embeddings from ~10,000 CT Head radiology reports.

We followed the following step: Data set retrieval from PACS, Data Cleaning & Pre-processing, Semantic-dictionary mapping (CLEVER and RadLex terminology), and Word and Report Embedding via Continuous Bag Of Word model. The size of the resulting vocabulary was 4,442 words.

Using the vector representation, we automatically classify them into three classes denoting the confidence in the diagnosis of intracranial hemorrhage by the interpreting radiologist. We performed a range of experiments with different classifiers and varying hyper-parameters settings. Best performance achieved is weighted precision of 88% and weighted recall of 90%.

The Radiology word vectors can be resued in similar classification scenarios or can be used to interpret word-to-word relations.

Using a prebuilt model:

  1. Get python 2.7

  2. unzip modelname.zip -d destination_folderpath

  3. Install gensim: pip install gensim

  4. Load model in gensim:

from gensim.models import Word2Vec

model = Word2Vec.load(modelpath)

model.similarity('new', 'recent') %used to find the cosine distance

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].