fnielsen / dasem

License: Apache-2.0
Danish Semantic analysis

Programming languages: Python, Jupyter Notebook, HTML

Projects that are alternatives of or similar to dasem

Shallowlearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+1052.94%)
Mutual labels:  word-embeddings
HiCE
Code for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"
Stars: ✭ 56 (+229.41%)
Mutual labels:  word-embeddings
wefe
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
Stars: ✭ 164 (+864.71%)
Mutual labels:  word-embeddings
Question Generation
Generating multiple choice questions from text using Machine Learning.
Stars: ✭ 227 (+1235.29%)
Mutual labels:  word-embeddings
Pytorch Sentiment Analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+18776.47%)
Mutual labels:  word-embeddings
lemmy
🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪
Stars: ✭ 68 (+300%)
Mutual labels:  danish
Germanwordembeddings
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (+1011.76%)
Mutual labels:  word-embeddings
S-WMD
Code for Supervised Word Mover's Distance (SWMD)
Stars: ✭ 90 (+429.41%)
Mutual labels:  word-embeddings
Simple-Sentence-Similarity
Exploring the simple sentence similarity measurements using word embeddings
Stars: ✭ 99 (+482.35%)
Mutual labels:  word-embeddings
sister
SImple SenTence EmbeddeR
Stars: ✭ 66 (+288.24%)
Mutual labels:  word-embeddings
Wordgcn
ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (+1252.94%)
Mutual labels:  word-embeddings
Spanish Word Embeddings
Spanish word embeddings computed with different methods and from different corpora
Stars: ✭ 236 (+1288.24%)
Mutual labels:  word-embeddings
two-stream-cnn
A two-stream convolutional neural network for learning arbitrary similarity functions over two sets of training data
Stars: ✭ 24 (+41.18%)
Mutual labels:  word-embeddings
Chameleon recsys
Source code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
Stars: ✭ 202 (+1088.24%)
Mutual labels:  word-embeddings
fuzzymax
Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
Stars: ✭ 43 (+152.94%)
Mutual labels:  word-embeddings
Jfasttext
Java interface for fastText
Stars: ✭ 193 (+1035.29%)
Mutual labels:  word-embeddings
overview-and-benchmark-of-traditional-and-deep-learning-models-in-text-classification
NLP tutorial
Stars: ✭ 41 (+141.18%)
Mutual labels:  word-embeddings
word2vec-on-wikipedia
A pipeline for training word embeddings using word2vec on wikipedia corpus.
Stars: ✭ 68 (+300%)
Mutual labels:  word-embeddings
PersianNER
Named-Entity Recognition in Persian Language
Stars: ✭ 48 (+182.35%)
Mutual labels:  word-embeddings
Word2VecfJava
Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions
Stars: ✭ 14 (-17.65%)
Mutual labels:  word-embeddings

Dasem

Danish semantic analysis.

Examples

Get nouns from DanNet (the Danish wordnet) and Wiktionary:

from dasem.wiktionary import get_nouns
from dasem.dannet import Dannet

# Nouns listed in Wiktionary
wiktionary_nouns = get_nouns()

# Nouns in Dannet, selected with an SQL query on the words table
dannet = Dannet()
query = "select w.form from words w where w.pos = 'Noun'"
dannet_nouns = set(dannet.db.query(query).form)

# Union of the two noun sets
nouns = dannet_nouns.union(wiktionary_nouns)
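
To try the query-then-union pattern without dasem or the Dannet data, a minimal sketch with an in-memory SQLite table may help. Only the names `words`, `form` and `pos` come from the query above; the rows and the `wiktionary_nouns` set are invented toy data, and this is not a description of how dasem stores Dannet internally.

```python
import sqlite3

# Toy stand-in for the Dannet words table; the rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("create table words (form text, pos text)")
conn.executemany(
    "insert into words (form, pos) values (?, ?)",
    [("mand", "Noun"), ("kvinde", "Noun"), ("løbe", "Verb"), ("hund", "Noun")],
)

# Same query text as in the dasem example.
query = "select w.form from words w where w.pos = 'Noun'"
dannet_nouns = {row[0] for row in conn.execute(query)}

# Hypothetical Wiktionary result; the real get_nouns() is not called here.
wiktionary_nouns = {"hund", "kat"}

# The set union removes the duplicate "hund".
nouns = dannet_nouns.union(wiktionary_nouns)
print(sorted(nouns))  # ['hund', 'kat', 'kvinde', 'mand']
```

Using sets means a noun found in both sources appears only once in the result.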

Get the words most similar to "mand" (man) according to a word2vec model trained on the Danish part of the Project Gutenberg corpus:

$ python -m dasem.gutenberg most-similar mand
kvinde
dame
pige
kone
fyr
dreng
præst
profet
hund
person
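
Most-similar lookups on word2vec models are commonly ranked by cosine similarity between word vectors. The sketch below illustrates that ranking on a hand-made toy vocabulary; the `cosine` and `most_similar` helpers and the two-dimensional vectors are hypothetical and are not taken from dasem, whose actual implementation is not shown here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(word, vectors, top_n=10):
    """Return up to top_n other words, ordered by decreasing cosine similarity."""
    target = vectors[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scored[:top_n]]


# Invented 2-dimensional vectors; a trained model would have far more dimensions.
toy_vectors = {
    "mand": [1.0, 0.0],
    "kvinde": [0.9, 0.1],
    "hund": [0.5, 0.5],
    "bil": [0.0, 1.0],
}

print(most_similar("mand", toy_vectors, top_n=2))  # ['kvinde', 'hund']
```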

Get the first two sentences from the Dannet synset examples:

$ python -m dasem.dannet get-all-sentences | head -n 2
I september måned var jeg sammen med en dansk gruppe af unge bøsser og lesbiske i Moskva
Til en gruppe på 10 børn i alderen 0-3 år søges pr. 1.3.83 en pædagog 40 timer ugentligt
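
The effect of `head -n 2` can also be had inside Python by taking the first items of an iterator lazily. The sketch below assumes a hypothetical generator `iter_sentences` standing in for a sentence source; it is not a dasem function, and its third item is a placeholder.

```python
from itertools import islice


def iter_sentences():
    """Hypothetical sentence source; the first two items repeat the output above."""
    yield "I september måned var jeg sammen med en dansk gruppe af unge bøsser og lesbiske i Moskva"
    yield "Til en gruppe på 10 børn i alderen 0-3 år søges pr. 1.3.83 en pædagog 40 timer ugentligt"
    yield "Placeholder third sentence (never reached by the slice below)"


# islice stops after two items, so the rest of the source is never consumed.
first_two = list(islice(iter_sentences(), 2))
for sentence in first_two:
    print(sentence)
```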
