All Projects → ishaan007 → vector_space_modelling

ishaan007 / vector_space_modelling

Licence: other
NLP in python Vector Space Modelling and document classification NLP

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to vector space modelling

Shallowlearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+1125%)
Mutual labels:  word2vec, scikit-learn
Stocksensation
基于情感字典和机器学习的股市舆情情感分类可视化Web
Stars: ✭ 215 (+1243.75%)
Mutual labels:  word2vec, scikit-learn
text-classification-cn
中文文本分类实践,基于搜狗新闻语料库,采用传统机器学习方法以及预训练模型等方法
Stars: ✭ 81 (+406.25%)
Mutual labels:  word2vec, scikit-learn
IBM-final-project-Machine-Learning
Final project of IBM's course https://www.coursera.org/learn/machine-learning-with-python on coursera
Stars: ✭ 33 (+106.25%)
Mutual labels:  scikit-learn
GraphDBLP
a Graph-based instance of DBLP
Stars: ✭ 33 (+106.25%)
Mutual labels:  word2vec
sentiment-analysis-of-tweets-in-russian
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
Stars: ✭ 51 (+218.75%)
Mutual labels:  word2vec
fsauor2018
基于LSTM网络与自注意力机制对中文评论进行细粒度情感分析
Stars: ✭ 36 (+125%)
Mutual labels:  word2vec
playground
A Streamlit application to play with machine learning models directly from the browser
Stars: ✭ 48 (+200%)
Mutual labels:  scikit-learn
NTUA-slp-nlp
💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (+18.75%)
Mutual labels:  word2vec
cyber-matrix-ai
Collection of cyber security and "AI" relevant topics
Stars: ✭ 69 (+331.25%)
Mutual labels:  word2vec
text classifier
Tensorflow2.3的文本分类项目,支持各种分类模型,支持相关tricks。
Stars: ✭ 135 (+743.75%)
Mutual labels:  word2vec
A-Detector
⭐ An anomaly-based intrusion detection system.
Stars: ✭ 69 (+331.25%)
Mutual labels:  scikit-learn
text-mining-corona-articles
Text Mining for Indonesian Online News Articles About Corona
Stars: ✭ 15 (-6.25%)
Mutual labels:  word2vec
osprey
🦅Hyperparameter optimization for machine learning pipelines 🦅
Stars: ✭ 71 (+343.75%)
Mutual labels:  scikit-learn
word2viz
Visualization of semantic similarities in word embeddings.
Stars: ✭ 86 (+437.5%)
Mutual labels:  word2vec
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+68.75%)
Mutual labels:  word2vec
Word2Vec-on-Wikipedia-Corpus
利用wikipedia中英文的語料訓練Word2vec模型
Stars: ✭ 18 (+12.5%)
Mutual labels:  word2vec
abess
Fast Best-Subset Selection Library
Stars: ✭ 266 (+1562.5%)
Mutual labels:  scikit-learn
wikidata-corpus
Train Wikidata with word2vec for word embedding tasks
Stars: ✭ 109 (+581.25%)
Mutual labels:  word2vec
books
A collection of online books for data science, computer science and coding!
Stars: ✭ 29 (+81.25%)
Mutual labels:  scikit-learn

Text classification Blog Link

Document classification

        a. Feature extraction
                (i)TF-IDF
                (ii) word embeddings using doc2vec
        b. Classification
                (i) Logistic Regression
                (ii) Naive Bayes (Multinomial and gauusian)

Token classification :TODO

Data

Reuters News data

Cleaned Reuters data

Results

Document Classification

Token Classification :TODO

Model Links

Doc2Vec self trained model

Contributing

  1. Fork it!
  2. Create your feature branch: git checkout -b my-new-feature
  3. Commit your changes: git commit -am 'Add some feature'
  4. Push to the branch: git push origin my-new-feature
  5. Submit a pull request :D
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].