All Projects → abhishekkrthakur → approaching_almost_nlp

abhishekkrthakur / approaching_almost_nlp

Licence: GPL-3.0 license
Approaching (Almost) Any Natural Language Processing Problem

Approaching (Almost) Any Natural Language Processing Problem

Contents:

  • The obvious intro to NLP
  • Bag of words
  • TFIDF
  • Hashing
  • Cleaning text data (stemming, lemmatization)
  • Tokenization of text data (from space-based to BPE and SentencePiece)
  • Word embeddings
  • Revisit LSTM, GRU, 1-D CNN, 2-D CNN
  • Decomposition-SVD
  • SVM, Logistic regression based models
  • Topic modelling
  • POS tagging, NER, entity detection/extraction using traditional approaches
  • Attention is all you need
  • Transformers
  • BERT, RoBERTa, XLM, and other transformer based models
  • Distillation of transformer based models
  • Entity extraction using transformer based models
  • Text summarization
  • Various assignments and projects

Missing something? Have a wish? Create an issue and let me know :)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].