yandexdataschool / Nlp_course

Licence: MIT
YSDA course in Natural Language Processing

Programming Languages

Jupyter Notebook, Python, HTML

YSDA Natural Language Processing course

  • This is the 2021 version. For previous years' course materials, go to this branch
  • Lecture and seminar materials for each week are in ./week* folders, see README.md for materials and instructions
  • YSDA homework deadlines will be listed in Anytask (read more).
  • For technical issues, bugs in course materials, or contribution ideas, add an issue
  • Installing libraries and troubleshooting: this thread.

Syllabus

  • week01 Word Embeddings

    • Lecture: Word embeddings. Distributional semantics. Count-based (pre-neural) methods. Word2Vec: learn vectors. GloVe: count, then learn. Evaluation: intrinsic vs extrinsic. Analysis and Interpretability. Interactive lecture materials and more.
    • Seminar: Playing with word and sentence embeddings
    • Homework: Embedding-based machine translation system
  • week02 Text Classification

    • Lecture: Text classification: introduction and datasets. General framework: feature extractor + classifier. Classical approaches: Naive Bayes, MaxEnt (Logistic Regression), SVM. Neural Networks: General View, Convolutional Models, Recurrent Models. Practical Tips: Data Augmentation. Analysis and Interpretability. Interactive lecture materials and more.
    • Seminar: Text classification with convolutional NNs.
    • Homework: Statistical & neural text classification.
  • week03 Language Modeling

    • Lecture: Language Modeling: what does it mean? Left-to-right framework. N-gram language models. Neural Language Models: General View, Recurrent Models, Convolutional Models. Evaluation. Practical Tips: Weight Tying. Analysis and Interpretability. Interactive lecture materials and more.
    • Seminar: Build an N-gram language model from scratch
    • Homework: Neural LMs & smoothing in count-based models.
  • week04 Seq2seq and Attention

    • Lecture: Seq2seq Basics: Encoder-Decoder framework, Training, Simple Models, Inference (e.g., beam search). Attention: general, score functions, models. Transformer: self-attention, masked self-attention, multi-head attention; model architecture. Subword Segmentation (BPE). Analysis and Interpretability: functions of attention heads; probing for linguistic structure. Interactive lecture materials and more.
    • Seminar: Basic sequence to sequence model
    • Homework: Machine translation with attention
  • week05 Transfer Learning

    • Lecture: What is Transfer Learning? Great idea 1: From Words to Words-in-Context (CoVe, ELMo). Great idea 2: From Replacing Embeddings to Replacing Models (GPT, BERT). (A Bit of) Adaptors. Analysis and Interpretability. Interactive lecture materials and more.
  • week06 Domain Adaptation

    • Lecture: General theory. Instance weighting. Proxy-labels methods. Feature matching methods. Distillation-like methods.
    • Seminar+Homework: BERT-based NER domain adaptation

More TBA

Contributors & course staff

Course materials and teaching performed by
