All Projects → king-menin → mipt-nlp2021

king-menin / mipt-nlp2021

Licence: other
NLP course, MIPT

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to mipt-nlp2021

data-mining
Resources for the Data Mining for Bussiness and Governance course.
Stars: ✭ 52 (+100%)
Mutual labels:  course
blockchain101
区块链是面向未来数字化社会的新一代信息技术。本课程将系统介绍区块链原理和应用,让学生对区块链有整体的了解。课程设计目标是帮助学生树立分布式整体性世界观,教学大纲将涵盖围绕区块链,通过应用密码学、分布式系统基础、博弈论的基础知识,把区块链作为分布式整体世界观最前沿的创新应用进行系统讲解。课程还将引入区块链智能合约的概念,帮助学生理解区块链编程的理念和应用的方法。
Stars: ✭ 102 (+292.31%)
Mutual labels:  course
tex-course-index-template
A template for writing a condensed course index leveraging LaTeX indexing
Stars: ✭ 30 (+15.38%)
Mutual labels:  course
udemy-rails-api
Udemy Rails API course
Stars: ✭ 25 (-3.85%)
Mutual labels:  course
ASPP-2018-numpy
Material for the Advanced Scientific Programming in Python course on advanced numpy
Stars: ✭ 24 (-7.69%)
Mutual labels:  course
cpp-made-2020-hw
A repo for publishing tasks and tests for mail.ru MADE (big-data school) 2020 C++ course.
Stars: ✭ 34 (+30.77%)
Mutual labels:  course
edge-computer-vision
Edge Computer Vision Course
Stars: ✭ 41 (+57.69%)
Mutual labels:  course
react-native-saas
☁️ Application using Redux, Redux-Saga, React Native Redux Toast, Immer, react-native-side-menu, React Native Async Storage, react-native-iphone-x-helper, React Native Vector Icons and consuming the features of the Node.js - SaaS API
Stars: ✭ 14 (-46.15%)
Mutual labels:  course
neutronics-workshop
A workshop covering a range of fusion relevant analysis and simulations with OpenMC, DAGMC, Paramak and other open source fusion neutronics tools
Stars: ✭ 29 (+11.54%)
Mutual labels:  course
course-ASAP-learn-golang
A$AP Learn GoLang Open Course 🚀to the 🌙
Stars: ✭ 20 (-23.08%)
Mutual labels:  course
blockchain-development
A complimentary course for an understanding of blockchain and its development like custom blockchain, dapps, etc.
Stars: ✭ 71 (+173.08%)
Mutual labels:  course
geospatial-modeling-course
NCSU GIS/MEA582: Geospatial Modeling and Analysis Course
Stars: ✭ 30 (+15.38%)
Mutual labels:  course
gds course
Geographic Data Science, the course
Stars: ✭ 60 (+130.77%)
Mutual labels:  course
reinforcement learning course materials
Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
Stars: ✭ 765 (+2842.31%)
Mutual labels:  course
just-the-class
A modern, highly customizable, responsive Jekyll template for course websites.
Stars: ✭ 156 (+500%)
Mutual labels:  course
vuejs-egitimi
Vue.js ile Sıfırdan Uygulama Geliştirme Eğitimi uygulama ve proje dosyaları
Stars: ✭ 19 (-26.92%)
Mutual labels:  course
Python-Course
🐍 This is the most complete course in Python, completely practical and all the lessons are explained with examples, so that they can be easily understood. 🍫
Stars: ✭ 18 (-30.77%)
Mutual labels:  course
How-To-Start-A-Startup
"How to Start a Startup" is the Y Combinator class made by real entrepreneurs
Stars: ✭ 55 (+111.54%)
Mutual labels:  course
Statistical Inference BSc
Course materials for Statistical Inference ("Inferência Estatística")
Stars: ✭ 47 (+80.77%)
Mutual labels:  course
start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2022 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
Stars: ✭ 3,066 (+11692.31%)
Mutual labels:  course

mipt-nlp2022

NLP course, MIPT

Course instructors

Anton Emelianov ([email protected], @king_menin), Albina Akhmetgareeva ([email protected])

Videos here

Exam questions here

Mark

final_mark=sum_i (max_score(HW_i)) / count(HWs) * 10, i==1..3

Lecture schedule

Week 1

Week 2

Distributional semantics. Count-based (pre-neural) methods. Word2Vec: learn vectors. GloVe: count, then learn. N-gram (collocations) RusVectores. t-SNE.

Week 3

Neural Language Models: Recurrent Models, Convolutional Models. Text classification (architectures)

Week 4

Task description, methods (Markov Model, RNNs), evaluation (perplexity), Sequence Labelling (NER, pos-tagging, chunking etc.) N-gram language models, HMM, MEMM, CRF

Week 5

Basics: Encoder-Decoder framework, Inference (e.g., beam search), Eval (bleu). Attention: general, score functions, models. Bahdanau and Luong models. Transformer: self-attention, masked self-attention, multi-head attention.

Week 6

Bertology (BERT, GPT-s, t5, etc.), Subword Segmentation (BPE), Evaluation of big LMs.

Week 7

Lecture & Practical: How to train big models? Distributed training

Training Multi-Billion Parameter Language Models. Model Parallelism. Data Parallelism.

Week 8

Squads (one-hop, multi-hop), architectures, retrieval and search, chat-bots

Week 9

Week 10

Recommended Resources

En

На русском (и про русский, в основном)

Literature

  • Manning, Christopher D., and Hinrich Schütze. Foundations of statistical natural language processing. Vol. 999. Cambridge: MIT press, 1999.
  • Martin, James H., and Daniel Jurafsky. "Speech and language processing." International Edition 710 (2000): 25.
  • Cohen, Shay. "Bayesian analysis in natural language processing." Synthesis Lectures on Human Language Technologies 9, no. 2 (2016): 1-274.
  • Goldberg, Yoav. "Neural Network Methods for Natural Language Processing." Synthesis Lectures on Human Language Technologies 10, no. 1 (2017): 1-309.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].