All Projects → duoergun0729 → Nlp

duoergun0729 / Nlp

兜哥出品 <一本开源的NLP入门书籍>

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Nlp

Embedding As Service
One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Stars: ✭ 151 (-91%)
Mutual labels:  ai, word2vec, fasttext
NLP-paper
🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/
Stars: ✭ 23 (-98.63%)
Mutual labels:  word2vec, fasttext
Simple-Sentence-Similarity
Exploring the simple sentence similarity measurements using word embeddings
Stars: ✭ 99 (-94.1%)
Mutual labels:  word2vec, fasttext
Persian-Sentiment-Analyzer
Persian sentiment analysis ( آناکاوی سهش های فارسی | تحلیل احساسات فارسی )
Stars: ✭ 30 (-98.21%)
Mutual labels:  word2vec, fasttext
Djl
An Engine-Agnostic Deep Learning Framework in Java
Stars: ✭ 2,262 (+34.88%)
Mutual labels:  ai, fasttext
Ai law
all kinds of baseline models for long text classificaiton( text categorization)
Stars: ✭ 243 (-85.51%)
Mutual labels:  ai, fasttext
word embedding
Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-98.75%)
Mutual labels:  word2vec, fasttext
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+661.06%)
Mutual labels:  word2vec, fasttext
Lmdb Embeddings
Fast word vectors with little memory usage in Python
Stars: ✭ 404 (-75.91%)
Mutual labels:  word2vec, fasttext
Neural Networks
All about Neural Networks!
Stars: ✭ 34 (-97.97%)
Mutual labels:  word2vec, fasttext
Finalfusion Rust
finalfusion embeddings in Rust
Stars: ✭ 35 (-97.91%)
Mutual labels:  word2vec, fasttext
Cw2vec
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Stars: ✭ 224 (-86.64%)
Mutual labels:  word2vec, fasttext
Shallowlearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (-88.31%)
Mutual labels:  word2vec, fasttext
Wordvectors
Pre-trained word vectors of 30+ languages
Stars: ✭ 2,043 (+21.82%)
Mutual labels:  word2vec, fasttext
Nlp Journey
Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Stars: ✭ 1,290 (-23.08%)
Mutual labels:  word2vec, fasttext
Embedding
Embedding模型代码和学习笔记总结
Stars: ✭ 25 (-98.51%)
Mutual labels:  word2vec, fasttext
Wordembeddings Elmo Fasttext Word2vec
Using pre trained word embeddings (Fasttext, Word2Vec)
Stars: ✭ 146 (-91.29%)
Mutual labels:  word2vec, fasttext
Fasttext4j
Implementing Facebook's FastText with java
Stars: ✭ 148 (-91.17%)
Mutual labels:  word2vec, fasttext
Text summurization abstractive methods
Multiple implementations for abstractive text summurization , using google colab
Stars: ✭ 359 (-78.59%)
Mutual labels:  ai, word2vec
Convai Bot 1337
NIPS Conversational Intelligence Challenge 2017 Winner System: Skill-based Conversational Agent with Supervised Dialog Manager
Stars: ✭ 65 (-96.12%)
Mutual labels:  ai, fasttext

NLP基础知识

NLP应用案例

让机器理解文字

图像、文字和声音是人类接触、理解外部世界最常见的三种形式,其中文字又是最容易保存和进行交换的形式。大量的人类文明,最终都是以文字的形式保留下来的;大量的信息交换,都是是文字的形式进行。如何能让机器可以与人交流,理解人类的思想,最终能像人类一样理解文字以及文字背后的各种想法、意图呢?自然语言处理,即所谓的NLP是重要的支撑技术。

人机对话

NLP与安全

在传统的web攻防中,大家与http协议结下了不解之缘。但是在安全领域,web攻防只是很小一个分支。许多明显的与工作和生活无关的垃圾邮件,人一样就可以看出来,但是基于规则的垃圾邮件网关处理起来却总是差强人意。越来越多的电商、论坛甚至是视频网站的弹幕,总是可以看到明显的人身攻击或者违法违规信息,但是基于规则的过滤机制总是被绕过。人类可以很轻松的理解二十四口交换机,知道苹果是水果还是手机,但是机器如何做到呢?答案就是NLP。

一本开源的NLP入门书籍

这可能是第一本用开源的思想写的NLP入门书籍,整个写作过程都在我的Github上。

https://github.com/duoergun0729/nlp

之所以想用开源的思路去写,主要是因为NLP技术,尤其是基于机器学习的NLP技术发展非常快,比如目前已经广泛使用的fasttext技术,2016年发布论文,2017年已经进入大量生产领域,但是许多自然语言处理书籍还停留在大学课程的范围,甚至连词向量都很少涉及。相对周期繁琐的纸质书籍编写,在Github上我可以很方便的进行编写和更新,有勘误也可以很快修改。目前我已经完成了其中的三篇,后面我将不断更新内容,大家可以订阅我的Github,或者关注我的微信公众号《兜哥带你学安全

公众号

License

© 2018~2020 兜哥.

本作品采用知识共享署名-非商业性使用 4.0 国际许可协议进行许可。没有我许可的任何使用该书进行的商业行为都是违法。

打赏

写书不容易,尤其是使用个人休息时间的写作,感谢您的打赏,100不嫌多1块不嫌少。

打赏

如果您更喜欢知识星球这种形式,可以加入我的知识星球,NLP相关的问题也可以在星球里面提问。

知识星球

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].