Zhihu - This repo contains the source code from my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented in Python 3.6. It includes Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolutional GAN, and other hands-on code.
Stars: ✭ 3,307 (+6789.58%)
Coarij - Corpus of Annual Reports in Japan
Stars: ✭ 55 (+14.58%)
Attention Mechanisms - Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras.
Stars: ✭ 203 (+322.92%)
Chazutsu - A tool that makes NLP datasets ready to use
Stars: ✭ 238 (+395.83%)
Clean Text - 🧹 Python package for text cleaning
Stars: ✭ 284 (+491.67%)
Tokenizer - Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (+175%)
Pytorch Nlp - Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+4058.33%)
Fakenewscorpus - A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+431.25%)
Bond - BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
Stars: ✭ 96 (+100%)
Texar Pytorch - Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+1225%)
Prosody - Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (+189.58%)
Data Science - Collection of useful data science topics along with code and articles
Stars: ✭ 315 (+556.25%)
Deep Learning Drizzle - Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+20143.75%)
Opus Mt - Open neural machine translation models and web services
Stars: ✭ 111 (+131.25%)
Texar - Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+4558.33%)
Spark Nlp - State of the Art Natural Language Processing
Stars: ✭ 2,518 (+5145.83%)
Mams For Absa - A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (+181.25%)
Wikisql - A large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+1910.42%)
Nlp bahasa resources - A Curated List of Datasets and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (+229.17%)
Nlp Progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Stars: ✭ 19,518 (+40562.5%)
Doccano - Open source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+11566.67%)
Ua Gec - UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (+125%)
Nlg Eval - Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Stars: ✭ 822 (+1612.5%)
Pytreebank - 😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (+93.75%)
Hardware Aware Transformers - [ACL 2020] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Stars: ✭ 206 (+329.17%)
Oie Resources - A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+489.58%)
Mtbook - Machine Translation: Foundations and Models (《机器翻译:基础与模型》), by Tong Xiao and Jingbo Zhu
Stars: ✭ 2,307 (+4706.25%)
Opennmt Tf - Neural machine translation and sequence learning using TensorFlow
Stars: ✭ 1,223 (+2447.92%)
Comet - A Neural Framework for MT Evaluation
Stars: ✭ 58 (+20.83%)
Nonautoreggenprogress - Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+145.83%)
Thot - Thot toolkit for statistical machine translation
Stars: ✭ 53 (+10.42%)
Char Rnn Tensorflow - Multi-layer Recurrent Neural Networks for character-level language models, implemented in TensorFlow
Stars: ✭ 58 (+20.83%)
Text2sql Data - A collection of datasets that pair questions with SQL queries.
Stars: ✭ 287 (+497.92%)
String To Tree Nmt - Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"
Stars: ✭ 16 (-66.67%)
Sangita - A Natural Language Toolkit for Indian Languages
Stars: ✭ 43 (-10.42%)
Py Nltools - A collection of basic Python modules for spoken natural language processing
Stars: ✭ 46 (-4.17%)
Pts - Quantized Mesh Terrain Data Generator and Server for CesiumJS Library
Stars: ✭ 36 (-25%)
Okutama Action - Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection
Stars: ✭ 36 (-25%)
Ludwig - Data-centric declarative deep learning framework
Stars: ✭ 8,018 (+16604.17%)
Dataconfs - A list of conferences connected with data worldwide.
Stars: ✭ 36 (-25%)
Gsoc2018 3gm - 💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (-25%)
Stocksight - Stock market analyzer and predictor using Elasticsearch, Twitter, news headlines, and Python natural language processing and sentiment analysis
Stars: ✭ 1,037 (+2060.42%)
Bartycrouch - Localization/I18n: Incrementally update/translate your Strings files from .swift, .h, .m(m), .storyboard or .xib files.
Stars: ✭ 1,032 (+2050%)
Textblob - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Stars: ✭ 7,991 (+16547.92%)
Vale - 📝 A syntax-aware linter for prose built with speed and extensibility in mind.
Stars: ✭ 978 (+1937.5%)
Rebiber - A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Stars: ✭ 1,005 (+1993.75%)
Nhazm - A C# version of Hazm (Python library for digesting Persian text)
Stars: ✭ 35 (-27.08%)