ChineseglueLanguage Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
Stars: ✭ 1,548 (+911.76%)
NLP PEMDCNLP Pretrained Embeddings, Models and Datasets Collections (NLP_PEMDC). The collection will be kept up to date.
Stars: ✭ 58 (-62.09%)
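Dr.sure🏫Deep learning study notes, plus notes on experience using TensorFlow and PyTorch. Dr. Sure adds the latest techniques he comes across to the project from time to time; criticism and corrections are welcome.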
Stars: ✭ 365 (+138.56%)
Coco Annotator✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Stars: ✭ 1,138 (+643.79%)
arabic-taggerAQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (-75.16%)
Machine-learningThis repository will contain all the material that beginners in ML and DL need. Follow and star this repo for regular updates.
Stars: ✭ 27 (-82.35%)
vlainic.github.ioMy GitHub blog: things you might be interested in, and probably not...
Stars: ✭ 26 (-83.01%)
Contextualized Topic ModelsA Python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual zero-shot model published at EACL 2021.
Stars: ✭ 318 (+107.84%)
Spatio-Temporal-papersThis project is a collection of recent research in areas such as new infrastructure and urban computing, including white papers, academic papers, AI labs, datasets, etc.
Stars: ✭ 180 (+17.65%)
Aiops platformAn Artificial Intelligence Platform for IT Operations.
Stars: ✭ 63 (-58.82%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Stars: ✭ 4,334 (+2732.68%)
kaggledatasetsCollection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (-71.24%)
mlconjug3A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (-69.28%)
Medical Datasetstracking medical datasets, with a focus on medical imaging
Stars: ✭ 296 (+93.46%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (-71.9%)
rs datasetsTool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-85.62%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (+85.62%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-20.26%)
Conditional-SeqGAN-TensorflowConditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow
Stars: ✭ 47 (-69.28%)
TdcTherapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Stars: ✭ 291 (+90.2%)
lidtkLanguage Identification Toolkit
Stars: ✭ 17 (-88.89%)
Cluecorpus2020Large-scale Pre-training Corpus for Chinese (100G)
Stars: ✭ 278 (+81.7%)
Naive-Bayes-Evening-WorkshopCompanion code for Introduction to Python for Data Science: Coding the Naive Bayes Algorithm evening workshop
Stars: ✭ 23 (-84.97%)
Repo 2016R, Python and Mathematica code for Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-32.68%)
brand-sentiment-analysisScripts utilizing Heartex platform to build brand sentiment analysis from the news
Stars: ✭ 21 (-86.27%)
anuvadaInterpretable Models for NLP using PyTorch
Stars: ✭ 102 (-33.33%)
Gec PseudodataRepository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Stars: ✭ 49 (-67.97%)
Pix2codepix2code: Generating Code from a Graphical User Interface Screenshot
Stars: ✭ 11,349 (+7317.65%)
Deception-Detection-on-Amazon-reviews-datasetAn SVM model that classifies reviews as real or fake. It uses both the review text and the additional features contained in the data set, and predicts with over 85% accuracy without using any deep learning techniques.
Stars: ✭ 42 (-72.55%)
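Customer satisfaction analysisAn opinion-mining project based on UGC data from online homestay listings, covering data mining and NLP processing, including data collection, topic extraction, and sentiment analysis. The goal is to overcome inconsistencies between user ratings and reviews and to evaluate satisfaction with online homestays in real time; it includes online review collection and visual sentiment analysis. The project provides a Baidu Maps POI query entry point that supports automated batch queries of POI information; builds an LDA automatic topic clustering model on the online homestay corpus, using topic keywords to find the corresponding topic attribute dictionary; uses user ratings as labels and the character-level TextCNN bundled with litNlp for sentiment analysis, treating the sentiment classification probability distribution as the sentiment trend; and finally displays homestay satisfaction across regions as a POI heat map. See the link for software versions.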
Stars: ✭ 262 (+71.24%)
mlxMachine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (-13.73%)
Awesome Earth Artificial IntelligenceA curated list of Earth Science's Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.
Stars: ✭ 44 (-71.24%)
DeepLearningReadingDeep Learning and Machine Learning mini-projects. Current Project: Deepmind Attentive Reader (rc-data)
Stars: ✭ 78 (-49.02%)
dbcollectionA collection of popular datasets for deep learning.
Stars: ✭ 26 (-83.01%)
Entity EmbeddingReference implementation of the paper "Word Embeddings for Entity-annotated Texts"
Stars: ✭ 19 (-87.58%)
Mrc bookCode for the book "Machine Reading Comprehension: Algorithms and Practice" (《机器阅读理解:算法与实践》)
Stars: ✭ 102 (-33.33%)
datasetsTFDS data loaders for sign language datasets.
Stars: ✭ 17 (-88.89%)
dagpiDagpi is a powerful, fast API that performs image manipulation and serves datasets. It is written in Rust and Python. Perfect for Discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-83.66%)
mindsdb-examplesUsage examples for MindsDB: https://www.mindsdb.com/
Stars: ✭ 25 (-83.66%)
Multi object datasetsMulti-object image datasets with ground-truth segmentation masks and generative factors.
Stars: ✭ 121 (-20.92%)
Gekko DatasetsGekko Trading Bot dataset dumps. Ready to use and download history files in SQLite format.
Stars: ✭ 146 (-4.58%)
Dan Jurafsky Chris Manning NlpMy solutions to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.
Stars: ✭ 124 (-18.95%)
LemminflectA Python module for English lemmatization and inflection.
Stars: ✭ 105 (-31.37%)
ColourColour Science for Python
Stars: ✭ 1,131 (+639.22%)
Lingua👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+122.88%)