Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
Stars: ✭ 231 (+153.85%)
Harvesttext文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
Stars: ✭ 956 (+950.55%)
awesome-sweden-datasetsA curated list of awesome datasets to use when coding for the Swedish market.
Stars: ✭ 17 (-81.32%)
Neuronlp2Deep neural models for core NLP tasks (Pytorch version)
Stars: ✭ 397 (+336.26%)
molminerPython library and command-line tool for extracting compounds from scientific literature. Written in Python.
Stars: ✭ 38 (-58.24%)
MolaA Modular Optimization framework for Localization and mApping (MOLA)
Stars: ✭ 206 (+126.37%)
CLNER[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
Stars: ✭ 50 (-45.05%)
Awesome Json DatasetsA curated list of awesome JSON datasets that don't require authentication.
Stars: ✭ 2,421 (+2560.44%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-10.99%)
DatasaurusR Package 📦 Containing the Datasaurus Dozen datasets 📊
Stars: ✭ 193 (+112.09%)
Unify Emotion DatasetsA Survey and Experiments on Annotated Corpora for Emotion Classification in Text
Stars: ✭ 169 (+85.71%)
Awesome Holistic 3dA list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision
Stars: ✭ 387 (+325.27%)
Awesome Nlp PolishA curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.
Stars: ✭ 153 (+68.13%)
IdenprofIdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.
Stars: ✭ 149 (+63.74%)
Gekko DatasetsGekko Trading Bot dataset dumps. Ready to use and download history files in SQLite format.
Stars: ✭ 146 (+60.44%)
Remo Python🐰 Python lib for remo - the app for annotations and images management in Computer Vision
Stars: ✭ 138 (+51.65%)
IE Paper NotesPaper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Extraction (EE), Named Entity Disambiguation (NED).
Stars: ✭ 14 (-84.62%)
Multi object datasetsMulti-object image datasets with ground-truth segmentation masks and generative factors.
Stars: ✭ 121 (+32.97%)
TnerLanguage model finetuning on NER with an easy interface, and cross-domain evaluation. We released NER models finetuned on various domain via huggingface model hub.
Stars: ✭ 54 (-40.66%)
FirstcoursenetworkscienceTutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis
Stars: ✭ 111 (+21.98%)
react-taggyA simple zero-dependency React component for tagging user-defined entities within a block of text.
Stars: ✭ 29 (-68.13%)
Tf nerSimple and Efficient Tensorflow implementations of NER models with tf.estimator and tf.data
Stars: ✭ 876 (+862.64%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+1414.29%)
Exposure correctionReference code for the paper "Learning Multi-Scale Photo Exposure Correction", CVPR 2021.
Stars: ✭ 98 (+7.69%)
Nlp ProgressRepository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Stars: ✭ 19,518 (+21348.35%)
AIODriveOfficial Python/PyTorch Implementation for "All-In-One Drive: A Large-Scale Comprehensive Perception Dataset with High-Density Long-Range Point Clouds"
Stars: ✭ 32 (-64.84%)
DatashareBetter analyze information, in all its forms
Stars: ✭ 254 (+179.12%)
bernA neural named entity recognition and multi-type normalization tool for biomedical text mining
Stars: ✭ 151 (+65.93%)
Bert nerNer with Bert
Stars: ✭ 240 (+163.74%)
Animal MattingGithub repository for the paper End-to-end Animal Image Matting
Stars: ✭ 363 (+298.9%)
Fancy NlpNLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.
Stars: ✭ 233 (+156.04%)
presidio-researchThis package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
Stars: ✭ 62 (-31.87%)
Multi Task Nlpmulti_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Stars: ✭ 221 (+142.86%)
GigabertZero-shot Transfer Learning from English to Arabic
Stars: ✭ 23 (-74.73%)
eve-botEVE bot, a customer service chatbot to enhance virtual engagement for Twitter Apple Support
Stars: ✭ 31 (-65.93%)
DareblopyData Reading Blocks for Python
Stars: ✭ 82 (-9.89%)
Tf Lstm Crf BatchTensorflow-LSTM-CRF tool for Named Entity Recognizer
Stars: ✭ 59 (-35.16%)
DatasetteAn open source multi-tool for exploring and publishing data
Stars: ✭ 5,640 (+6097.8%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-67.03%)