OgbBenchmark datasets, data loaders, and evaluators for graph machine learning
Stars: ✭ 799 (-48.39%)
ChakinSimple downloader for pre-trained word vectors
Stars: ✭ 323 (-79.13%)
Nlp TutorialNatural Language Processing Tutorial for Deep Learning Researchers
Stars: ✭ 9,895 (+539.21%)
Medical Datasetstracking medical datasets, with a focus on medical imaging
Stars: ✭ 296 (-80.88%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (-81.65%)
EasyprAn easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations.
Stars: ✭ 6,046 (+290.57%)
Cluecorpus2020Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
Stars: ✭ 278 (-82.04%)
ColourColour Science for Python
Stars: ✭ 1,131 (-26.94%)
Label StudioLabel Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+369.25%)
RoapiCreate full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (-83.66%)
Transitland DatastoreTransitland's centralized web service API for both querying and editing aggregated transit data from around the world
Stars: ✭ 101 (-93.48%)
Chinese Bert WwmPre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
Stars: ✭ 6,357 (+310.66%)
TSForecastingThis repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.
Stars: ✭ 53 (-96.58%)
FewCLUEFewCLUE 小样本学习测评基准,中文版
Stars: ✭ 251 (-83.79%)
PersonasDatasets for Deep learning Personas
Stars: ✭ 49 (-96.83%)
T3[EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Qian Chen, Shuohang Wang, Bo Li
Stars: ✭ 25 (-98.39%)
LoghubA large collection of system log datasets for AI-powered log analytics
Stars: ✭ 551 (-64.41%)
SIGIR2021 ConureOne Person, One Model, One World: Learning Continual User Representation without Forgetting
Stars: ✭ 23 (-98.51%)
Bert As ServiceMapping a variable-length sentence to a fixed-length vector using BERT model
Stars: ✭ 9,779 (+531.72%)
Bert PytorchGoogle AI 2018 BERT pytorch implementation
Stars: ✭ 4,642 (+199.87%)
dplace-dataThe data repository for the D-PLACE Project (Database of Places, Language, Culture and Environment)
Stars: ✭ 49 (-96.83%)
DatasetteAn open source multi-tool for exploring and publishing data
Stars: ✭ 5,640 (+264.34%)
TEXTOIRTEXTOIR is a flexible toolkit for open intent detection and discovery. (ACL 2021)
Stars: ✭ 31 (-98%)
Pytorch CppC++ Implementation of PyTorch Tutorials for Everyone
Stars: ✭ 1,014 (-34.5%)
covid-19-data-cleanupScripts to cleanup data from https://github.com/CSSEGISandData/COVID-19
Stars: ✭ 25 (-98.39%)
Voice datasets🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (-68.09%)
Wb srgbWhite balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
Stars: ✭ 101 (-93.48%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+261.76%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-97.87%)
policy-data-analyzerBuilding a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (-98.58%)
Awesome RoboticsA curated list of awesome links and software libraries that are useful for robots.
Stars: ✭ 478 (-69.12%)
disent🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (-97.35%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-94.77%)
knowledge-graph-nlp-in-action从模型训练到部署,实战知识图谱(Knowledge Graph)&自然语言处理(NLP)。涉及 Tensorflow, Bert+Bi-LSTM+CRF,Neo4j等 涵盖 Named Entity Recognition,Text Classify,Information Extraction,Relation Extraction 等任务。
Stars: ✭ 58 (-96.25%)
ADL2019Applied Deep Learning (2019 Spring) @ NTU
Stars: ✭ 20 (-98.71%)
HealthcheckHealth Check ✔ is a Machine Learning Web Application made using Flask that can predict mainly three diseases i.e. Diabetes, Heart Disease, and Cancer.
Stars: ✭ 35 (-97.74%)
Filipino-Text-BenchmarksOpen-source benchmark datasets and pretrained transformer models in the Filipino language.
Stars: ✭ 22 (-98.58%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-96.58%)
Doppelganger[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Stars: ✭ 97 (-93.73%)
HugsVisionHugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Stars: ✭ 154 (-90.05%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (-20.61%)
Commons⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Stars: ✭ 34 (-97.8%)
GeobrEasy access to official spatial data sets of Brazil in R and Python
Stars: ✭ 411 (-73.45%)
ercEmotion recognition in conversation
Stars: ✭ 34 (-97.8%)
syntaxdotNeural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
Stars: ✭ 32 (-97.93%)
Projects🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (-74.35%)
ml4seA curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Stars: ✭ 46 (-97.03%)
Dataframes.jlIn-memory tabular data in Julia
Stars: ✭ 951 (-38.57%)
opendatasetsA Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
Stars: ✭ 161 (-89.6%)
Awesome Holistic 3dA list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision
Stars: ✭ 387 (-75%)
databrewerThe missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!
Stars: ✭ 39 (-97.48%)
ck-envCK repository with components and automation actions to enable portable workflows across diverse platforms including Linux, Windows, MacOS and Android. It includes software detection plugins and meta packages (code, data sets, models, scripts, etc) with the possibility of multiple versions to co-exist in a user or system environment:
Stars: ✭ 67 (-95.67%)
Atis datasetThe ATIS (Airline Travel Information System) Dataset
Stars: ✭ 81 (-94.77%)
PydatasetInstant access to many datasets in Python.
Stars: ✭ 880 (-43.15%)