Akshare: AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Stars: ✭ 4,334 (+8744.9%)
Voice datasets: 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (+908.16%)
Paperrobot: Code for PaperRobot: Incremental Draft Generation of Scientific Ideas
Stars: ✭ 372 (+659.18%)
NetEmb-Datasets: A collection of real-world networks/graphs for Network Embedding
Stars: ✭ 18 (-63.27%)
Loghub: A large collection of system log datasets for AI-powered log analytics
Stars: ✭ 551 (+1024.49%)
Open3d Ml: An extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (+479.59%)
newsletter-archive: Markdown archive & RSS/Atom feeds for Data Is Plural.
Stars: ✭ 65 (+32.65%)
Awesome Robotics: A curated list of awesome links and software libraries that are useful for robots.
Stars: ✭ 478 (+875.51%)
disent: 🧶 Modular VAE disentanglement framework for Python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (-16.33%)
Datasets For Recommender Systems: A repository of high-quality, topic-centric public data sources for Recommender Systems (RS)
Stars: ✭ 564 (+1051.02%)
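Dr.sure: 🏫 Deep learning study notes, plus notes on experience using TensorFlow and PyTorch. Dr. Sure adds the latest techniques he comes across to the project from time to time; criticism and corrections are welcome.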
Stars: ✭ 365 (+644.9%)
Pydataset: Instant access to many datasets in Python.
Stars: ✭ 880 (+1695.92%)
Medical Datasets: Tracking medical datasets, with a focus on medical imaging
Stars: ✭ 296 (+504.08%)
Datasette: An open source multi-tool for exploring and publishing data
Stars: ✭ 5,640 (+11410.2%)
Cluecorpus2020: Large-scale Pre-training Corpus for Chinese; a 100G Chinese pre-training corpus
Stars: ✭ 278 (+467.35%)
Roapi: Create full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (+416.33%)
Doccano: Open source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+11328.57%)
Audino: Open source audio annotation tool for humans™
Stars: ✭ 740 (+1410.2%)
recurrent-defocus-deblurring-synth-dual-pixel: Reference GitHub repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-38.78%)
Geobr: Easy access to official spatial data sets of Brazil in R and Python
Stars: ✭ 411 (+738.78%)
Awesome Holistic 3d: A list of papers and resources (data, code, etc.) for holistic 3D reconstruction in computer vision
Stars: ✭ 387 (+689.8%)
TSForecasting: This repository contains the implementations for experiments on a set of publicly available datasets used in time series forecasting research.
Stars: ✭ 53 (+8.16%)
Label Studio: Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+14724.49%)
Dataframes.jl: In-memory tabular data in Julia
Stars: ✭ 951 (+1840.82%)
Animal Matting: GitHub repository for the paper "End-to-end Animal Image Matting"
Stars: ✭ 363 (+640.82%)
Chakin: Simple downloader for pre-trained word vectors
Stars: ✭ 323 (+559.18%)
Datasets: Machine learning datasets used in tutorials on MachineLearningMastery.com
Stars: ✭ 536 (+993.88%)
Cleora: Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Stars: ✭ 303 (+518.37%)
Entity Recognition Datasets: A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (+1718.37%)
Tdc: Therapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Stars: ✭ 291 (+493.88%)
Meglass: An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (+473.47%)
Pytorch Cpp: C++ Implementation of PyTorch Tutorials for Everyone
Stars: ✭ 1,014 (+1969.39%)
Hub: Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data in real time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+8069.39%)
dbcollection: A collection of popular datasets for deep learning.
Stars: ✭ 26 (-46.94%)
Ogb: Benchmark datasets, data loaders, and evaluators for graph machine learning
Stars: ✭ 799 (+1530.61%)
datasets: TFDS data loaders for sign language datasets.
Stars: ✭ 17 (-65.31%)
Openml: Open Machine Learning
Stars: ✭ 489 (+897.96%)
covid-19-data-cleanup: Scripts to clean up data from https://github.com/CSSEGISandData/COVID-19
Stars: ✭ 25 (-48.98%)
Healthcheck: Health Check ✔ is a Machine Learning Web Application built with Flask that can predict three main diseases: Diabetes, Heart Disease, and Cancer.
Stars: ✭ 35 (-28.57%)
podium: Podium, a framework-agnostic Python NLP library for data loading and preprocessing
Stars: ✭ 55 (+12.24%)
Awesome Transit: Community list of transit APIs, apps, datasets, research, and software 🚌🌟🚋🌟🚂
Stars: ✭ 713 (+1355.1%)
Projects: 🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (+710.2%)
Awesome Earth Artificial Intelligence: A curated list of Earth Science Artificial Intelligence (AI) tutorials, notebooks, software, datasets, courses, books, video lectures and papers. Contributions most welcome.
Stars: ✭ 44 (-10.2%)
Commons: ⛲️ Commons Marketplace client & server to explore, download, and publish open data sets in the Ocean Protocol Network.
Stars: ✭ 34 (-30.61%)
Easypr: An easy, flexible, and accurate plate recognition project for Chinese license plates in unconstrained situations.
Stars: ✭ 6,046 (+12238.78%)