All Projects → rmax → databrewer-recipes

rmax / databrewer-recipes

Licence: other
DataBrewer Recipes Repository.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to databrewer-recipes

Data-Science-and-Machine-Learning-Resources
List of Data Science and Machine Learning Resource that I frequently use
Stars: ✭ 19 (+0%)
Mutual labels:  datasets
RData.jl
Read R data files from Julia
Stars: ✭ 49 (+157.89%)
Mutual labels:  datasets
dplace-data
The data repository for the D-PLACE Project (Database of Places, Language, Culture and Environment)
Stars: ✭ 49 (+157.89%)
Mutual labels:  datasets
awesome-sweden-datasets
A curated list of awesome datasets to use when coding for the Swedish market.
Stars: ✭ 17 (-10.53%)
Mutual labels:  datasets
bnk48 photo datasets
BNK48 Photo Datasets
Stars: ✭ 12 (-36.84%)
Mutual labels:  datasets
databrewer
The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!
Stars: ✭ 39 (+105.26%)
Mutual labels:  datasets
systematic-review-datasets
A collection of fully labeled systematic review datasets (title-abstract screening)
Stars: ✭ 25 (+31.58%)
Mutual labels:  datasets
podium
Podium: a framework agnostic Python NLP library for data loading and preprocessing
Stars: ✭ 55 (+189.47%)
Mutual labels:  datasets
SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Stars: ✭ 74 (+289.47%)
Mutual labels:  datasets
download audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (+178.95%)
Mutual labels:  datasets
kaggle-code
A repository for some of the code I used in kaggle data science & machine learning tasks.
Stars: ✭ 100 (+426.32%)
Mutual labels:  datasets
the-weather-scraper
A Lightweight Weather Scraper
Stars: ✭ 56 (+194.74%)
Mutual labels:  datasets
opendatasets
A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
Stars: ✭ 161 (+747.37%)
Mutual labels:  datasets
text-classification-small-datasets
Building a text classifier with extremely small datasets
Stars: ✭ 34 (+78.95%)
Mutual labels:  datasets
TSForecasting
This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.
Stars: ✭ 53 (+178.95%)
Mutual labels:  datasets
PharmacoGx
R package to analyze large-scale pharmacogenomic datasets.
Stars: ✭ 42 (+121.05%)
Mutual labels:  datasets
ck-env
CK repository with components and automation actions to enable portable workflows across diverse platforms including Linux, Windows, MacOS and Android. It includes software detection plugins and meta packages (code, data sets, models, scripts, etc) with the possibility of multiple versions to co-exist in a user or system environment:
Stars: ✭ 67 (+252.63%)
Mutual labels:  datasets
recurrent-defocus-deblurring-synth-dual-pixel
Reference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (+57.89%)
Mutual labels:  datasets
disent
🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (+115.79%)
Mutual labels:  datasets
ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Stars: ✭ 46 (+142.11%)
Mutual labels:  datasets

DataBrewer Recipes

Documentation Status

DataBrewer Recipes Repository.

What is this?

This is a collection of dataset recipes, that is, a simple description of where to find existing datasets archives.

The recipes itself are licensed under MIT license. Each dataset may have its own licensing and usage restrictions.

This recipes are used by the databrewer tool. See https://github.com/rolando/databrewer

Contributing

You can contribute in several ways, for example:

Recipes Guidelines

  • The name fields must be all lowercase and separated by dashes (if needed).
  • Brackets can be used to group subsets of files within the dataset.
  • Single-file datasets can use the url field.
  • If dataset comes from a dataset repository or single entity, a short prefix should be added to the name (i.e.: fte-<name> for FiveThirty datasets).
  • If a dataset has a download page but is not available for direct downloading, the field restricted must be set to true.

Example recipes:

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].