Preprocessing pipeline on Brain MR Images through FSL and ANTs, including registration, skull-stripping, bias field correction, enhancement and segmentation.

Stars: ✭ 107 (+94.55%)

Mutual labels: preprocessing

ColdStorage

Lightweight data loading and caching library for android

Stars: ✭ 39 (-29.09%)

Mutual labels: data-loading

text-normalizer

Normalize text string

Stars: ✭ 12 (-78.18%)

Mutual labels: preprocessing

farabio

🤖 PyTorch toolkit for biomedical imaging ❤️

Stars: ✭ 48 (-12.73%)

Mutual labels: datasets

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Stars: ✭ 53 (-3.64%)

Mutual labels: datasets

covid19-datasets

A list of high quality open datasets for COVID-19 data analysis

Stars: ✭ 56 (+1.82%)

Mutual labels: datasets

text-classification-small-datasets

Building a text classifier with extremely small datasets

Stars: ✭ 34 (-38.18%)

Mutual labels: datasets

Few-Shot-Intent-Detection

Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.

Stars: ✭ 63 (+14.55%)

Mutual labels: datasets

pywedge

Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking

Stars: ✭ 49 (-10.91%)

Mutual labels: preprocessing

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+25118.18%)

Mutual labels: datasets

systematic-review-datasets

A collection of fully labeled systematic review datasets (title-abstract screening)

Stars: ✭ 25 (-54.55%)

Mutual labels: datasets

AIODrive

Official Python/PyTorch Implementation for "All-In-One Drive: A Large-Scale Comprehensive Perception Dataset with High-Density Long-Range Point Clouds"

Stars: ✭ 32 (-41.82%)

Mutual labels: datasets

Dataset-Sentimen-Analisis-Bahasa-Indonesia

Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…

Stars: ✭ 38 (-30.91%)

Mutual labels: datasets

SER-datasets

A collection of datasets for the purpose of emotion recognition/detection in speech.

Stars: ✭ 74 (+34.55%)

Mutual labels: datasets

PharmacoDB

Search across publicly available datasets to find instances where a drug or cell line of interest has been profiled.

Stars: ✭ 38 (-30.91%)

Mutual labels: datasets

postcss-each

PostCSS plugin to iterate through values

Stars: ✭ 93 (+69.09%)

Mutual labels: preprocessing

traj-pred-irl

Official implementation codes of "Regularizing neural networks for future trajectory prediction via IRL framework"

Stars: ✭ 23 (-58.18%)

Mutual labels: datasets

bnk48 photo datasets

BNK48 Photo Datasets

Stars: ✭ 12 (-78.18%)

Mutual labels: datasets

napkinXC

Extremely simple and fast extreme multi-class and multi-label classifiers.

Stars: ✭ 38 (-30.91%)

Mutual labels: datasets

dplace-data

The data repository for the D-PLACE Project (Database of Places, Language, Culture and Environment)

Stars: ✭ 49 (-10.91%)

Mutual labels: datasets

HINT3

This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020's Insights Workshop https://insights-workshop.github.io/ Preprint for the paper is available here https://arxiv.org/abs/2009.13833

Stars: ✭ 27 (-50.91%)

Mutual labels: datasets

panoptic parts

This repository contains code and tools for reading, processing, evaluating on, and visualizing Panoptic Parts datasets. Moreover, it contains code for reproducing our CVPR 2021 paper results.

Stars: ✭ 82 (+49.09%)

Mutual labels: datasets

Text-Summarization-Repo

텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model 및 data 등을 추천 자료와 함께 정리한 저장소입니다.

Stars: ✭ 213 (+287.27%)

Mutual labels: datasets

dropEst

Pipeline for initial analysis of droplet-based single-cell RNA-seq data

Stars: ✭ 71 (+29.09%)

Mutual labels: preprocessing

Three-Filters-to-Normal

Three-Filters-to-Normal: An Accurate and Ultrafast Surface Normal Estimator (RAL+ICRA'21)

Stars: ✭ 41 (-25.45%)

Mutual labels: datasets

veridical-flow

Making it easier to build stable, trustworthy data-science pipelines.

Stars: ✭ 28 (-49.09%)

Mutual labels: preprocessing

masader

The largest public catalogue for Arabic NLP and speech datasets. There are +250 datasets annotated with more than 25 attributes.

Stars: ✭ 66 (+20%)

Mutual labels: datasets

TSForecasting

This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.

Stars: ✭ 53 (-3.64%)

Mutual labels: datasets

Clustering-Datasets

This repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.

Stars: ✭ 189 (+243.64%)

Mutual labels: datasets

awesome-sweden-datasets

A curated list of awesome datasets to use when coding for the Swedish market.

Stars: ✭ 17 (-69.09%)

Mutual labels: datasets

akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

Stars: ✭ 5,155 (+9272.73%)

Mutual labels: datasets

ck-env

CK repository with components and automation actions to enable portable workflows across diverse platforms including Linux, Windows, MacOS and Android. It includes software detection plugins and meta packages (code, data sets, models, scripts, etc) with the possibility of multiple versions to co-exist in a user or system environment:

Stars: ✭ 67 (+21.82%)

Mutual labels: datasets

cmip6 preprocessing

Analysis ready CMIP6 data in python the easy way with pangeo tools.

Stars: ✭ 126 (+129.09%)

Mutual labels: preprocessing

Data-Science-and-Machine-Learning-Resources

List of Data Science and Machine Learning Resource that I frequently use

Stars: ✭ 19 (-65.45%)

Mutual labels: datasets

awesome-forests

🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.

Stars: ✭ 111 (+101.82%)

Mutual labels: datasets

ml4se

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

Stars: ✭ 46 (-16.36%)

Mutual labels: datasets

dataset

dataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections

Stars: ✭ 21 (-61.82%)

Mutual labels: datasets

PharmacoGx

R package to analyze large-scale pharmacogenomic datasets.

Stars: ✭ 42 (-23.64%)

Mutual labels: datasets

SeqTools

A python library to manipulate and transform indexable data (lists, arrays, ...)

Stars: ✭ 42 (-23.64%)

Mutual labels: preprocessing

skippa

SciKIt-learn Pipeline in PAndas

Stars: ✭ 33 (-40%)

Mutual labels: preprocessing

extra keras datasets

📃🎉 Additional datasets for tensorflow.keras

Stars: ✭ 20 (-63.64%)

Mutual labels: datasets

disent

🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib

Stars: ✭ 41 (-25.45%)

Mutual labels: datasets

preprocess-conll05

Scripts for preprocessing the CoNLL-2005 SRL dataset.

Stars: ✭ 17 (-69.09%)

Mutual labels: preprocessing

opendatasets

A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.

Stars: ✭ 161 (+192.73%)

Mutual labels: datasets

RData.jl

Read R data files from Julia