Cmu MultimodalsdkCMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.
Stars: ✭ 388 (+37.59%)
StrayVisualizerVisualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html
Stars: ✭ 30 (-89.36%)
squad-v1.1-ptPortuguese translation of the SQuAD dataset
Stars: ✭ 13 (-95.39%)
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-93.62%)
user qualityDataset for Software Evolution and Quality Improvement
Stars: ✭ 27 (-90.43%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-9.57%)
snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-80.14%)
Datagear数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Stars: ✭ 266 (-5.67%)
Open-korean-corporaOpen Korean NLP Dataset Curation for the Users All Around the Globe
Stars: ✭ 82 (-70.92%)
block-alignerSIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (-79.43%)
lexicon-mono-seqDOM Text Based Multiple Sequence Alignment Library
Stars: ✭ 15 (-94.68%)
Awesome MsrA curated repository of software engineering repository mining data sets
Stars: ✭ 257 (-8.87%)
MaskedFaceRepresentationMasked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
Stars: ✭ 17 (-93.97%)
Semantic Kitti ApiSemanticKITTI API for visualizing dataset, processing data, and evaluating results.
Stars: ✭ 272 (-3.55%)
Audio-Classification-using-CNN-MLPMulti class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to identify sound of a bee, cricket or noise.
Stars: ✭ 36 (-87.23%)
dbcollectionA collection of popular datasets for deep learning.
Stars: ✭ 26 (-90.78%)
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (-2.13%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-89.36%)
ctableC library to print nicely formatted tables
Stars: ✭ 13 (-95.39%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1319.5%)
OTT-QACode and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (-67.38%)
visualqcVisualQC : assistive tool to ease the quality control workflow of neuroimaging data.
Stars: ✭ 56 (-80.14%)
LocARNAAlignment of RNAs
Stars: ✭ 15 (-94.68%)
bs3BS-Seeker3: An Ultra-fast, Versatile Pipeline for Mapping Bisulfite-treated Reads.
Stars: ✭ 20 (-92.91%)
Dataset ApiThe ApolloScape Open Dataset for Autonomous Driving and its Application.
Stars: ✭ 260 (-7.8%)
HJDatasetA Large Dataset of Historical Japanese Documents with Complex Layouts
Stars: ✭ 19 (-93.26%)
Face Everthingface detection alignment recognition reconstruction ...
Stars: ✭ 257 (-8.87%)
mxmortalitydbA data only R package containing all injury intent deaths registered in Mexico from 2004 to 2019
Stars: ✭ 20 (-92.91%)
Fx 1 Minute DataHISTDATA - Full Dataset composed of 68 FX trading pairs / Simple API to retrieve 1 Minute data Historical FX Prices (up to June 2019).
Stars: ✭ 278 (-1.42%)
pump-and-dump-datasetAdditional material for paper: Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations, ICCCN '20
Stars: ✭ 66 (-76.6%)
NLPrep🍳 NLPrep - dataset tool for many natural language processing task
Stars: ✭ 26 (-90.78%)
BIRLBIRL: Benchmark on Image Registration methods with Landmark validations
Stars: ✭ 66 (-76.6%)
Covid19canadaEpidemiological Data from the COVID-19 Epidemic in Canada
Stars: ✭ 272 (-3.55%)
icedataIceData: Datasets Hub for the *IceVision* Framework
Stars: ✭ 41 (-85.46%)
MeglassAn eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (-0.35%)
ACVR2017An Innovative Salient Object Detection Using Center-Dark Channel Prior
Stars: ✭ 20 (-92.91%)
Facenet-Caffefacenet recognition and retrieve by using hnswlib and flask, convert tensorflow model to caffe
Stars: ✭ 30 (-89.36%)
TVQAplus[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
Stars: ✭ 99 (-64.89%)
Ergo🧠 A tool that makes AI easier.
Stars: ✭ 264 (-6.38%)
climateRAn R 📦 for getting point and gridded climate data by AOI
Stars: ✭ 93 (-67.02%)
AITQAresources for the IBM Airlines Table-Question-Answering Benchmark
Stars: ✭ 12 (-95.74%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-3.19%)
astroalignA tool to align astronomical images based on asterism matching
Stars: ✭ 102 (-63.83%)
tracing-vs-freehandTracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
Stars: ✭ 21 (-92.55%)
pblatparallelized blat with multi-threads support
Stars: ✭ 34 (-87.94%)
Game Datasets🎮 A curated list of awesome game datasets, and tools to artificial intelligence in games
Stars: ✭ 261 (-7.45%)
figpatchEasily Arrange Images with Patchwork Alongside ggplot2 Figures.
Stars: ✭ 46 (-83.69%)
BugZooKeep your bugs contained. A platform for studying historical software bugs.
Stars: ✭ 49 (-82.62%)
Tehran StocksA python package to access tsetmc data
Stars: ✭ 282 (+0%)
Exclusively Dark Image DatasetExclusively Dark (ExDARK) dataset which to the best of our knowledge, is the largest collection of low-light images taken in very low-light environments to twilight (i.e 10 different conditions) to-date with image class and object level annotations.
Stars: ✭ 274 (-2.84%)
JschemaA simple, easy to use data modeling framework for JavaScript
Stars: ✭ 261 (-7.45%)