squad-v1.1-pt
Portuguese translation of the SQuAD dataset
Stars: ✭ 13 (-50%)
cord19q
COVID-19 Open Research Dataset (CORD-19) Analysis
Stars: ✭ 54 (+107.69%)
rid-covid
Image-based COVID-19 diagnosis. Links to software, data, and other resources.
Stars: ✭ 74 (+184.62%)
BugZoo
Keep your bugs contained. A platform for studying historical software bugs.
Stars: ✭ 49 (+88.46%)
roco-dataset
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
Stars: ✭ 38 (+46.15%)
MICCAI21 MMQ
Multiple Meta-model Quantifying for Medical Visual Question Answering
Stars: ✭ 16 (-38.46%)
survey kit
Flutter library to create beautiful surveys (aligned with ResearchKit on iOS)
Stars: ✭ 68 (+161.54%)
pump-and-dump-dataset
Additional material for paper: Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations, ICCCN '20
Stars: ✭ 66 (+153.85%)
CBLUE
Chinese medical information processing benchmark. CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+1357.69%)
dialogue-datasets
Collects open dialogue corpora and some useful data processing utilities.
Stars: ✭ 24 (-7.69%)
tracing-vs-freehand
Tracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
Stars: ✭ 21 (-19.23%)
fuse-med-ml
A Python framework accelerating ML-based discovery in the medical field by encouraging code reuse. Batteries included :)
Stars: ✭ 66 (+153.85%)
BIRL
BIRL: Benchmark on Image Registration methods with Landmark validations
Stars: ✭ 66 (+153.85%)
django-serializable-model
Django classes to make your models, managers, and querysets serializable, with built-in support for related objects in ~150 LoC
Stars: ✭ 15 (-42.31%)
humanapi
The easiest way to integrate health data from anywhere - https://www.humanapi.co
Stars: ✭ 21 (-19.23%)
LanguageCodes
We present a list of languages with their codes, families, regions, etc. We also present a list of multilingual corpora (with URLs).
Stars: ✭ 70 (+169.23%)
DICOM.jl
Julia package for reading and writing DICOM (Digital Imaging and Communications in Medicine) files
Stars: ✭ 45 (+73.08%)
ChRIS ui
UI for ChRIS
Stars: ✭ 20 (-23.08%)
thaigov-corpus
Project to collect news from Thai government websites
Stars: ✭ 19 (-26.92%)
fastmorph
Fast corpus search engine originally made for the Corpus of Written Tatar language
Stars: ✭ 14 (-46.15%)
dicom
C++11 and Boost-based implementation of the DICOM standard.
Stars: ✭ 14 (-46.15%)
When-in-Rome
A meta-corpus of functional harmonic analysis.
Stars: ✭ 35 (+34.62%)
Indian ParallelCorpus
Curated list of publicly available parallel corpora for Indian languages
Stars: ✭ 23 (-11.54%)
malay-dataset
Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+626.92%)
aarogya seva
A beautiful 😍 COVID-19 app with self-assessment and more.
Stars: ✭ 118 (+353.85%)
gum
Repository for the Georgetown University Multilayer Corpus (GUM)
Stars: ✭ 71 (+173.08%)
MAxO
Medical action ontology
Stars: ✭ 26 (+0%)
deid
Best-effort anonymization for medical images using Python
Stars: ✭ 108 (+315.38%)
HJDataset
A Large Dataset of Historical Japanese Documents with Complex Layouts
Stars: ✭ 19 (-26.92%)
nnDetection
nnDetection is a self-configuring framework for 3D (volumetric) medical object detection which can be applied to new data sets without manual intervention. It includes guides for 12 data sets that were used to develop and evaluate the performance of the proposed method.
Stars: ✭ 355 (+1265.38%)
OpenConvert
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats (TEI or FoLiA)
Stars: ✭ 20 (-23.08%)
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-19.23%)
HAR
Recognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-30.77%)
CODER
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Stars: ✭ 24 (-7.69%)
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+2634.62%)
folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+115.38%)
tvsub
TVsub: DCU-Tencent Chinese-English Dialogue Corpus
Stars: ✭ 40 (+53.85%)
OTT-QA
Code and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (+253.85%)
MaskedFaceRepresentation
Masked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.
Stars: ✭ 17 (-34.62%)
open-discourse
Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
Stars: ✭ 47 (+80.77%)
thai-language
Computer tools for the Thai language
Stars: ✭ 20 (-23.08%)
JSON-path
Find the path of a key / value in a JSON hierarchy easily.
Stars: ✭ 88 (+238.46%)
Dermatron
Dermatology-focused medical records software, augmented with computer vision and artificial intelligence [Meteor packaged with Electron]
Stars: ✭ 19 (-26.92%)
proiel-treebank
Official releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (+15.38%)
ACVR2017
An Innovative Salient Object Detection Using Center-Dark Channel Prior
Stars: ✭ 20 (-23.08%)