Ergo🧠 A tool that makes AI easier.
Stars: ✭ 264 (-69.59%)
Bert Ner PytorchChinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
Stars: ✭ 654 (-24.65%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-97.58%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+361.18%)
YeddaYEDDA: A Lightweight Collaborative Text Span Annotation Tool. Code for ACL 2018 Best Demo Paper Nomination.
Stars: ✭ 704 (-18.89%)
Css10CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Stars: ✭ 302 (-65.21%)
verseagilityRamp up your custom natural language processing (NLP) task, allowing you to bring your own data, use your preferred frameworks and bring models into production.
Stars: ✭ 23 (-97.35%)
DictChinese and English translation tools in the command line(命令行下中英文翻译工具)
Stars: ✭ 243 (-72%)
Nlp Interview Notes本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。
Stars: ✭ 207 (-76.15%)
Python Benedictdict subclass with keylist/keypath support, I/O shortcuts (base64, csv, json, pickle, plist, query-string, toml, xml, yaml) and many utilities. 📘
Stars: ✭ 204 (-76.5%)
FacerankFaceRank - Rank Face by CNN Model based on TensorFlow (add keras version). FaceRank-人脸打分基于 TensorFlow (新增 Keras 版本) 的 CNN 模型(QQ群:167122861)。技术支持:http://tensorflow123.com
Stars: ✭ 841 (-3.11%)
AddictThe Python Dict that's better than heroin.
Stars: ✭ 2,141 (+146.66%)
Dataset ApiThe ApolloScape Open Dataset for Autonomous Driving and its Application.
Stars: ✭ 260 (-70.05%)
IoDataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
Stars: ✭ 427 (-50.81%)
Devblogs+2600 developer-related blogs and publications.
Stars: ✭ 637 (-26.61%)
DictfierPython library to convert/serialize class instances(Objects) both flat and nested into a dictionary data structure. It's very useful in converting Python Objects into JSON format
Stars: ✭ 67 (-92.28%)
dbcollectionA collection of popular datasets for deep learning.
Stars: ✭ 26 (-97%)
DatasetsTFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Stars: ✭ 3,094 (+256.45%)
CorporaA collection of small corpuses of interesting data for the creation of bots and similar stuff.
Stars: ✭ 4,293 (+394.59%)
TextData loaders and abstractions for text and NLP
Stars: ✭ 2,915 (+235.83%)
icedataIceData: Datasets Hub for the *IceVision* Framework
Stars: ✭ 41 (-95.28%)
Cocostuff10kThe official homepage of the (outdated) COCO-Stuff 10K dataset.
Stars: ✭ 248 (-71.43%)
Osint collectionMaintained collection of OSINT related resources. (All Free & Actionable)
Stars: ✭ 809 (-6.8%)
RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (-72.24%)
Covid 19 Repo DataData archive of identifiable COVID-19 related public projects on GitHub
Stars: ✭ 236 (-72.81%)
Indian ParallelCorpusCurated list of publicly available parallel corpus for Indian Languages
Stars: ✭ 23 (-97.35%)
DataladKeep code, data, containers under control with git and git-annex
Stars: ✭ 234 (-73.04%)
Esc 50ESC-50: Dataset for Environmental Sound Classification
Stars: ✭ 631 (-27.3%)
Structured3d[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Stars: ✭ 224 (-74.19%)
tracing-vs-freehandTracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
Stars: ✭ 21 (-97.58%)
Stocknet DatasetA comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: ✭ 228 (-73.73%)
Imdb FaceA new large-scale noise-controlled face recognition dataset.
Stars: ✭ 399 (-54.03%)
TorchdataPyTorch dataset extended with map, cache etc. (tensorflow.data like)
Stars: ✭ 226 (-73.96%)
OTT-QACode and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"
Stars: ✭ 92 (-89.4%)
Imagenetscraper👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
Stars: ✭ 24 (-97.24%)
CollectionCollection Data for Cooper Hewitt, Smithsonian Design Museum
Stars: ✭ 214 (-75.35%)
covid19-data-greeceDatasets and analysis of Novel Coronavirus (COVID-19) outbreak in Greece
Stars: ✭ 16 (-98.16%)
DatatableA go in-memory table
Stars: ✭ 215 (-75.23%)
Cmu MultimodalsdkCMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.
Stars: ✭ 388 (-55.3%)
DialogrptEMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
Stars: ✭ 216 (-75.12%)
Ava downloader⏬ Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
Stars: ✭ 214 (-75.35%)
Gensim DataData repository for pretrained NLP models and NLP corpora.
Stars: ✭ 622 (-28.34%)
RerankNERNeural Reranking for Named Entity Recognition, accepted as regular paper at RANLP 2017
Stars: ✭ 22 (-97.47%)
DatasetsA repository of pretty cool datasets that I collected for network science and machine learning research.
Stars: ✭ 302 (-65.21%)
troveWeakly supervised medical named entity classification
Stars: ✭ 55 (-93.66%)
django-serializable-modelDjango classes to make your models, managers, and querysets serializable, with built-in support for related objects in ~150 LoC
Stars: ✭ 15 (-98.27%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+545.16%)
Bert seq2seqpytorch实现bert做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持GPT2进行文章续写。
Stars: ✭ 298 (-65.67%)
TV4DialogNo description or website provided.
Stars: ✭ 33 (-96.2%)
LanguageCodesWe present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).
Stars: ✭ 70 (-91.94%)