Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai

Stars: ✭ 4,003 (+1339.93%)

Mutual labels: datasets

newsletter-archive

Markdown archive & RSS/Atom feeds for Data Is Plural.

Stars: ✭ 65 (-76.62%)

Mutual labels: datasets

open-discourse

Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).

Stars: ✭ 47 (-83.09%)

Mutual labels: corpus

huozi.js

A simple typography engine for CJK languages, designed especially for rich text in games.

Stars: ✭ 135 (-51.44%)

Mutual labels: chinese

FewCLUE

FewCLUE: a few-shot learning evaluation benchmark, Chinese edition.

Stars: ✭ 251 (-9.71%)

Mutual labels: chinese

Species-Names-Corpus

Species names corpus: plant names and animal names.

Stars: ✭ 23 (-91.73%)

Mutual labels: corpus

TSForecasting

This repository contains the implementations for experiments on a set of publicly available datasets used in time series forecasting research.

Stars: ✭ 53 (-80.94%)

Mutual labels: datasets

Korpora

Korean corpus repository

Stars: ✭ 270 (-2.88%)

Mutual labels: corpus

Roapi

Create full-fledged APIs for static datasets without writing a single line of code.

Stars: ✭ 253 (-8.99%)

Mutual labels: datasets

datasets

TFDS data loaders for sign language datasets.

Stars: ✭ 17 (-93.88%)

Mutual labels: datasets

dialogue-datasets

A collection of open dialogue corpora and some useful data-processing utilities.

Stars: ✭ 24 (-91.37%)

Mutual labels: corpus

wordfish-python

Extract relationships from standardized terms in a corpus of interest using deep learning 🐟

Stars: ✭ 19 (-93.17%)

Mutual labels: corpus

dbcollection

A collection of popular datasets for deep learning.

Stars: ✭ 26 (-90.65%)

Mutual labels: datasets

English-level-up-tips-for-Chinese

An advanced guide to learning English that might benefit you a lot 🎉

Stars: ✭ 23,212 (+8249.64%)

Mutual labels: chinese

Xmorse

🌞 ~1.5Kb Morse code library for all. A JavaScript library that supports Morse code encoding of Unicode Chinese text.

Stars: ✭ 266 (-4.32%)

Mutual labels: chinese

NetEmb-Datasets

A collection of real-world networks/graphs for Network Embedding

Stars: ✭ 18 (-93.53%)

Mutual labels: datasets

awesome-hokchew

A curated list of resources about the Hokchew / Foochow language (the Fuzhou dialect of Eastern Min).

Stars: ✭ 16 (-94.24%)

Mutual labels: chinese

databrewer-recipes

DataBrewer Recipes Repository.

Stars: ✭ 19 (-93.17%)

Mutual labels: datasets

Meglass

An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.

Stars: ✭ 281 (+1.08%)

Mutual labels: datasets

recurrent-defocus-deblurring-synth-dual-pixel

Reference GitHub repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…

Stars: ✭ 30 (-89.21%)

Mutual labels: datasets

Medical-Names-Corpus

Medical corpus: a corpus of medical institution names, plus drug standard codes.

Stars: ✭ 26 (-90.65%)

Mutual labels: corpus

podium

Podium: a framework agnostic Python NLP library for data loading and preprocessing

Stars: ✭ 55 (-80.22%)

Mutual labels: datasets

Php Best Practices Zh cn

PHP Best Practices (Chinese translation)

Stars: ✭ 261 (-6.12%)

Mutual labels: chinese

disent

🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib

Stars: ✭ 41 (-85.25%)

Mutual labels: datasets

Indian ParallelCorpus

Curated list of publicly available parallel corpora for Indian languages

Stars: ✭ 23 (-91.73%)

Mutual labels: corpus

DeepSentiPers

Repository for the experiments described in the paper named "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"

Stars: ✭ 17 (-93.88%)

Mutual labels: corpus

Overview

The history, current state, and outlook of programming in Chinese. Related questions are discussed in the issues.

Stars: ✭ 282 (+1.44%)

Mutual labels: chinese

dplace-data

The data repository for the D-PLACE Project (Database of Places, Language, Culture and Environment)

Stars: ✭ 49 (-82.37%)

Mutual labels: datasets

Writing-editing-Network

Code for Paper Abstract Writing through Editing Mechanism