All Projects → Cluecorpus2020 → Similar Projects or Alternatives

605 Open source projects that are alternatives of or similar to Cluecorpus2020

Cluedatasetsearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Stars: ✭ 2,112 (+659.71%)
Mutual labels:  chinese, datasets, corpus
CLUEmotionAnalysis2020
CLUE Emotion Analysis Dataset 细粒度情感分析数据集
Stars: ✭ 3 (-98.92%)
Mutual labels:  corpus, chinese
Datasets
Poetry-related datasets developed by THUAIPoet (Jiuge) group.
Stars: ✭ 111 (-60.07%)
Mutual labels:  chinese, corpus
Clue
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+772.3%)
Mutual labels:  chinese, corpus
Chinese Nlp Corpus
Collections of Chinese NLP corpus
Stars: ✭ 438 (+57.55%)
Mutual labels:  datasets, corpus
Weibo terminater
Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator
Stars: ✭ 2,295 (+725.54%)
Mutual labels:  chinese, corpus
Cluepretrainedmodels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
Stars: ✭ 493 (+77.34%)
Mutual labels:  chinese, corpus
TV4Dialog
No description or website provided.
Stars: ✭ 33 (-88.13%)
Mutual labels:  corpus, chinese
open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
Stars: ✭ 65 (-76.62%)
Mutual labels:  corpus, datasets
Nlp chinese corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+2294.24%)
Mutual labels:  chinese, corpus
CBLUE
中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+36.33%)
Mutual labels:  corpus, chinese
OpenDialog
An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)
Stars: ✭ 94 (-66.19%)
Mutual labels:  corpus, chinese
covid-19-data-cleanup
Scripts to cleanup data from https://github.com/CSSEGISandData/COVID-19
Stars: ✭ 25 (-91.01%)
Mutual labels:  datasets
sqlmap-wiki-zhcn
可能是最完整的 sqlmap 中文文档。
Stars: ✭ 51 (-81.65%)
Mutual labels:  chinese
hkcs
香港民間字集 Hong Kong Character Set Project (HKCS)
Stars: ✭ 29 (-89.57%)
Mutual labels:  chinese
fastmorph
Fast corpus search engine originally made for the Corpus of Written Tatar language
Stars: ✭ 14 (-94.96%)
Mutual labels:  corpus
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1339.93%)
Mutual labels:  datasets
newsletter-archive
Markdown archive & RSS/Atom feeds for Data Is Plural.
Stars: ✭ 65 (-76.62%)
Mutual labels:  datasets
open-discourse
Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
Stars: ✭ 47 (-83.09%)
Mutual labels:  corpus
huozi.js
A simple typography engine for CJK languages, especially designed for game rich-text. 用于游戏富文本的中日韩文字排印引擎。
Stars: ✭ 135 (-51.44%)
Mutual labels:  chinese
FewCLUE
FewCLUE 小样本学习测评基准,中文版
Stars: ✭ 251 (-9.71%)
Mutual labels:  chinese
Species-Names-Corpus
物种名称语料库。植物名,动物名。
Stars: ✭ 23 (-91.73%)
Mutual labels:  corpus
TSForecasting
This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.
Stars: ✭ 53 (-80.94%)
Mutual labels:  datasets
Korpora
Korean corpus repository
Stars: ✭ 270 (-2.88%)
Mutual labels:  corpus
Roapi
Create full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (-8.99%)
Mutual labels:  datasets
datasets
TFDS data loaders for sign language datasets.
Stars: ✭ 17 (-93.88%)
Mutual labels:  datasets
dialogue-datasets
collect the open dialog corpus and some useful data processing utils.
Stars: ✭ 24 (-91.37%)
Mutual labels:  corpus
wordfish-python
extract relationships from standardized terms from corpus of interest with deep learning 🐟
Stars: ✭ 19 (-93.17%)
Mutual labels:  corpus
dbcollection
A collection of popular datasets for deep learning.
Stars: ✭ 26 (-90.65%)
Mutual labels:  datasets
English-level-up-tips-for-Chinese
An advanced guide to learn English that might benefit you a lot 🎉 . 可能是让你受益匪浅的英语进阶指南。
Stars: ✭ 23,212 (+8249.64%)
Mutual labels:  chinese
Xmorse
🌞 ~1.5Kb morse code library for all. 一个支持 Unicode 中文摩斯密码编码的 Javascript 库。
Stars: ✭ 266 (-4.32%)
Mutual labels:  chinese
NetEmb-Datasets
A collection of real-world networks/graphs for Network Embedding
Stars: ✭ 18 (-93.53%)
Mutual labels:  datasets
awesome-hokchew
A curated list of resources about the Hokchew / Foochow language. 閩東語福州話的資源整合列表。
Stars: ✭ 16 (-94.24%)
Mutual labels:  chinese
databrewer-recipes
DataBrewer Recipes Repository.
Stars: ✭ 19 (-93.17%)
Mutual labels:  datasets
Meglass
An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (+1.08%)
Mutual labels:  datasets
recurrent-defocus-deblurring-synth-dual-pixel
Reference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-89.21%)
Mutual labels:  datasets
Medical-Names-Corpus
医疗语料库。医疗机构名语料库。药品本位码。
Stars: ✭ 26 (-90.65%)
Mutual labels:  corpus
podium
Podium: a framework agnostic Python NLP library for data loading and preprocessing
Stars: ✭ 55 (-80.22%)
Mutual labels:  datasets
Php Best Practices Zh cn
PHP Best Practices(中译版)
Stars: ✭ 261 (-6.12%)
Mutual labels:  chinese
disent
🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (-85.25%)
Mutual labels:  datasets
Indian ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
Stars: ✭ 23 (-91.73%)
Mutual labels:  corpus
DeepSentiPers
Repository for the experiments described in the paper named "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"
Stars: ✭ 17 (-93.88%)
Mutual labels:  corpus
Overview
中文编程的历史、现状和展望。issue 中进行相关问题的讨论.
Stars: ✭ 282 (+1.44%)
Mutual labels:  chinese
dplace-data
The data repository for the D-PLACE Project (Database of Places, Language, Culture and Environment)
Stars: ✭ 49 (-82.37%)
Mutual labels:  datasets
Writing-editing-Network
Code for Paper Abstract Writing through Editing Mechanism
Stars: ✭ 72 (-74.1%)
Mutual labels:  datasets
Filipino-Text-Benchmarks
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Stars: ✭ 22 (-92.09%)
Mutual labels:  corpus
download audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-80.94%)
Mutual labels:  datasets
Swift
swift 上手开发APP必备
Stars: ✭ 257 (-7.55%)
Mutual labels:  chinese
EdgarAllanPoetry
Computer-generated poetry
Stars: ✭ 22 (-92.09%)
Mutual labels:  corpus
cn.jenkins.io
Chinese version of the website
Stars: ✭ 30 (-89.21%)
Mutual labels:  chinese
ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Stars: ✭ 46 (-83.45%)
Mutual labels:  datasets
rust-pinyin
汉字转拼音
Stars: ✭ 111 (-60.07%)
Mutual labels:  chinese
rakutenma-python
Rakuten MA (Python version)
Stars: ✭ 15 (-94.6%)
Mutual labels:  chinese
opendatasets
A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
Stars: ✭ 161 (-42.09%)
Mutual labels:  datasets
Kartaslov
Stars: ✭ 270 (-2.88%)
Mutual labels:  datasets
Fakenewscorpus
A dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (-8.27%)
Mutual labels:  corpus
CSharpNamingGuidelines
C#命名规范中文版/C#编码规范中文版
Stars: ✭ 30 (-89.21%)
Mutual labels:  chinese
fuzzing-corpus
My fuzzing corpus
Stars: ✭ 120 (-56.83%)
Mutual labels:  corpus
SpiCE-Corpus
An open-access corpus of conversational bilingual speech in Cantonese and English
Stars: ✭ 33 (-88.13%)
Mutual labels:  corpus
Chinese-Word-Segmentation-in-NLP
State of the art Chinese Word Segmentation with Bi-LSTMs
Stars: ✭ 23 (-91.73%)
Mutual labels:  chinese
1-60 of 605 similar projects