Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-47.59%)
CoarijCorpus of Annual Reports in Japan
Stars: ✭ 55 (-62.07%)
PansoriTools for ASR Corpus Generation from Online Video
Stars: ✭ 106 (-26.9%)
Dataset Listlists of text corpus and more (mainly Japanese)
Stars: ✭ 84 (-42.07%)
Sejong CorpusKorean sejong corpus download and simple analysis
Stars: ✭ 116 (-20%)
Jwiki📖 A library for effortlessly interacting with Wikipedia/MediaWiki
Stars: ✭ 69 (-52.41%)
KhcoderKH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (-13.1%)
Mediawiki Extensions MobilefrontendThis is a mirror from https://gerrit.wikimedia.org. See https://www.mediawiki.org/wiki/Developer_access for contributing.
Stars: ✭ 47 (-67.59%)
PycluePython toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
Stars: ✭ 91 (-37.24%)
MwofflinerScrape any online Mediawiki motorised wiki (like Wikipedia) to your local filesystem
Stars: ✭ 121 (-16.55%)
Awesome ChatbotAwesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:
Stars: ✭ 1,785 (+1131.03%)
Wikiloop DoublecheckWikiLoop DoubleCheck: a web tool to help review Wikipedia edits easily and collaboratively.
Stars: ✭ 70 (-51.72%)
DatasetsPoetry-related datasets developed by THUAIPoet (Jiuge) group.
Stars: ✭ 111 (-23.45%)
LegislatorInterface to the Comparative Legislators Database
Stars: ✭ 62 (-57.24%)
Git Wiki ThemeA revolutionary full-featured wiki for github pages and jekyll. You don't need to compile it!
Stars: ✭ 139 (-4.14%)
WeeklypediaA weekly email update of all the most popular wikipedia articles
Stars: ✭ 50 (-65.52%)
Pubmed RctPubMed 200k RCT dataset: a large dataset for sequential sentence classification.
Stars: ✭ 101 (-30.34%)
Dialog corpus用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
Stars: ✭ 1,662 (+1046.21%)
WikiforiaA Utility Library for Wikipedia dumps
Stars: ✭ 31 (-78.62%)
Linq To Wiki.Net library to access MediaWiki API
Stars: ✭ 93 (-35.86%)
RaunTool to watch the recent changes of Wikimedia Foundation projects, live.
Stars: ✭ 15 (-89.66%)
Isbntoolspython app/framework for 'all things ISBN' including metadata, descriptions, covers...
Stars: ✭ 122 (-15.86%)
MediawikiMediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/
Stars: ✭ 89 (-38.62%)
Code Docstring CorpusPreprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.
Stars: ✭ 137 (-5.52%)
Ja.text8Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-45.52%)
MediawikerMediawiker is a plugin for Sublime Text editor that adds possibility to use it as Wiki Editor on Mediawiki based sites like Wikipedia and many other.
Stars: ✭ 120 (-17.24%)
MicroscaleGenerated in real-time from random Wikipedia articles, microscale is a web-based, generative album.
Stars: ✭ 76 (-47.59%)
WikitWikipedia summaries from the command line
Stars: ✭ 141 (-2.76%)
Hovercard🖱️ Wikipedia summary cards for the web
Stars: ✭ 72 (-50.34%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-22.76%)
BlacklabA corpus retrieval engine based on Apache Lucene
Stars: ✭ 69 (-52.41%)
Kiwix JsFull portable & lightweight ZIM reader in Javascript
Stars: ✭ 130 (-10.34%)
WikimonA WebSocket-oriented monitor for Wikipedia (also, wikimon, wikital monsters)
Stars: ✭ 63 (-56.55%)
Ua GecUA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-25.52%)
Wikipedia ner📖 Labeled examples from wiki dumps in Python
Stars: ✭ 61 (-57.93%)
Ultimate Java ResourcesJava programming. All in one Java Resource for learning. Updated every day and up to date. All Algorithms and DS along with Development in Java. Beginner to Advanced. Join the Discord link.
Stars: ✭ 143 (-1.38%)
Wikipediap2pWikipediaP2P.org Chrome Extension
Stars: ✭ 105 (-27.59%)
Wiki Fluttermore than an elegant wikipedia client
Stars: ✭ 48 (-66.9%)
Tft Overlay OutdatedTFT Overlay - Team and item builder for League of Legends Teamfight Tactics
Stars: ✭ 44 (-69.66%)
Refined WikipediaEnforces the mobile web version of Wikipedia and improves its interface
Stars: ✭ 98 (-32.41%)
Svg World Map🗺 A JavaScript library to easily integrate one or more SVG world maps with all nations (countries) and second-level political subdivisions (countries, provinces, states).
Stars: ✭ 38 (-73.79%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-4.14%)
Typing AssistantTyping Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Stars: ✭ 32 (-77.93%)
Huggle3 Qt LxHuggle is an anti-vandalism tool for use on MediaWiki based projects
Stars: ✭ 143 (-1.38%)
Clue中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+1572.41%)
Wiki SplitOne million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stars: ✭ 95 (-34.48%)