All Projects → Wp2txt → Similar Projects or Alternatives

240 Open source projects that are alternatives of or similar to Wp2txt

Russian news corpus
Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-47.59%)
Mutual labels:  corpus
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (-62.07%)
Mutual labels:  corpus
Pansori
Tools for ASR Corpus Generation from Online Video
Stars: ✭ 106 (-26.9%)
Mutual labels:  corpus
Dataset List
lists of text corpus and more (mainly Japanese)
Stars: ✭ 84 (-42.07%)
Mutual labels:  corpus
Chatterbot Corpus
A multilingual dialog corpus
Stars: ✭ 964 (+564.83%)
Mutual labels:  corpus
Sejong Corpus
Korean sejong corpus download and simple analysis
Stars: ✭ 116 (-20%)
Mutual labels:  corpus
Jwiki
📖 A library for effortlessly interacting with Wikipedia/MediaWiki
Stars: ✭ 69 (-52.41%)
Mutual labels:  wikipedia
Khcoder
KH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (-13.1%)
Mutual labels:  corpus
Mediawiki Extensions Mobilefrontend
This is a mirror from https://gerrit.wikimedia.org. See https://www.mediawiki.org/wiki/Developer_access for contributing.
Stars: ✭ 47 (-67.59%)
Mutual labels:  wikipedia
Apps Android Wikipedia
📱The official Wikipedia app for Android!
Stars: ✭ 1,350 (+831.03%)
Mutual labels:  wikipedia
Pyclue
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
Stars: ✭ 91 (-37.24%)
Mutual labels:  corpus
Apps Android Wikiedudashboard
Access WikiEdu Dashboard from Android App.
Stars: ✭ 20 (-86.21%)
Mutual labels:  wikipedia
Mwoffliner
Scrape any online Mediawiki motorised wiki (like Wikipedia) to your local filesystem
Stars: ✭ 121 (-16.55%)
Mutual labels:  wikipedia
Fabric Gm Wiki
Fabric国密项目 wiki
Stars: ✭ 79 (-45.52%)
Mutual labels:  wikipedia
Awesome Chatbot
Awesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:
Stars: ✭ 1,785 (+1131.03%)
Mutual labels:  corpus
Wikiloop Doublecheck
WikiLoop DoubleCheck: a web tool to help review Wikipedia edits easily and collaboratively.
Stars: ✭ 70 (-51.72%)
Mutual labels:  wikipedia
Datasets
Poetry-related datasets developed by THUAIPoet (Jiuge) group.
Stars: ✭ 111 (-23.45%)
Mutual labels:  corpus
Legislator
Interface to the Comparative Legislators Database
Stars: ✭ 62 (-57.24%)
Mutual labels:  wikipedia
Git Wiki Theme
A revolutionary full-featured wiki for github pages and jekyll. You don't need to compile it!
Stars: ✭ 139 (-4.14%)
Mutual labels:  wikipedia
Weeklypedia
A weekly email update of all the most popular wikipedia articles
Stars: ✭ 50 (-65.52%)
Mutual labels:  wikipedia
Pubmed Rct
PubMed 200k RCT dataset: a large dataset for sequential sentence classification.
Stars: ✭ 101 (-30.34%)
Mutual labels:  corpus
Mitie chinese wikipedia corpus
Pre-trained Wikipedia corpus by MITIE
Stars: ✭ 43 (-70.34%)
Mutual labels:  corpus
Dialog corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
Stars: ✭ 1,662 (+1046.21%)
Mutual labels:  corpus
Wikiforia
A Utility Library for Wikipedia dumps
Stars: ✭ 31 (-78.62%)
Mutual labels:  wikipedia
Wikipedia Tools For Google Spreadsheets
Wikipedia Tools for Google Spreadsheets — Install:
Stars: ✭ 96 (-33.79%)
Mutual labels:  wikipedia
Linq To Wiki
.Net library to access MediaWiki API
Stars: ✭ 93 (-35.86%)
Mutual labels:  wikipedia
Raun
Tool to watch the recent changes of Wikimedia Foundation projects, live.
Stars: ✭ 15 (-89.66%)
Mutual labels:  wikipedia
Isbntools
python app/framework for 'all things ISBN' including metadata, descriptions, covers...
Stars: ✭ 122 (-15.86%)
Mutual labels:  wikipedia
Mediawiki
MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/
Stars: ✭ 89 (-38.62%)
Mutual labels:  wikipedia
Code Docstring Corpus
Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.
Stars: ✭ 137 (-5.52%)
Mutual labels:  corpus
Ja.text8
Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-45.52%)
Mutual labels:  corpus
Mediawiker
Mediawiker is a plugin for Sublime Text editor that adds possibility to use it as Wiki Editor on Mediawiki based sites like Wikipedia and many other.
Stars: ✭ 120 (-17.24%)
Mutual labels:  wikipedia
Microscale
Generated in real-time from random Wikipedia articles, microscale is a web-based, generative album.
Stars: ✭ 76 (-47.59%)
Mutual labels:  wikipedia
Wikit
Wikipedia summaries from the command line
Stars: ✭ 141 (-2.76%)
Mutual labels:  wikipedia
Hovercard
🖱️ Wikipedia summary cards for the web
Stars: ✭ 72 (-50.34%)
Mutual labels:  wikipedia
Colibri Core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-22.76%)
Mutual labels:  corpus
Blacklab
A corpus retrieval engine based on Apache Lucene
Stars: ✭ 69 (-52.41%)
Mutual labels:  corpus
Kiwix Js
Full portable & lightweight ZIM reader in Javascript
Stars: ✭ 130 (-10.34%)
Mutual labels:  wikipedia
Wikimon
A WebSocket-oriented monitor for Wikipedia (also, wikimon, wikital monsters)
Stars: ✭ 63 (-56.55%)
Mutual labels:  wikipedia
Ua Gec
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-25.52%)
Mutual labels:  corpus
Wikipedia ner
📖 Labeled examples from wiki dumps in Python
Stars: ✭ 61 (-57.93%)
Mutual labels:  wikipedia
Ultimate Java Resources
Java programming. All in one Java Resource for learning. Updated every day and up to date. All Algorithms and DS along with Development in Java. Beginner to Advanced. Join the Discord link.
Stars: ✭ 143 (-1.38%)
Mutual labels:  wikipedia
Web Archives
A web archives reader
Stars: ✭ 52 (-64.14%)
Mutual labels:  wikipedia
Wikipediap2p
WikipediaP2P.org Chrome Extension
Stars: ✭ 105 (-27.59%)
Mutual labels:  wikipedia
Wiki Flutter
more than an elegant wikipedia client
Stars: ✭ 48 (-66.9%)
Mutual labels:  wikipedia
Cluedatasetsearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Stars: ✭ 2,112 (+1356.55%)
Mutual labels:  corpus
Tft Overlay Outdated
TFT Overlay - Team and item builder for League of Legends Teamfight Tactics
Stars: ✭ 44 (-69.66%)
Mutual labels:  wikipedia
Refined Wikipedia
Enforces the mobile web version of Wikipedia and improves its interface
Stars: ✭ 98 (-32.41%)
Mutual labels:  wikipedia
Svg World Map
🗺 A JavaScript library to easily integrate one or more SVG world maps with all nations (countries) and second-level political subdivisions (countries, provinces, states).
Stars: ✭ 38 (-73.79%)
Mutual labels:  wikipedia
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-4.14%)
Mutual labels:  corpus
Typing Assistant
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Stars: ✭ 32 (-77.93%)
Mutual labels:  corpus
Lexicon Thai
คลังศัพท์ภาษาไทย
Stars: ✭ 96 (-33.79%)
Mutual labels:  corpus
Kaggle Web Traffic Time Series Forecasting
Solution to Kaggle - Web Traffic Time Series Forecasting
Stars: ✭ 29 (-80%)
Mutual labels:  wikipedia
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-16.55%)
Mutual labels:  corpus
Chi Corpus
迟先生语料库
Stars: ✭ 96 (-33.79%)
Mutual labels:  corpus
Huggle3 Qt Lx
Huggle is an anti-vandalism tool for use on MediaWiki based projects
Stars: ✭ 143 (-1.38%)
Mutual labels:  wikipedia
Clue
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+1572.41%)
Mutual labels:  corpus
Gossiping Chinese Corpus
PTT 八卦版問答中文語料
Stars: ✭ 137 (-5.52%)
Mutual labels:  corpus
Thealgorithms
Algorithms repository.
Stars: ✭ 122 (-15.86%)
Mutual labels:  wikipedia
Wiki Split
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stars: ✭ 95 (-34.48%)
Mutual labels:  wikipedia
1-60 of 240 similar projects