trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+3285.71%)
folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+166.67%)
Nlp chinese corpus
Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+31595.24%)
django-anss-archive
A Django application to archive real-time earthquake notifications from the USGS's Advanced National Seismic System
Stars: ✭ 14 (-33.33%)
tvsub
TVsub: DCU-Tencent Chinese-English Dialogue Corpus
Stars: ✭ 40 (+90.48%)
Briefly
Source-based news in short: Winner @MumbaiHackathon 2018
Stars: ✭ 35 (+66.67%)
german-nouns
A list of ~100,000 German nouns and their grammatical properties, compiled from WiktionaryDE as a CSV file, plus a module to look up the data and parse compound words.
Stars: ✭ 101 (+380.95%)
client-js
Demonstration of using the API in JavaScript
Stars: ✭ 20 (-4.76%)
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (+185.71%)
DANeS
DANeS is an open-source e-newspaper dataset, a collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)
Stars: ✭ 64 (+204.76%)
GNews
A happy and lightweight Python package that provides an API to search for articles on Google News and returns a JSON response.
Stars: ✭ 271 (+1190.48%)
rclc
Rich Context leaderboard competition, including the corpus and current SOTA for required tasks.
Stars: ✭ 20 (-4.76%)
market-monitor
Interactive app to monitor the market using Python
Stars: ✭ 20 (-4.76%)
NasdaqCloudDataService-SDK-Java
Nasdaq Data Link provides a modern and efficient method of delivery for real-time exchange data and other financial information. This repository provides a Java SDK for developing applications using Nasdaq Data Link's real-time data.
Stars: ✭ 70 (+233.33%)
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+0%)
NewsPin
News app for Android using Kotlin, coroutines, MVP architecture
Stars: ✭ 25 (+19.05%)
esapp
An unsupervised Chinese word segmentation tool.
Stars: ✭ 13 (-38.1%)
Chart Tool
A responsive charting application
Stars: ✭ 244 (+1061.9%)
proiel-treebank
Official releases of the PROIEL treebank of ancient Indo-European languages
Stars: ✭ 30 (+42.86%)
Newscatchr
FOSS Android News Reader App
Stars: ✭ 216 (+928.57%)
yap
Yet Another (natural language) Parser
Stars: ✭ 40 (+90.48%)
clickbait-workshop
PyData 2017 workshop: build a clickbait detector with Python
Stars: ✭ 13 (-38.1%)
RssNewsAPI
Free News API for fetching and categorizing news articles
Stars: ✭ 13 (-38.1%)
vuejs-news
Single page app that pulls in news from NYTimes
Stars: ✭ 19 (-9.52%)
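ncovis-2020
A visualization platform for COVID-19 public opinion and news. It won a bronze award in the epidemic visualization competition organized by the China Computer Federation, Alibaba Cloud, Synced, and others. 🔥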
Stars: ✭ 37 (+76.19%)
pylangacq
Language Acquisition Research Tools
Stars: ✭ 33 (+57.14%)
ariel-news-app
News App developed with Flutter featuring beautiful UI, category-based news, story for faster news reading, inbuilt article viewer, share feature, and more.
Stars: ✭ 31 (+47.62%)
the-stack
Website and datasets for The Stack, Daily Bruin's data journalism and newsroom tech blog.
Stars: ✭ 26 (+23.81%)
savepagenow
A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
Stars: ✭ 140 (+566.67%)
Android-Weekly
Android Weekly is a free newsletter that helps you to stay cutting-edge with your Android Development. The newsletter comes once a week and covers a broad range of topics like tutorials, screencasts, news... just everything that's awesome in the Android Development world!
Stars: ✭ 66 (+214.29%)
spark-gdelt
Binding the GDELT universe in a Spark environment
Stars: ✭ 20 (-4.76%)
upnews
Display news and update outdated GitHub R packages
Stars: ✭ 25 (+19.05%)
feed2email
RSS/Atom feed updates in your email
Stars: ✭ 37 (+76.19%)
TradeTheEvent
Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading" in Findings of ACL 2021
Stars: ✭ 64 (+204.76%)
bllip-parser
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser). See http://pypi.python.org/pypi/bllipparser/ for the Python module.
Stars: ✭ 217 (+933.33%)
newsdash
A news dashboard inspired by iGoogle and Netvibes
Stars: ✭ 44 (+109.52%)
Cms
News Management System Written In PHP
Stars: ✭ 245 (+1066.67%)
nearo
🔥 Nearo: A React.js app for local selling, buying, and news
Stars: ✭ 40 (+90.48%)
Awesome Openbsd
A curated list of awesome OpenBSD resources
Stars: ✭ 228 (+985.71%)
OpenConvert
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats (TEI or FoLiA)
Stars: ✭ 20 (-4.76%)
Marquee Scroller
Marquee Scroller Clock News Weather and More
Stars: ✭ 211 (+904.76%)
getNews
Internet news recommendation system (myNews): an entry in the enterprise-proposed track of the 2016 National Computer Design Competition
Stars: ✭ 43 (+104.76%)
habbo-downloader
⚡ A tiny script to download various files directly from Habbo.
Stars: ✭ 47 (+123.81%)
ocr2text
Convert a PDF via OCR to a TXT file in UTF-8 encoding
Stars: ✭ 90 (+328.57%)