Search
Blue Brain text mining toolbox for semantic search and structured information extraction
Stars: ✭ 26 (+62.5%)
Gspan
Python implementation of frequent subgraph mining algorithm gSpan. Directed graphs are supported.
Stars: ✭ 103 (+543.75%)
ytpriv
YT metadata exporter
Stars: ✭ 28 (+75%)
Papers Literature Ml Dl Rl Ai
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+8281.25%)
Daggy
Daggy - Data Aggregation Utility. Open source, free, cross-platform, server-less, useful utility for remote or local data aggregation and streaming
Stars: ✭ 91 (+468.75%)
Csmath 2020
This mathematics course is taught for first-year Ph.D. students of computer science and related areas @ZJU
Stars: ✭ 85 (+431.25%)
Nlppln
NLP pipeline software using common workflow language
Stars: ✭ 31 (+93.75%)
Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+7637.5%)
conferencias matutinas amlo
CSVs of the stenographic transcripts of the morning press conferences of President Andres Manuel López Obrador (Mañaneras AMLO)
Stars: ✭ 25 (+56.25%)
Tsv Utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+7493.75%)
malay-dataset
Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+1081.25%)
WonderfulPolishLanguage
This is a repository created for the list of resources for learning and exploring Wonderful Polish language.
Stars: ✭ 31 (+93.75%)
Bolt
Fast approximate vector operations
Stars: ✭ 70 (+337.5%)
Autophrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
Stars: ✭ 835 (+5118.75%)
Gorse
An open source recommender system service written in Go
Stars: ✭ 1,148 (+7075%)
Tokenizers
Fast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (+906.25%)
PubMed-Best-Match
Machine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+125%)
Linkedingiveaway
👨🏽🏫You can learn about anything over here. What Giveaways I do and why it's important in today's modern world. Are you interested in Giveaway's?🔋
Stars: ✭ 67 (+318.75%)
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+4837.5%)
Wordtokenizers.jl
High performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (+293.75%)
Ail Framework
AIL framework - Analysis Information Leak framework
Stars: ✭ 1,091 (+6718.75%)
Bigartm
Fast topic modeling platform
Stars: ✭ 563 (+3418.75%)
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+6625%)
Luci
Logical Unity for Communicational Interactivity
Stars: ✭ 25 (+56.25%)
Cgnn
Crystal Graph Neural Networks
Stars: ✭ 48 (+200%)
Helioml
A book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (+125%)
koshort
(deprecated) 🐱 koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.
Stars: ✭ 62 (+287.5%)
Drugs Recommendation Using Reviews
Analyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (+118.75%)
Onset
A language evolution simulator, using realistic phonetic changes.
Stars: ✭ 30 (+87.5%)
couchdb-pkg
Apache CouchDB Packaging support files
Stars: ✭ 24 (+50%)
pylangacq
Language Acquisition Research Tools
Stars: ✭ 33 (+106.25%)
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+12306.25%)
Subdue
The Subdue graph miner discovers highly-compressing patterns in an input graph.
Stars: ✭ 20 (+25%)
scikit-learn-intelex
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (+5443.75%)
En Data mining
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
Stars: ✭ 14 (-12.5%)
awesome-tools
curated list of awesome tools and libraries for specific domains
Stars: ✭ 31 (+93.75%)
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+900%)
TableDisentangler
Functional and structural analysis of tables in research papers (Table disentangling)
Stars: ✭ 21 (+31.25%)
the-stack
Website and datasets for The Stack, Daily Bruin's data journalism and newsroom tech blog.
Stars: ✭ 26 (+62.5%)
sentometrics
An integrated framework in R for textual sentiment time series aggregation and prediction
Stars: ✭ 77 (+381.25%)
Pyclustering
pyclustering is a Python, C++ data mining library.
Stars: ✭ 806 (+4937.5%)
rake-rs
Multilingual implementation of RAKE algorithm for Rust
Stars: ✭ 30 (+87.5%)
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+4300%)
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+3787.5%)
Awesome Datascience
📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+109400%)
Awesome Nlp
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+78812.5%)
Matminer
Data mining for materials science
Stars: ✭ 251 (+1468.75%)
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+19600%)
cdp-service
CDP data platform that helps enterprises fully understand their customers and deliver precisely targeted, personalized marketing.
Stars: ✭ 30 (+87.5%)
Chemdataextractor
Automatically extract chemical information from scientific documents
Stars: ✭ 152 (+850%)
Tweetfeels
Real-time sentiment analysis in Python using twitter's streaming api
Stars: ✭ 249 (+1456.25%)
Textfeatures
👷♂️ A simple package for extracting useful features from character objects 👷♀️
Stars: ✭ 148 (+825%)