Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (-57.14%)
WebplotdigitizerHTML5 based online tool to extract numerical data from plot images.
Stars: ✭ 1,605 (+991.84%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+742.18%)
Ayakashi⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (-20.41%)
GorseAn open source recommender system service written in Go
Stars: ✭ 1,148 (+680.95%)
StriplogLithology and stratigraphic logs for wells or outcrop.
Stars: ✭ 133 (-9.52%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+631.97%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-31.97%)
Csmath 2020This mathematics course is taught for the first year Ph.D. students of computer science and related areas @ZJU
Stars: ✭ 85 (-42.18%)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-77.55%)
OpenhistorianThe Open Source Time-Series Data Historian
Stars: ✭ 120 (-18.37%)
Tsv UtilseBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+726.53%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+9001.36%)
BoltFast approximate vector operations
Stars: ✭ 70 (-52.38%)
Lab WorkshopsMaterials for workshops on text mining, machine learning, and data visualization
Stars: ✭ 112 (-23.81%)
Linkedingiveaway👨🏽🏫You can learn about anything over here. What Giveaways I do and why it's important in today's modern world. Are you interested in Giveaway's?🔋
Stars: ✭ 67 (-54.42%)
Efficient AprioriAn efficient Python implementation of the Apriori algorithm.
Stars: ✭ 145 (-1.36%)
Ail FrameworkAIL framework - Analysis Information Leak framework
Stars: ✭ 1,091 (+642.18%)
GspanPython implementation of frequent subgraph mining algorithm gSpan. Directed graphs are supported.
Stars: ✭ 103 (-29.93%)
CgnnCrystal Graph Neural Networks
Stars: ✭ 48 (-67.35%)
Mldmпотоковый курс "Машинное обучение и анализ данных (Machine Learning and Data Mining)" на факультете ВМК МГУ имени М.В. Ломоносова
Stars: ✭ 35 (-76.19%)
Papers Literature Ml Dl Rl AiHighly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+812.24%)
Invoice2dataExtract structured data from PDF invoices
Stars: ✭ 943 (+541.5%)
RavenRAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Stars: ✭ 122 (-17.01%)
Dc Hi guides[Data Castle 算法竞赛] 精品旅行服务成单预测 final rank 11
Stars: ✭ 83 (-43.54%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-6.8%)
Kddcup 20206th Solution for 2020-KDDCUP Debiasing Challenge
Stars: ✭ 118 (-19.73%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (-48.98%)
Bee UniversityProject thu thập điểm chuẩn đại học 2014 - 2018 và phân tích dữ liệu
Stars: ✭ 73 (-50.34%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-21.77%)
FfbeDatamining for FFBE GL
Stars: ✭ 69 (-53.06%)
EvalneSource code for EvalNE, a Python library for evaluating Network Embedding methods.
Stars: ✭ 67 (-54.42%)
BellaBella is a pure python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
Stars: ✭ 112 (-23.81%)
Fantasy Basketball Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (-0.68%)
GendisContains an implementation (sklearn API) of the algorithm proposed in "GENDIS: GEnetic DIscovery of Shapelets" and code to reproduce all experiments.
Stars: ✭ 59 (-59.86%)
Etherscan MlPython Data Science and Machine Learning Library for the Ethereum and ERC-20 Blockchain
Stars: ✭ 55 (-62.59%)
TipdmTipDM建模平台,开源的数据挖掘工具。
Stars: ✭ 130 (-11.56%)
Php MlPHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+5274.15%)
Gitlogg💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Stars: ✭ 102 (-30.61%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-70.75%)
HeliomlA book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (-75.51%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (-32.65%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (-76.19%)
MsnoiseA Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Stars: ✭ 94 (-36.05%)
Rosie Pattern LanguageRosie Pattern Language (RPL) and the Rosie Pattern Engine have MOVED!
Stars: ✭ 146 (-0.68%)
MatrixprofileA Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-4.08%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-14.97%)
DaggyDaggy - Data Aggregation Utility. Open source, free, cross-platform, server-less, useful utility for remote or local data aggregation and streaming
Stars: ✭ 91 (-38.1%)