Matrixprofile TsA Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (-36.7%)
Excel BootEasy-POI是一款Excel导入导出解决方案组成的轻量级开源组件。
Stars: ✭ 347 (-64.63%)
Raio X📊 Análise de dados das mulheres do curso de Ciência da Computação na UFCG
Stars: ✭ 18 (-98.17%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-39.55%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (-64.93%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+476.55%)
Crime AnalysisAssociation Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-97.96%)
Dist KerasDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (-37.51%)
Graph Fraud Detection PapersA curated list of fraud detection papers using graph information or graph neural networks
Stars: ✭ 339 (-65.44%)
FoxcrossAsyncIO serving for data science models
Stars: ✭ 18 (-98.17%)
Dash Docs📖 The Official Dash Userguide & Documentation
Stars: ✭ 338 (-65.55%)
Book sampleanother book on data science
Stars: ✭ 611 (-37.72%)
MlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+280.12%)
Excel MagicDo magic to your excel file!
Stars: ✭ 36 (-96.33%)
Keras MmoeA Keras implementation of "Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts" (KDD 2018)
Stars: ✭ 332 (-66.16%)
SiubaPython library for using dplyr like syntax with pandas and SQL
Stars: ✭ 605 (-38.33%)
DatmoOpen source production model management tool for data scientists
Stars: ✭ 334 (-65.95%)
Poi☀️ Read and Write Excel file using Java and Apache POI
Stars: ✭ 321 (-67.28%)
SmileStatistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+451.68%)
PandasvaultAdvanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (-67.79%)
KodiakEnhance your feature engineering workflow with Kodiak
Stars: ✭ 20 (-97.96%)
ProbabilityProbabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+261.88%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (-39.86%)
EvidentlyInteractive reports to analyze machine learning models during validation or production monitoring.
Stars: ✭ 304 (-69.01%)
Scikit RebateA scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
Stars: ✭ 314 (-67.99%)
Carefree LearnA minimal Automatic Machine Learning (AutoML) solution for tabular datasets based on PyTorch
Stars: ✭ 316 (-67.79%)
Awesome Ai UsecasesA list of awesome and proven Artificial Intelligence use cases and applications
Stars: ✭ 587 (-40.16%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (-9.58%)
Pm4py CorePublic repository for the PM4Py (Process Mining for Python) project.
Stars: ✭ 313 (-68.09%)
ReadxlRead excel files (.xls and .xlsx) into R 🖇
Stars: ✭ 585 (-40.37%)
Dash CytoscapeInteractive network visualization in Python and Dash, powered by Cytoscape.js
Stars: ✭ 309 (-68.5%)
Xxl Toola series of tools that make Java development more efficient.(Java工具类库XXL-TOOL)
Stars: ✭ 311 (-68.3%)
Food Inspections EvaluationThis repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
Stars: ✭ 311 (-68.3%)
TidyTidy up your data with JavaScript, inspired by dplyr and the tidyverse
Stars: ✭ 307 (-68.71%)
LaracsvA Laravel package to easily generate CSV files from Eloquent model
Stars: ✭ 583 (-40.57%)
Vehicle counting tensorflow🚘 "MORE THAN VEHICLE COUNTING!" This project provides prediction for speed, color and size of the vehicles with TensorFlow Object Counting API.
Stars: ✭ 582 (-40.67%)
Apricotapricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models quickly. See the documentation page: https://apricot-select.readthedocs.io/en/latest/index.html
Stars: ✭ 306 (-68.81%)
Elixir ScrapeScrape any website, article or RSS/Atom Feed with ease!
Stars: ✭ 306 (-68.81%)
Xlnt📊 Cross-platform user-friendly xlsx library for C++11+
Stars: ✭ 876 (-10.7%)
LuxPython API for Intelligent Visual Data Discovery
Stars: ✭ 787 (-19.78%)
Data Science CompetitionsGoal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
Stars: ✭ 572 (-41.69%)
Xam🎯 Personal data science and machine learning toolbox
Stars: ✭ 306 (-68.81%)
TabulaTabula is a tool for liberating data tables trapped inside PDF files
Stars: ✭ 5,420 (+452.5%)
Excel4j✨ Excel operation component based on poi & CSV ✨
Stars: ✭ 305 (-68.91%)
Webxcel🤔 A REST backend built with plain VBA Microsoft Excel macros. Yes. Macros.
Stars: ✭ 305 (-68.91%)
PyamplitudeA Python connector for Amplitude Analytics
Stars: ✭ 16 (-98.37%)
AlluxioAlluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+448.32%)
CartolaExtração de dados da API do CartolaFC, análise exploratória dos dados e modelos preditivos em R e Python - 2014-20. [EN] Data munging, analysis and modeling of CartolaFC - the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (-69.01%)
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-69.11%)
BaikalA graph-based functional API for building complex scikit-learn pipelines.
Stars: ✭ 573 (-41.59%)
NonechucksDeal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (-69.01%)
PydatasetInstant access to many datasets in Python.
Stars: ✭ 880 (-10.3%)
TalksRepository of publicly available talks by Leon Eyrich Jessen, PhD. Talks cover Data Science and R in the context of research
Stars: ✭ 16 (-98.37%)
Pygam[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (-42%)