
1618 open source projects that are alternatives to or similar to Janitor

Matrixprofile Ts
A Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (-36.7%)
Mutual labels:  data-science
Excel Boot
Easy-POI is a lightweight open-source component built around an Excel import/export solution.
Stars: ✭ 347 (-64.63%)
Mutual labels:  excel
Raio X
📊 Data analysis of the women in the Computer Science program at UFCG
Stars: ✭ 18 (-98.17%)
Mutual labels:  data-analysis
Datasheets
Read data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-39.55%)
Mutual labels:  data-science
Machine Learning For Trading
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Stars: ✭ 4,979 (+407.54%)
Mutual labels:  data-science
Deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (-64.93%)
Mutual labels:  data-science
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+476.55%)
Mutual labels:  data-science
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-97.96%)
Mutual labels:  data-science
Dist Keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (-37.51%)
Mutual labels:  data-science
Graph Fraud Detection Papers
A curated list of fraud detection papers using graph information or graph neural networks
Stars: ✭ 339 (-65.44%)
Mutual labels:  data-science
Foxcross
AsyncIO serving for data science models
Stars: ✭ 18 (-98.17%)
Mutual labels:  data-science
Dash Docs
📖 The Official Dash Userguide & Documentation
Stars: ✭ 338 (-65.55%)
Mutual labels:  data-science
Book sample
another book on data science
Stars: ✭ 611 (-37.72%)
Mutual labels:  data-science
Mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+280.12%)
Mutual labels:  data-science
Excel Magic
Do magic to your excel file!
Stars: ✭ 36 (-96.33%)
Mutual labels:  excel
Keras Mmoe
A Keras implementation of "Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts" (KDD 2018)
Stars: ✭ 332 (-66.16%)
Mutual labels:  data-science
Siuba
Python library for using dplyr like syntax with pandas and SQL
Stars: ✭ 605 (-38.33%)
Mutual labels:  data-analysis
Datmo
Open source production model management tool for data scientists
Stars: ✭ 334 (-65.95%)
Mutual labels:  data-science
Poi
☀️ Read and Write Excel file using Java and Apache POI
Stars: ✭ 321 (-67.28%)
Mutual labels:  excel
Smile
Statistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+451.68%)
Mutual labels:  data-science
Getting Started With Genomics Tools And Resources
Unix, R and python tools for genomics and data science
Stars: ✭ 587 (-40.16%)
Mutual labels:  data-science
Pandasvault
Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (-67.79%)
Mutual labels:  data-science
Kodiak
Enhance your feature engineering workflow with Kodiak
Stars: ✭ 20 (-97.96%)
Mutual labels:  data-analysis
Probability
Probabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+261.88%)
Mutual labels:  data-science
Pdpipe
Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (-39.86%)
Mutual labels:  data-science
Evidently
Interactive reports to analyze machine learning models during validation or production monitoring.
Stars: ✭ 304 (-69.01%)
Mutual labels:  data-science
Scikit Rebate
A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
Stars: ✭ 314 (-67.99%)
Mutual labels:  data-science
Carefree Learn
A minimal Automatic Machine Learning (AutoML) solution for tabular datasets based on PyTorch
Stars: ✭ 316 (-67.79%)
Mutual labels:  data-science
Awesome Ai Usecases
A list of awesome and proven Artificial Intelligence use cases and applications
Stars: ✭ 587 (-40.16%)
Mutual labels:  data-science
Clevercsv
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (-9.58%)
Mutual labels:  data-science
Pm4py Core
Public repository for the PM4Py (Process Mining for Python) project.
Stars: ✭ 313 (-68.09%)
Mutual labels:  data-science
Readxl
Read excel files (.xls and .xlsx) into R 🖇
Stars: ✭ 585 (-40.37%)
Mutual labels:  excel
Dash Cytoscape
Interactive network visualization in Python and Dash, powered by Cytoscape.js
Stars: ✭ 309 (-68.5%)
Mutual labels:  data-science
Xxl Tool
A series of tools that make Java development more efficient (Java utility library XXL-TOOL).
Stars: ✭ 311 (-68.3%)
Mutual labels:  excel
Food Inspections Evaluation
This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
Stars: ✭ 311 (-68.3%)
Mutual labels:  data-science
Tidy
Tidy up your data with JavaScript, inspired by dplyr and the tidyverse
Stars: ✭ 307 (-68.71%)
Mutual labels:  tidyverse
Visualization Of Global Terrorism Database
📊 Visualization of GTD with py Plotly lib, including amazing graphs and animation 📼
Stars: ✭ 16 (-98.37%)
Mutual labels:  data-analysis
Laracsv
A Laravel package to easily generate CSV files from Eloquent model
Stars: ✭ 583 (-40.57%)
Mutual labels:  excel
Lofo Importance
Leave One Feature Out Importance
Stars: ✭ 310 (-68.4%)
Mutual labels:  data-science
Erlemar.github.io
Data science portfolio
Stars: ✭ 309 (-68.5%)
Mutual labels:  data-science
Vehicle counting tensorflow
🚘 "MORE THAN VEHICLE COUNTING!" This project provides prediction for speed, color and size of the vehicles with TensorFlow Object Counting API.
Stars: ✭ 582 (-40.67%)
Mutual labels:  data-science
Apricot
apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models quickly. See the documentation page: https://apricot-select.readthedocs.io/en/latest/index.html
Stars: ✭ 306 (-68.81%)
Mutual labels:  data-science
Elixir Scrape
Scrape any website, article or RSS/Atom Feed with ease!
Stars: ✭ 306 (-68.81%)
Mutual labels:  data-science
Xlnt
📊 Cross-platform user-friendly xlsx library for C++11+
Stars: ✭ 876 (-10.7%)
Mutual labels:  excel
Lux
Python API for Intelligent Visual Data Discovery
Stars: ✭ 787 (-19.78%)
Mutual labels:  data-science
Data Science Competitions
The goal of this repo is to provide solutions to all Data Science Competitions (Kaggle, Data Hack, Machine Hack, Driven Data, etc.).
Stars: ✭ 572 (-41.69%)
Mutual labels:  data-science
Xam
🎯 Personal data science and machine learning toolbox
Stars: ✭ 306 (-68.81%)
Mutual labels:  data-science
Tabula
Tabula is a tool for liberating data tables trapped inside PDF files
Stars: ✭ 5,420 (+452.5%)
Mutual labels:  excel
Excel4j
✨ Excel operation component based on poi & CSV ✨
Stars: ✭ 305 (-68.91%)
Mutual labels:  excel
Webxcel
🤔 A REST backend built with plain VBA Microsoft Excel macros. Yes. Macros.
Stars: ✭ 305 (-68.91%)
Mutual labels:  excel
Pyamplitude
A Python connector for Amplitude Analytics
Stars: ✭ 16 (-98.37%)
Mutual labels:  data-analysis
Alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+448.32%)
Mutual labels:  data-analysis
Cartola
Data extraction from the CartolaFC API, exploratory data analysis, and predictive models in R and Python, 2014-20. Data munging, analysis and modeling of CartolaFC, the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (-69.01%)
Mutual labels:  data-science
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-69.11%)
Mutual labels:  data-analysis
Baikal
A graph-based functional API for building complex scikit-learn pipelines.
Stars: ✭ 573 (-41.59%)
Mutual labels:  data-science
120 Ds Interview Questions
My Answer to 120 Data Science Interview Questions
Stars: ✭ 304 (-69.01%)
Mutual labels:  data-science
Nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (-69.01%)
Mutual labels:  data-cleaning
Pydataset
Instant access to many datasets in Python.
Stars: ✭ 880 (-10.3%)
Mutual labels:  data-science
Talks
Repository of publicly available talks by Leon Eyrich Jessen, PhD. Talks cover Data Science and R in the context of research
Stars: ✭ 16 (-98.37%)
Mutual labels:  tidyverse
Pygam
[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (-42%)
Mutual labels:  data-science