Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+7126.21%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+3317.93%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-55.17%)
ImhotepImhotep is a large-scale analytics platform built by Indeed.
Stars: ✭ 125 (-13.79%)
DatabenchData analysis tool.
Stars: ✭ 82 (-43.45%)
AkumuliTime-series database
Stars: ✭ 754 (+420%)
EdwardA probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Stars: ✭ 4,674 (+3123.45%)
ChicksexerA Python package for gender classification.
Stars: ✭ 64 (-55.86%)
TflearnDeep learning library featuring a higher-level API for TensorFlow.
Stars: ✭ 9,573 (+6502.07%)
Awesome RA curated list of awesome R packages, frameworks and software.
Stars: ✭ 4,858 (+3250.34%)
Dev PracticePractice your skills with these ideas.
Stars: ✭ 1,127 (+677.24%)
Data Science WgSF Brigade's Data Science Working Group.
Stars: ✭ 135 (-6.9%)
Machine Learning RoadmapA roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Stars: ✭ 5,277 (+3539.31%)
Neural prophetNeuralProphet - A simple forecasting model based on Neural Networks in PyTorch
Stars: ✭ 1,125 (+675.86%)
PalladiumFramework for setting up predictive analytics services
Stars: ✭ 481 (+231.72%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-26.21%)
Combo(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Stars: ✭ 481 (+231.72%)
WhisperWhisper is a file-based time-series database format for Graphite.
Stars: ✭ 1,121 (+673.1%)
PandapyPandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Stars: ✭ 474 (+226.9%)
Dive Into Machine LearningDive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+7355.17%)
Cookiecutter Data ScienceA logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Stars: ✭ 5,271 (+3535.17%)
SkproSupervised domain-agnostic prediction framework for probabilistic modelling
Stars: ✭ 107 (-26.21%)
Ai PlatformAn open-source platform for automating tasks using machine learning models
Stars: ✭ 61 (-57.93%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+3405.52%)
Collaborative Deep Learning For Recommender SystemsThe hybrid model combining stacked denoising autoencoder with matrix factorization is applied, to predict the customer purchase behavior in the future month according to the purchase history and user information in the Santander dataset.
Stars: ✭ 60 (-58.62%)
Serenata De Amor🕵 Artificial Intelligence for social control of public administration
Stars: ✭ 4,367 (+2911.72%)
CarbonCarbon is one of the components of Graphite, and is responsible for receiving metrics over the network and writing them down to disk using a storage backend.
Stars: ✭ 1,435 (+889.66%)
Hpbandstera distributed Hyperband implementation on Steroids
Stars: ✭ 456 (+214.48%)
Pycon Ua 2018Talk at PyCon UA 2018 (Kharkov, Ukraine)
Stars: ✭ 60 (-58.62%)
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+211.72%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-13.79%)
Tensor HouseA collection of reference machine learning and optimization models for enterprise operations: marketing, pricing, supply chain
Stars: ✭ 449 (+209.66%)
TurbodbcTurbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
Stars: ✭ 449 (+209.66%)
InfluxdbScalable datastore for metrics, events, and real-time analytics
Stars: ✭ 22,577 (+15470.34%)
VerticapyVerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-59.31%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+15057.24%)
PandasschemaA validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-6.9%)
Metaflow🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+3422.76%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+5783.45%)
Siridb ServerSiriDB is a highly-scalable, robust and super fast time series database. Build from the ground up SiriDB uses a unique mechanism to operate without a global index and allows server resources to be added on the fly. SiriDB's unique query language includes dynamic grouping of time series for easy analysis over large amounts of time series.
Stars: ✭ 438 (+202.07%)
NeuroflowArtificial Neural Networks for Scala
Stars: ✭ 105 (-27.59%)
Machine learning refinedNotes, examples, and Python demos for the textbook "Machine Learning Refined" (published by Cambridge University Press).
Stars: ✭ 750 (+417.24%)
River🌊 Online machine learning in Python
Stars: ✭ 2,980 (+1955.17%)
RowsA common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (+409.66%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-44.14%)
Getting Things Done With PytorchJupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BERT.
Stars: ✭ 738 (+408.97%)
Uplot📈 A small, fast chart for time series, lines, areas, ohlc & bars
Stars: ✭ 6,808 (+4595.17%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-22.07%)