ToolboxA Java Toolbox for Scalable Probabilistic Machine Learning
Stars: ✭ 105 (-11.02%)
Danfojsdanfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Stars: ✭ 1,304 (+1005.08%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-44.92%)
Michael S Guide To Becoming A Data ScientistI was once asked about transitioning to a career in data science by three different UChicago grad students over a short period of time, so I decided to put together this outline in case anyone else was curious.
Stars: ✭ 34 (-71.19%)
NeuralpyNeuralPy: A Keras like deep learning library works on top of PyTorch
Stars: ✭ 77 (-34.75%)
Python TrainingPython training for business analysts and traders
Stars: ✭ 972 (+723.73%)
Blog文章列表
Stars: ✭ 96 (-18.64%)
Feagen(deprecated) A fast and memory-efficient Python data engineering framework for machine learning.
Stars: ✭ 33 (-72.03%)
Daru Viewdaru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.
Stars: ✭ 65 (-44.92%)
PyastronomyA collection of astronomy-related routines in Python
Stars: ✭ 91 (-22.88%)
Pydata Pandas WorkshopMaterial for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
Stars: ✭ 65 (-44.92%)
Mljar SupervisedAutomated Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning 🚀
Stars: ✭ 961 (+714.41%)
SuspeitandoProjeto de análise de contratos com suspeita de superfaturamento e má qualidade na prestação de serviços.
Stars: ✭ 76 (-35.59%)
Simple Sh DatascienceA collection of Bash scripts and Dockerfiles to install data science Tool, Lib and application
Stars: ✭ 32 (-72.88%)
VenonaCodefresh runtime-environment agent
Stars: ✭ 31 (-73.73%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-45.76%)
Page clusteringA simple algorithm for clustering web pages, suitable for crawlers
Stars: ✭ 30 (-74.58%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+8779.66%)
Arcgis Python ApiDocumentation and samples for ArcGIS API for Python
Stars: ✭ 954 (+708.47%)
Python BigdataData science and Big Data with Python
Stars: ✭ 112 (-5.08%)
NeuroflowArtificial Neural Networks for Scala
Stars: ✭ 105 (-11.02%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-22.88%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-45.76%)
Wolfram CoronavirusWolfram Language code and notebooks related to the coronavirus outbreak
Stars: ✭ 30 (-74.58%)
MachinelearningA repo with tutorials for algorithms from scratch
Stars: ✭ 96 (-18.64%)
RebateRelief Based Algorithms of ReBATE implemented in Python with Cython optimization. This repository is no longer being updated. Please see scikit-rebate.
Stars: ✭ 29 (-75.42%)
BlenderdatavisData visualisation addon for Blender
Stars: ✭ 72 (-38.98%)
Pytorch ToolbeltPyTorch extensions for fast R&D prototyping and Kaggle farming
Stars: ✭ 942 (+698.31%)
PandasFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Stars: ✭ 32,029 (+27043.22%)
ProbflowA Python package for building Bayesian models with TensorFlow or PyTorch
Stars: ✭ 95 (-19.49%)
Intro PythonPython pour Statistique et Science des Données -- Syntaxe, Trafic de Données, Graphes, Programmation, Apprentissage
Stars: ✭ 21 (-82.2%)
MagicboxA platform that uses real-time data to inform life-saving humanitarian responses to emergency situations
Stars: ✭ 73 (-38.14%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-45.76%)
TflearnDeep learning library featuring a higher-level API for TensorFlow.
Stars: ✭ 9,573 (+8012.71%)
WildfirepyWildfirePy, a Python library for Wildfire GIS data analysis.
Stars: ✭ 21 (-82.2%)
PermonA tool to monitor everything you want. Clean, simple, extensible and in one place.
Stars: ✭ 73 (-38.14%)
Crunchbase MlMerge and Acquisitions Prediction based on M&A information from Crunchbase.
Stars: ✭ 20 (-83.05%)
VistrailsVisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains detailed history information about the steps followed and data derived in the course of an exploratory task: VisTrails maintains provenance of data products, of the computational processes that derive these products and their executions.
Stars: ✭ 94 (-20.34%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+651.69%)
Allstate capstoneAllstate Kaggle Competition ML Capstone Project
Stars: ✭ 72 (-38.98%)
PydatasetInstant access to many datasets in Python.
Stars: ✭ 880 (+645.76%)
EuropaPuppet Container Registry
Stars: ✭ 114 (-3.39%)
AtacseqATAC-seq peak-calling, QC and differential analysis pipeline
Stars: ✭ 72 (-38.98%)
Applied Ml📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+15005.08%)
ChicksexerA Python package for gender classification.
Stars: ✭ 64 (-45.76%)
BayesliteBayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
Stars: ✭ 877 (+643.22%)
R CourseUna introduccion al analisis de datos con R y R Studio
Stars: ✭ 93 (-21.19%)
Pydata.krPyData Korea 공식 홈페이지입니다. (준비중)
Stars: ✭ 13 (-88.98%)
AutowrapWrap existing D code for use in Python, Excel, C#
Stars: ✭ 64 (-45.76%)
YaboxYet another black-box optimization library for Python
Stars: ✭ 103 (-12.71%)
H2o TutorialsTutorials and training material for the H2O Machine Learning Platform
Stars: ✭ 1,305 (+1005.93%)
Dev PracticePractice your skills with these ideas.
Stars: ✭ 1,127 (+855.08%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+849.15%)
WarpConvert and analyze large data sets at light speed, on Mac and iOS.
Stars: ✭ 62 (-47.46%)
Labeled Tweet GeneratorSearch for tweets and download the data labeled with its polarity in CSV format
Stars: ✭ 111 (-5.93%)
GdlGDL - GNU Data Language
Stars: ✭ 104 (-11.86%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-24.58%)