PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+21.15%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+280.08%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+1610.27%)
Danfojsdanfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Stars: ✭ 1,304 (+167.76%)
SmileStatistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+1011.29%)
Algorithmic-TradingI have been deeply interested in algorithmic trading and systematic trading algorithms. This Repository contains the code of what I have learnt on the way. It starts form some basic simple statistics and will lead up to complex machine learning algorithms.
Stars: ✭ 47 (-90.35%)
Machine Learning With PythonPractice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+351.13%)
FoxcrossAsyncIO serving for data science models
Stars: ✭ 18 (-96.3%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-74.33%)
PandasvaultAdvanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (-35.11%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+21.77%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+70.02%)
Stats Maths With PythonGeneral statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (-21.77%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+525.05%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+471.87%)
PrettypandasA Pandas Styler class for making beautiful tables
Stars: ✭ 376 (-22.79%)
BoltzmanncleanFill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (-95.28%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-43.94%)
Morpheus CoreThe foundational library of the Morpheus data science framework
Stars: ✭ 203 (-58.32%)
MarsMars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+373.92%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-51.75%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+25.67%)
StyleframeA library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Stars: ✭ 252 (-48.25%)
cognipyIn-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (-93.63%)
PeroxideRust numeric library with R, MATLAB & Python syntax
Stars: ✭ 191 (-60.78%)
PandasguiPandasGUI is a GUI for viewing, plotting and analyzing Pandas DataFrames.
Stars: ✭ 2,495 (+412.32%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+4427.31%)
pyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 970 (+99.18%)
saddleSADDLE: Scala Data Library
Stars: ✭ 23 (-95.28%)
veridical-flowMaking it easier to build stable, trustworthy data-science pipelines.
Stars: ✭ 28 (-94.25%)
raccoonPython DataFrame with fast insert and appends
Stars: ✭ 64 (-86.86%)
hdfeNo description or website provided.
Stars: ✭ 22 (-95.48%)
TeachingTeaching Materials for Dr. Waleed A. Yousef
Stars: ✭ 435 (-10.68%)
skippaSciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (-93.22%)
Dominando-PandasEste repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (-95.48%)
fairlensIdentify bias and measure fairness of your data
Stars: ✭ 51 (-89.53%)
FacetHuman-explainable AI.
Stars: ✭ 269 (-44.76%)
XlearnHigh performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (+509.45%)
PantheraData-frames & arrays on Clojure
Stars: ✭ 168 (-65.5%)
tableau-scrapingTableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (-81.31%)
PandapyPandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Stars: ✭ 474 (-2.67%)
Openintro Statistics📚 An open-source textbook written at the college level. OpenIntro also offers a second college-level intro stat textbook and also a high school variant.
Stars: ✭ 283 (-41.89%)
Uncertainty BaselinesHigh-quality implementations of standard and SOTA methods on a variety of tasks.
Stars: ✭ 278 (-42.92%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-41.07%)
QframeImmutable data frame for Go
Stars: ✭ 282 (-42.09%)
LanternData exploration glue
Stars: ✭ 292 (-40.04%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+800.82%)
ProbabilityProbabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+628.95%)
EvidentlyInteractive reports to analyze machine learning models during validation or production monitoring.
Stars: ✭ 304 (-37.58%)
PystoreFast data store for Pandas time-series data
Stars: ✭ 325 (-33.26%)
Data Science LearningRepository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
Stars: ✭ 273 (-43.94%)