Spark Py Notebooks: Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+5717.39%)
Kdepy: Kernel Density Estimation in Python
Stars: ✭ 244 (+960.87%)
covid19-data-greece: Datasets and analysis of Novel Coronavirus (COVID-19) outbreak in Greece
Stars: ✭ 16 (-30.43%)
Stingray: Anything can happen in the next half hour (including spectral timing made easy)!
Stars: ✭ 94 (+308.7%)
mlmachine: mlmachine accelerates machine learning experimentation
Stars: ✭ 23 (+0%)
GreyNSights: Privacy-Preserving Data Analysis using Pandas
Stars: ✭ 18 (-21.74%)
Pyastronomy: A collection of astronomy-related routines in Python
Stars: ✭ 91 (+295.65%)
kwx: BERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (+43.48%)
ipychart: The power of Chart.js with Python
Stars: ✭ 48 (+108.7%)
growthbook: Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+10082.61%)
Hiring: Create WOW Moments. Create superfans.
Stars: ✭ 85 (+269.57%)
data vis statistics geosciences: This repository contains the laboratory portion of an upper-level undergraduate class in Python on data visualization and statistics for geo & space scientists. Labs are updated when the course is in session through the most recent branch. See master version for current class.
Stars: ✭ 32 (+39.13%)
Eegrunt: A collection of Python EEG (+ ECG) analysis utilities for OpenBCI and Muse
Stars: ✭ 171 (+643.48%)
rworkshops: Materials for R Workshops
Stars: ✭ 43 (+86.96%)
Dex: Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+5282.61%)
Guitar: A Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Stars: ✭ 86 (+273.91%)
Plotnine: A grammar of graphics for Python
Stars: ✭ 2,879 (+12417.39%)
CoreMS: CoreMS is a comprehensive mass spectrometry software framework
Stars: ✭ 20 (-13.04%)
Exportify: Export Spotify playlists using the Web API. Analyze them in the Jupyter notebook.
Stars: ✭ 80 (+247.83%)
TextGridTools: Read, write, and manipulate Praat TextGrid files with Python
Stars: ✭ 84 (+265.22%)
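leaflet heatmap: A simple visualization of call data from Huzhou. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap rendering step is moved to offline computation. The data is first processed in parallel with Apache Spark, the heatmap is then rendered with Apache Spark, and leafletjs loads the OpenStreetMap layer and the heatmap layer for good interactivity. Rendering is currently implemented with Apache Spark; either Apache Spark is not well suited to this kind of computation or I did not design the algorithm well, because the parallel computation is slower than the single-machine computation. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .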
Stars: ✭ 13 (-43.48%)
Hyperlearn: 50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+5134.78%)
Dominando-Pandas: This repository is dedicated to the process of learning the Pandas library.
Stars: ✭ 22 (-4.35%)
CC33Z: Computer Science course
Stars: ✭ 50 (+117.39%)
AlphaPlot: 📈 Application for statistical analysis and data visualization which can generate different types of publication quality 2D and 3D plots with extensive visual customization.
Stars: ✭ 140 (+508.7%)
Dream3d: Data analysis program and framework for materials science data analytics, based on the SIMPL managing framework.
Stars: ✭ 73 (+217.39%)
Airbyte: Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+21286.96%)
Data-Visualization: Collection of interactive Jupyter Notebook widgets and graphs.
Stars: ✭ 112 (+386.96%)
Facebook Data Analyzer: 📊 Python script to analyze the contents of your Facebook data export
Stars: ✭ 71 (+208.7%)
yt-channels-DS-AI-ML-CS: A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+4413.04%)
Amadeus: Harmonious distributed data analysis in Rust.
Stars: ✭ 240 (+943.48%)
gocells: Event Based Applications [DEPRECATED]
Stars: ✭ 69 (+200%)
Datatable: A Python package for manipulating 2-dimensional tabular data structures
Stars: ✭ 1,166 (+4969.57%)
PSelect: PowerShell DSL for aggregating data
Stars: ✭ 27 (+17.39%)
Pipeline: the `pipeline` shell command
Stars: ✭ 168 (+630.43%)
covid-19: COVID-19 World is yet another project to build a dashboard-like app to showcase data related to COVID-19 (coronavirus).
Stars: ✭ 28 (+21.74%)
Datacamp: 🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (+200%)
pandas-workshop: An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+600%)
dataViz CADi: Materials for the "Data Visualization" CADi workshop @ "Tecnológico de Monterrey"
Stars: ✭ 14 (-39.13%)
Naive-Resume-Matching: Text similarity applied to resumes, comparing them with job descriptions and creating a score to rank them. Similar to an ATS.
Stars: ✭ 27 (+17.39%)
Pandas Datareader: Extract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+9391.3%)
ipython-notebooks: A collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (+86.96%)
Graphia: A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (+191.3%)
PythonTipsDS: Python tips for data scientists
Stars: ✭ 23 (+0%)
Datascience: Curated list of Python resources for data science.
Stars: ✭ 3,051 (+13165.22%)
Pydata Pandas Workshop: Material for my PyData Jupyter & Pandas Workshops. I'm also available for personal in-house trainings on request.
Stars: ✭ 65 (+182.61%)
ipaddress: Data analysis of IP addresses and networks
Stars: ✭ 20 (-13.04%)
Data-Science-101: Notes and tutorials on how to use Python, pandas, seaborn, NumPy, matplotlib, and SciPy for data science.
Stars: ✭ 19 (-17.39%)
Bitcoin Analysis-Python: Bitcoin is a widely used cryptocurrency for the digital market. It is decentralised, meaning it is not owned by a government or any company. Transactions are simple and easy because it does not belong to any country. Records are stored in the blockchain. The Bitcoin price is variable and it is widely used, so it is important to predict the price of it f…
Stars: ✭ 42 (+82.61%)
Pandas Videos: Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+7360.87%)
Gonum: Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
Stars: ✭ 5,384 (+23308.7%)