CdsData syncing in golang for ClickHouse.
Stars: ✭ 501 (+1065.12%)
datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+874.42%)
onelinerhub2.5k code solutions with clear explanation @ onelinerhub.com
Stars: ✭ 645 (+1400%)
cdsData syncing in golang for ClickHouse.
Stars: ✭ 839 (+1851.16%)
PandahousePandas interface for Clickhouse database
Stars: ✭ 126 (+193.02%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+32155.81%)
wax-mlA Python library for machine-learning and feedback loops on streaming data
Stars: ✭ 36 (-16.28%)
xpandasUniversal 1d/2d data containers with Transformers functionality for data analysis.
Stars: ✭ 25 (-41.86%)
five-minute-midasPredicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (-4.65%)
weaverbirdA visual data pipeline builder with various backends
Stars: ✭ 65 (+51.16%)
tutorialsShort programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-67.44%)
DS-Cookbook101A jupyter notebook having all most frequent used code snippet for daily data scienceoperations
Stars: ✭ 59 (+37.21%)
awesome-bigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 11,093 (+25697.67%)
machine-learning-capstone-projectThis is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-34.88%)
cognipyIn-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (-27.91%)
jcastsSimple podcast MVP
Stars: ✭ 27 (-37.21%)
Information-RetrievalInformation Retrieval algorithms developed in python. To follow the blog posts, click on the link:
Stars: ✭ 103 (+139.53%)
toucan-connectorsConnectors available to retrieve data in Toucan Toco small apps
Stars: ✭ 13 (-69.77%)
obsplusA Pandas-Centric ObsPy Expansion Pack
Stars: ✭ 28 (-34.88%)
faldo more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Stars: ✭ 567 (+1218.6%)
ydata-qualityData Quality assessment with one line of code
Stars: ✭ 311 (+623.26%)
datarA Grammar of Data Manipulation in python
Stars: ✭ 142 (+230.23%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+274.42%)
dbal-clickhouseDoctrine DBAL driver for ClickHouse database
Stars: ✭ 77 (+79.07%)
jupyter-djangoUsing Jupyter Notebook with Django: a presentation
Stars: ✭ 42 (-2.33%)
clickhouse hadoopImport data from clickhouse to hadoop with pure SQL
Stars: ✭ 26 (-39.53%)
DataProfilerWhat's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+1860.47%)
shieldShield is a role-based cloud-native user management system, identity & access proxy, and authorization server for your applications and API endpoints.
Stars: ✭ 158 (+267.44%)
meetups-archivosPpts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (+39.53%)
ExposureExposure是一个帮助做曝光统计需求的库,可以很方便的对曝光事件进行埋点,在现有代码上少量侵入即可实现曝光埋点。支持RV的线性布局、网格布局、瀑布流布局、横向滑动RV,ScrollView等各种滚动布局。支持配置item的有效曝光面积。
Stars: ✭ 51 (+18.6%)
chatstats💬📊 Fun data visualizations for Facebook Messenger chats
Stars: ✭ 18 (-58.14%)
pantabRead/Write pandas DataFrames with Tableau Hyper Extracts
Stars: ✭ 64 (+48.84%)
appmetrica-logsapi-loaderA tool for automatic data loading from AppMetrica LogsAPI into (local) ClickHouse
Stars: ✭ 18 (-58.14%)
online-course-recommendation-systemBuilt on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-27.91%)
saddleSADDLE: Scala Data Library
Stars: ✭ 23 (-46.51%)
flokkrDocumentation placeholder and utilities for all the other containers.
Stars: ✭ 30 (-30.23%)
ProtonHigh performance Pinba server
Stars: ✭ 27 (-37.21%)
ml-workflow-automationPython Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deployment as a RESTful service on Kubernetes.
Stars: ✭ 44 (+2.33%)
pybacenThis library was developed for economic analysis in the Brazilian scenario (Investments, micro and macroeconomic indicators)
Stars: ✭ 40 (-6.98%)
UnROOT.jlNative Julia I/O package to work with CERN ROOT files
Stars: ✭ 52 (+20.93%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1323.26%)
gw2raidarA log parsing website for Guild Wars 2 combat logs
Stars: ✭ 19 (-55.81%)
pandas twitterAnalyzing Trump's tweets using Python (Pandas + Twitter workshop)
Stars: ✭ 81 (+88.37%)
SparkTwitterAnalysisAn Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project.
Stars: ✭ 29 (-32.56%)
flink-learnLearning Flink : Flink CEP,Flink Core,Flink SQL
Stars: ✭ 70 (+62.79%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-55.81%)
pytdTreasure Data Driver for Python
Stars: ✭ 15 (-65.12%)
tsa-tutorialMaterial for the tutorial, "Time series analysis with pandas" at T-Academy
Stars: ✭ 21 (-51.16%)