OpenubaA robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-68.17%)
IATI.cloudThe open-source IATI datastore for IATI data with RESTful web API providing XML, JSON, CSV output. It extracts and parses IATI XML files referenced in the IATI Registry and powered by Apache Solr.
Stars: ✭ 35 (-91.23%)
MaisUniversalizando o acesso a dados no Brasil. Docs: https://basedosdados.github.io/mais/
Stars: ✭ 122 (-69.42%)
blogpost codesRepo of my blogpost articles codes
Stars: ✭ 41 (-89.72%)
OpendataSkillCorner Open Data with 9 matches of broadcast tracking data.
Stars: ✭ 86 (-78.45%)
BeastLoad data from Kafka to any data warehouse
Stars: ✭ 119 (-70.18%)
GgstatsplotEnhancing `ggplot2` plots with statistical analysis 📊🎨📣
Stars: ✭ 1,121 (+180.95%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-86.72%)
SkaterPython Library for Model Interpretation/Explanations
Stars: ✭ 973 (+143.86%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+122.31%)
DitrasDITRAS (DIary-based TRAjectory Simulator), a mathematical model to simulate human mobility
Stars: ✭ 19 (-95.24%)
PyreadstatPython package to read sas, spss and stata files into pandas data frames. It is a wrapper for the C library readstat.
Stars: ✭ 151 (-62.16%)
Datastream.ioAn open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Stars: ✭ 814 (+104.01%)
qinstDraft of generic instrumentation tool based on QEMU using eBPF to implement trivial instrumentations with trivial code
Stars: ✭ 17 (-95.74%)
ModinModin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+1563.91%)
SwifterA package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Stars: ✭ 1,844 (+362.16%)
VegasThe missing MatPlotLib for Scala + Spark
Stars: ✭ 709 (+77.69%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (-79.7%)
Business Machine LearningA curated list of practical business machine learning (BML) and business data science (BDS) applications for Accounting, Customer, Employee, Legal, Management and Operations (by @firmai)
Stars: ✭ 575 (+44.11%)
Gspread DataframeRead/write Google spreadsheets using pandas DataFrames
Stars: ✭ 118 (-70.43%)
Or Pandas【运筹OR帷幄|数据科学】pandas教程系列电子书
Stars: ✭ 492 (+23.31%)
grafana-pandas-datasourceGrafana Pandas Datasource - using Python for generating timeseries-, table-data and annotations
Stars: ✭ 38 (-90.48%)
Kranglkrangl is a {K}otlin DSL for data w{rangl}ing
Stars: ✭ 430 (+7.77%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+363.91%)
Linq To BigqueryLINQ to BigQuery is C# LINQ Provider for Google BigQuery. It also enables Desktop GUI Client with LINQPad and plug-in driver.
Stars: ✭ 69 (-82.71%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-29.07%)
Tracker-AggregatorAn abstraction layer for analytics in your app to keep your tracking code clean and reusable.
Stars: ✭ 17 (-95.74%)
Metaflow🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+1180.2%)
Spark BigqueryGoogle BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-83.71%)
140stories140Stories: Collaborative stories 140 chars at a time.
Stars: ✭ 14 (-96.49%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+139.1%)
autonomioCore functionality for the Autonomio augmented intelligence workbench.
Stars: ✭ 27 (-93.23%)
spark-on-k8s-gcp-examplesExample Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub
Stars: ✭ 36 (-90.98%)
S3bpRead and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.
Stars: ✭ 24 (-93.98%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-93.48%)
Datashare ToolkitDIY commercial datasets on Google Cloud Platform
Stars: ✭ 41 (-89.72%)
AdamCoroutine-friendly Android Debug Bridge client written in Kotlin
Stars: ✭ 129 (-67.67%)
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-95.24%)
sklearndfDataFrame support for scikit-learn.
Stars: ✭ 54 (-86.47%)
RemoteNETExamine, create and interact with remote objects in other .NET processes.
Stars: ✭ 29 (-92.73%)
yahoo-historicalDownloads historical EOD (end of day) prices from yahoo finance
Stars: ✭ 96 (-75.94%)
schrutepyThe Entire Transcript from the Office in Tidy Format
Stars: ✭ 22 (-94.49%)
ob google-bigqueryThis service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about installation, configuration or ongoing maintenance related to an SDK environment. This can be helpful to those who would prefer to not to be responsible for those activities.
Stars: ✭ 43 (-89.22%)