DaFlow
Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
Stars: ✭ 24 (-97.15%)
Edaviz
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Stars: ✭ 220 (-73.9%)
Notebooks
All of our computational notebooks
Stars: ✭ 292 (-65.36%)
Octosql
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
Stars: ✭ 2,579 (+205.93%)
Pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+61.92%)
Dataproofer
A proofreader for your data
Stars: ✭ 628 (-25.5%)
Miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Stars: ✭ 4,633 (+449.58%)
tutorials
Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-98.34%)
datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (-50.3%)
Zebras
Data analysis library for JavaScript built with Ramda
Stars: ✭ 192 (-77.22%)
validada
Another library for defensive data analysis.
Stars: ✭ 29 (-96.56%)
Dominando-Pandas
This repository is intended for learning the Pandas library.
Stars: ✭ 22 (-97.39%)
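Ai Learn
An artificial intelligence learning roadmap that collects nearly 200 hands-on cases and projects, with free accompanying teaching materials, taking beginners from zero to job-ready practice. Covers popular areas including Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch, tensorflow, machine-learning, deep-learning, data-analysis, data-mining, mathematics, data-science, artificial-intelligence, python, tensorflow, tensorflow2, caffe, keras, pytorch, algorithm, numpy, pandas, matplotlib, seaborn, nlp, and cv.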
Stars: ✭ 4,387 (+420.4%)
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-64.06%)
tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc.), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (-74.85%)
Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-96.44%)
tv
📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.
Stars: ✭ 1,763 (+109.13%)
Pyda 2e Zh
📖 [Chinese translation] Python for Data Analysis, 2nd Edition
Stars: ✭ 866 (+2.73%)
Siuba
Python library for using dplyr-like syntax with pandas and SQL
Stars: ✭ 605 (-28.23%)
Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+844.6%)
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Stars: ✭ 32,029 (+3699.41%)
PracticalMachineLearning
A collection of ML-related material, including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not as in free food) or open source.
Stars: ✭ 60 (-92.88%)
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+104.51%)
Data Forge Js
JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (-83.51%)
Dtale Desktop
Build a data visualization dashboard with simple snippets of Python code
Stars: ✭ 128 (-84.82%)
pandas-workshop
An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (-80.9%)
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain the data visualization skills required to progress in their careers. It includes all the types of plots offered by Seaborn, applied to random datasets.
Stars: ✭ 114 (-86.48%)
Pytablewriter
pytablewriter is a Python library to write a table in various formats: CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.
Stars: ✭ 422 (-49.94%)
Volbx
Graphical tool for data manipulation written in C++/Qt
Stars: ✭ 187 (-77.82%)
Vscode Data Preview
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (-70.94%)
Hemmelig.app
Keep your sensitive information out of chat logs, emails, and more with encrypted secrets.
Stars: ✭ 183 (-78.29%)
Emma
Emma Memory and Mapfile Analyser
Stars: ✭ 21 (-97.51%)
pandoc-placetable
Pandoc filter to include CSV data (from file or URL)
Stars: ✭ 35 (-95.85%)
Textrude
Code generation from YAML/JSON/CSV models via SCRIBAN templates
Stars: ✭ 79 (-90.63%)
dogETL
A library for transforming data between JDBC, CSV, and JSON.
Stars: ✭ 15 (-98.22%)
pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 970 (+15.07%)
SchemaMapper
A .NET class library that allows you to import data from different sources into a unified destination
Stars: ✭ 41 (-95.14%)
cookie-consent-js
A simple dialog and framework for handling German and EU cookie law on a website (December 2021)
Stars: ✭ 55 (-93.48%)
nebula
A distributed block-based data storage and compute engine
Stars: ✭ 127 (-84.93%)
onelinerhub
2.5k code solutions with clear explanations @ onelinerhub.com
Stars: ✭ 645 (-23.49%)
datascienv
datascienv is a package that helps you set up your environment, with all dependencies, in a single line of code. It also includes pyforest, which imports all required ML libraries in a single line.
Stars: ✭ 53 (-93.71%)
uzbekistan-regions-data
Full database of the regions of Uzbekistan, available in JSON, SQL and CSV formats. All regions, districts and quarters, with Latin, Cyrillic and Russian versions. (Districts (tumans) of the Republic of Uzbekistan and cities of regional (republican) subordination)
Stars: ✭ 46 (-94.54%)
react-cookie-law
React Cookie Law is a cookie-info banner compliant with the GDPR and the EU cookie law. It allows the user to give consent in a granular way.
Stars: ✭ 103 (-87.78%)
antz
ANTz immersive 3D data visualization engine
Stars: ✭ 25 (-97.03%)
heidi
heidi: tidy data in Haskell
Stars: ✭ 24 (-97.15%)
CC33Z
Computer Science course
Stars: ✭ 50 (-94.07%)
mercury
Mercury - data visualization and discovery with JavaScript, similar to Apache Zeppelin and Jupyter
Stars: ✭ 29 (-96.56%)
csvtogs
Take a CSV file and create a Google Spreadsheet with the contents
Stars: ✭ 15 (-98.22%)
grailer
Web scraping tool for grailed.com
Stars: ✭ 30 (-96.44%)