DataprooferA proofreader for your data
Stars: ✭ 628 (-29.2%)
Csv File Validator🔧🔦 Validation of CSV file against user defined schema (returns back object with data and invalid messages)
Stars: ✭ 60 (-93.24%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-80.72%)
FilehelpersThe FileHelpers are a free and easy to use .NET library to read/write data from fixed length or delimited records in files, strings or streams
Stars: ✭ 917 (+3.38%)
Awesomecsv🕶️A curated list of awesome tools for dealing with CSV.
Stars: ✭ 305 (-65.61%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-85.91%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+243.97%)
Tsv UtilseBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+36.98%)
Intellij Csv ValidatorCSV validator, highlighter and formatter plugin for JetBrains Intellij IDEA, PyCharm, WebStorm, ...
Stars: ✭ 198 (-77.68%)
csvtogsTake a CSV file and create a Google Spreadsheet with the contents
Stars: ✭ 15 (-98.31%)
flatpackCSV/Tab Delimited and Fixed Length Parser and Writer
Stars: ✭ 55 (-93.8%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-3.72%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (-93.35%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-97.63%)
CsvTextFieldParserA simple CSV parser based on Microsoft.VisualBasic.FileIO.TextFieldParser.
Stars: ✭ 40 (-95.49%)
CodeCompilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-67.64%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-65.95%)
Pm4py CorePublic repository for the PM4Py (Process Mining for Python) project.
Stars: ✭ 313 (-64.71%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+394.59%)
MlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+320.41%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (-60.77%)
Mli ResourcesH2O.ai Machine Learning Interpretability Resources
Stars: ✭ 428 (-51.75%)
Kranglkrangl is a {K}otlin DSL for data w{rangl}ing
Stars: ✭ 430 (-51.52%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+473.06%)
comma spliceFixes CSVs with unquoted commas in values
Stars: ✭ 67 (-92.45%)
csvlixirA CSV reading/writing application for Elixir.
Stars: ✭ 32 (-96.39%)
CursivelyA CSV reader for .NET. Fast, RFC 4180 compliant, and fault tolerant. UTF-8 only.
Stars: ✭ 34 (-96.17%)
csv2latex🔧 Simple script in python to convert CSV files to LaTeX table
Stars: ✭ 54 (-93.91%)
node-emails-from-csvA simple NodeJS aplication that helps sending emails for events. Uses CSV files for target users.
Stars: ✭ 18 (-97.97%)
VBA-CSV-interfaceThe most powerful and comprehensive CSV/TSV/DSV data management library for VBA, providing parsing/writing capabilities compliant with RFC-4180 specifications and a complete set of tools for manipulating records and fields.
Stars: ✭ 24 (-97.29%)
gpx-converterpython package for manipulating gpx files and easily converting gpx to other different formats
Stars: ✭ 54 (-93.91%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-68.09%)
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-69%)
FlatfilesReads and writes CSV, fixed-length and other flat file formats with a focus on schema definition, configuration and speed.
Stars: ✭ 275 (-69%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-97.52%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-97.97%)
Graph Fraud Detection PapersA curated list of fraud detection papers using graph information or graph neural networks
Stars: ✭ 339 (-61.78%)
Test ListsURL testing lists intended for discovering website censorship
Stars: ✭ 236 (-73.39%)
SpecsTechnical specifications and guidelines for implementing Frictionless Data.
Stars: ✭ 403 (-54.57%)
JsonconsA C++, header-only library for constructing JSON and JSON-like data formats, with JSON Pointer, JSON Patch, JSON Schema, JSONPath, JMESPath, CSV, MessagePack, CBOR, BSON, UBJSON
Stars: ✭ 400 (-54.9%)
Metaflow🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+475.87%)
SktimeA unified framework for machine learning with time series
Stars: ✭ 4,741 (+434.5%)
VroomFast reading of delimited files
Stars: ✭ 462 (-47.91%)
Ml From ScratchPython implementations of some of the fundamental Machine Learning models and algorithms from scratch.
Stars: ✭ 20,624 (+2225.14%)
Combo(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Stars: ✭ 481 (-45.77%)
RioA Swiss-Army Knife for Data I/O
Stars: ✭ 467 (-47.35%)
Csvutilcsvutil provides fast and idiomatic mapping between CSV and Go (golang) values.
Stars: ✭ 501 (-43.52%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-30.89%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-29.88%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-39.01%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-20.63%)
Industry Machine LearningA curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Stars: ✭ 6,077 (+585.12%)
Pyclusteringpyclustring is a Python, C++ data mining library.
Stars: ✭ 806 (-9.13%)