All Projects → arbox → Data Science With Ruby

arbox / Data Science With Ruby

Licence: cc0-1.0
Practical Data Science with Ruby based tools.

Programming Languages

ruby
36898 projects - #4 most used programming language

Projects that are alternatives of or similar to Data Science With Ruby

Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-68.85%)
Mutual labels:  data-science, data-analysis, data-mining, data-analytics, data-visualization
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+28.23%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Datascience
Curated list of Python resources for data science.
Stars: ✭ 3,051 (+455.74%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Deepgraph
Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (-57.74%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+125.5%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-1.46%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-95.99%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+7665.76%)
Mutual labels:  data-science, data-analysis, data-analytics, data-visualization
Pydataroad
open source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-44.99%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Awesome Scientific Python
A curated list of awesome scientific Python resources
Stars: ✭ 127 (-76.87%)
Mutual labels:  data-science, data-analysis, list, data-visualization
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-60.29%)
Mutual labels:  data-science, data-analysis, data-mining, data-visualization
Cjworkbench
The data journalism platform with built in training
Stars: ✭ 244 (-55.56%)
Mutual labels:  data-science, data-analysis, data-visualization
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-17.3%)
Mutual labels:  data-science, data-analysis, data-visualization
Pyod
A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+825.87%)
Mutual labels:  data-science, data-analysis, data-mining
Book Socialmediaminingpython
Companion code for the book "Mastering Social Media Mining with Python"
Stars: ✭ 462 (-15.85%)
Mutual labels:  data-science, data-mining, data-visualization
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+474.13%)
Mutual labels:  data-science, data-mining, data-visualization
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (-56.28%)
Mutual labels:  data-science, data-analysis, data-visualization
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-50.27%)
Mutual labels:  data-science, data-analysis, data-visualization
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-49.91%)
Mutual labels:  data-science, data-analysis, data-mining
Awesome Datascience
📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+3091.26%)
Mutual labels:  data-science, data-mining, data-visualization

[RubyNLP | RubyML | RubyInterop]

Awesome Data Science with Ruby Awesome

Links and Resources for Data Processing and Analysis in Ruby

Data Science is a new "sexy" buzzword without specific meaning but often used to substitute Statistics, Scientific Computing, Text and Data Mining and Visualization, Machine Learning, Data Processing and Warehousing as well as Retrieval Algorithms of any kind.

This curated list comprises awesome tutorials, libraries, information sources about various Data Science applications using the Ruby programming language.

A lot of useful resources on this list come from the development by The Ruby Science Foundation, our contributors and our own day to day work on various data intensive applications. Read why this list is awesome.

✨ Every contribution is welcome! Add links through pull requests or create an issue to start a discussion.

Follow us on Twitter and please spread the word using the #RubyDataScience hash tag!

Contents

Ruby vs. Python vs. Julia vs. R

Ruby Python Julia R
Daru Pandas
NArray NumPy

Standing on the shoulders of giants

Ruby is (for now) not a Data Science centric language with a very large established library. Leveraging libraries from R, Python, and Julia helps Ruby to solve your tasks!

Data Manipulation

  • kiba — lightweight Ruby ETL (Extract-Transform-Load) framework.
  • jongleur — Workflow manager using DAG definitions to execute ETL tasks.

Distributed Computing

Data Structures

  • daru — Data Frame and Vector structures with comprehensive manipulating and visualization methods.
  • numo-narray — n-dimensional Numerical Array for Ruby.
  • nmatrix — dense and sparse linear algebra library for Ruby via SciRuby.
  • kdtree — blazingly fast native 2d k-d tree.
  • mdarray — Array structure for JRuby.
  • spreadsheet — manipulation library for MS Excel spreadsheets.
  • networkx — Ruby based NetworkX clone that handles various usecases of the Graph Data Structure.
  • cumo — CUDA-aware numerical Array library with NArray similar interface.

Data sets

  • rdatasets — Data sets available in R via Rdatasets.
  • red-datasets — Growing collection of publicly available data sets such as CIFAR-10, Iris, MNIST etc.

Statistics

Numeric and Symbolic Computation

Visualization

Comprehensive tools for Data Visualization.

Interactive Computing

Input and Output

General formats

Database Adapters

  • pg
  • Mongo
  • MySQL

Domain specific formats

  • BibTeX
  • inih — fast C based INI parser for Ruby.
  • bolognese — conversion tool for citation formats like BibTeX, RIS, or Crossref XML.

Provisioning Infrastructure

Machine Learning

Please look at our extensive Awesome ML with Ruby list.

Articles, Posts, Talks, and Presentations

Community

Related resources

Wait but why?

There are a lot of software lists with tools related to the Data Science. There are a couple of lists with Ruby related projects. There are no lists of only working and tested software with documented scope. We'll try to make one!

What is awesome? Awesome are documented, maintained and focused tools.

Can something turn not awesome at a point? Yes! Abandoned projects with broken dependencies aren't awesome any more! They leave this list.

License

Creative Commons Zero 1.0 Awesome Data Science with Ruby by Andrei Beliankou and Contributors.

To the extent possible under law, the person who associated CC0 with Awesome Data Science with Ruby has waived all copyright and related or neighboring rights to Awesome Data Science with Ruby.

You should have received a copy of the CC0 legalcode along with this work. If not, see https://creativecommons.org/publicdomain/zero/1.0/.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].