hi-primus / bumblebee

License: Apache-2.0
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)

Programming Languages

  • Vue: 7211 projects
  • JavaScript: 184084 projects (#8 most used programming language)
  • TypeScript: 32286 projects
  • CSS: 56736 projects
  • SCSS: 7915 projects
  • Python: 139335 projects (#7 most used programming language)

Projects that are alternatives to or similar to bumblebee

optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+1025.83%)
Mutual labels:  dask, data-preparation, data-cleaning, data-profiling, cudf, dask-cudf
foofah
Foofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (-80%)
Mutual labels:  data-preparation, data-cleaning
reskit
A library for creating and curating reproducible pipelines for scientific and industrial machine learning
Stars: ✭ 27 (-77.5%)
Mutual labels:  data-preparation, prepare-data
allie
🤖 A machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers).
Stars: ✭ 93 (-22.5%)
Mutual labels:  datasets, data-cleaning
datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+249.17%)
Mutual labels:  dask, data-profiling
covid-19-data-cleanup
Scripts to cleanup data from https://github.com/CSSEGISandData/COVID-19
Stars: ✭ 25 (-79.17%)
Mutual labels:  datasets, data-cleaning
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-85%)
Mutual labels:  datasets
kaggledatasets
Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (-63.33%)
Mutual labels:  datasets
Google-Playstore-Dataset
Google PlayStore App dataset. (2.3 million App Data) and 24 attributes
Stars: ✭ 27 (-77.5%)
Mutual labels:  datasets
cifair
A duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-89.17%)
Mutual labels:  datasets
scRNAseq cell cluster labeling
Scripts to run and benchmark scRNA-seq cell cluster labeling methods
Stars: ✭ 41 (-65.83%)
Mutual labels:  datasets
big-data-exploration
[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (-64.17%)
Mutual labels:  datasets
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+837.5%)
Mutual labels:  datasets
errorlocate
Find and replace erroneous fields in data using validation rules
Stars: ✭ 19 (-84.17%)
Mutual labels:  data-cleaning
Thirukkural-Tamil-Dataset
Thirukkural by Thiruvalluvar.
Stars: ✭ 44 (-63.33%)
Mutual labels:  datasets
data-profiling
a set of scripts to pull meta data and data profiling metrics from relational database systems
Stars: ✭ 57 (-52.5%)
Mutual labels:  data-profiling
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-80.83%)
Mutual labels:  data-cleaning
open2ch-dialogue-corpus
A dialogue corpus created by crawling Open2ch.
Stars: ✭ 65 (-45.83%)
Mutual labels:  datasets
Spatio-Temporal-papers
This project is a collection of recent research in areas such as new infrastructure and urban computing, including white papers, academic papers, AI lab and dataset etc.
Stars: ✭ 180 (+50%)
Mutual labels:  datasets
rs datasets
Tool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-81.67%)
Mutual labels:  datasets

Bumblebee

A tool to clean, transform, and prepare data of any size for analysis, visualization, reporting, and machine learning, all in a spreadsheet-like interface. Bumblebee is built on Optimus, so it can handle both small and big data efficiently.

Bumblebee can be used to:

  • Explore data using an ergonomic UI
  • Clean and transform datasets with more than 100 functions available
  • Prepare data for Machine Learning
  • Join and concatenate your datasets with a visual interface

Try Bumblebee

Try Bumblebee using this Docker image.

docker run --name my_instance_name -p 3000:3000 -p 4000:4000 -e ADDRESS=localhost hiprimus/bumblebee:develop
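
Once the container is running, it can take a moment before the app responds. The sketch below is not part of Bumblebee; it is a small, hypothetical helper that polls the two ports published by the command above (3000 and 4000) until both accept TCP connections. The function names, the retry count, and the delay are assumptions chosen for illustration.

```python
import socket
import time


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


def wait_for_ports(host, ports, attempts=30, delay=1.0):
    """Poll the given ports; return the list of ports that are still closed."""
    pending = list(ports)
    for _ in range(attempts):
        # Keep only the ports that do not accept connections yet.
        pending = [p for p in pending if not port_is_open(host, p)]
        if not pending:
            break
        time.sleep(delay)
    return pending
```

Calling `wait_for_ports("localhost", [3000, 4000])` returns an empty list once both ports are reachable; any port numbers left in the result did not come up within the polling window.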

Contributing to Bumblebee

Contributions go far beyond pull requests and commits. We are happy to receive contributions of any kind, including:

  • Documentation updates, enhancements, designs, or bugfixes.
  • Spelling or grammar fixes.
  • README.md corrections or redesigns.
  • Adding unit or functional tests.
  • Triaging GitHub issues, especially determining whether an issue still persists or is reproducible.
  • Joining our Slack community and helping others who need it.
  • Blogging, speaking about, or creating tutorials about Bumblebee and its many features.