All Projects → jldbc → Coffee Quality Database

jldbc / Coffee Quality Database

Licence: mit
Building the Coffee Quality Institute Database

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Coffee Quality Database

Retriever
Quickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (+70.92%)
Mutual labels:  data-science, dataset, data
Ml Pyxis
Tool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Stars: ✭ 93 (-34.04%)
Mutual labels:  data-science, dataset, data
Awesome Twitter Data
A list of Twitter datasets and related resources.
Stars: ✭ 533 (+278.01%)
Mutual labels:  data-science, dataset, data
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+21.28%)
Mutual labels:  data-science, dataset, data
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+93.62%)
Mutual labels:  data-science, dataset, data
Datascience course
Curso de Data Science em Português
Stars: ✭ 294 (+108.51%)
Mutual labels:  data-science, dataset, data
Dbg Pds
Deutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (-12.06%)
Mutual labels:  data-science, dataset, data
Flyte
Accelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+780.85%)
Mutual labels:  data-science, data
Iso 3166 Countries With Regional Codes
ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets
Stars: ✭ 1,372 (+873.05%)
Mutual labels:  dataset, data
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+877.3%)
Mutual labels:  data-science, data
Hass Data Detective
Explore and analyse your Home Assistant data
Stars: ✭ 109 (-22.7%)
Mutual labels:  data-science, data
Openml R
R package to interface with OpenML
Stars: ✭ 81 (-42.55%)
Mutual labels:  data-science, dataset
Gopup
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+771.63%)
Mutual labels:  data-science, data
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-43.97%)
Mutual labels:  data-science, dataset
Magicbox
A platform that uses real-time data to inform life-saving humanitarian responses to emergency situations
Stars: ✭ 73 (-48.23%)
Mutual labels:  data-science, data
Awesome Bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+7331.21%)
Mutual labels:  data-science, data
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+971.63%)
Mutual labels:  data-science, data
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-23.4%)
Mutual labels:  data-science, data
Python Pytest Cases
Separate test code from test cases in pytest.
Stars: ✭ 127 (-9.93%)
Mutual labels:  dataset, data
Githubrankingsspain
⬆️ Rankings with the most active GitHub users in Spain (sorted by public contributions) 🇪🇸
Stars: ✭ 127 (-9.93%)
Mutual labels:  dataset, data

coffee-quality-database

Digitizing 1,340 coffee reviews

Data

These data contain reviews of 1312 arabica and 28 robusta coffee beans from the Coffee Quality Institute's trained reviewers. The features include:

Quality Measures

  • Aroma
  • Flavor
  • Aftertaste
  • Acidity
  • Body
  • Balance
  • Uniformity
  • Cup Cleanliness
  • Sweetness
  • Moisture
  • Defects

Bean Metadata

  • Processing Method
  • Color
  • Species (arabica / robusta)

Farm Metadata

  • Owner
  • Country of Origin
  • Farm Name
  • Lot Number
  • Mill
  • Company
  • Altitude
  • Region

The data folder contains both raw and cleaned data. The raw data is exactly as it was found on the CQI site. Since these human-recorded data use a variety of different encodings, abbreviations, and units of measurement for their farm names, altitude, region, and other fields, I recommend using the cleaned data as a starting point.

The site was scraped using a Selenium headless browser and Beautiful Soup. To replicate this or collect updated data, create a login for the CQI site and enter your credentials in the scraper

Source

These data were collected from the Coffee Quality Institute's review pages in January 2018.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].