All Projects → krlmlr → Dm

krlmlr / Dm

Licence: other
Relational data models

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Dm

database
Relational database access made simpler and safer
Stars: ✭ 40 (-84.31%)
Mutual labels:  relational-databases
db seeder
Relational database data generator..
Stars: ✭ 36 (-85.88%)
Mutual labels:  relational-databases
CEDS-IDS
The CEDS Integrated Data Store factors the entities and attributes of the CEDS Domain Entity Schema (DES) with standard technical syntax and 3rd normal form database normalization. The IDS Logical Model provides a standard framework for integration of P-20 data systems through a well-normalized “operational data store”. In a P-20 data system, th…
Stars: ✭ 29 (-88.63%)
Mutual labels:  relational-databases
aws-dbs-refarch-rdbms
Reference Architectures for Relational Databases on AWS
Stars: ✭ 23 (-90.98%)
Mutual labels:  relational-databases
framework
Solu Framework is a full featured, ORM-backed, isomorphic framework using RPython, Pouch/CouchDB and React.
Stars: ✭ 20 (-92.16%)
Mutual labels:  relational-databases
RDMP
Research Data Management Platform (RDMP) is an open source application for the loading,linking,anonymisation and extraction of datasets stored in relational databases.
Stars: ✭ 20 (-92.16%)
Mutual labels:  relational-databases
brmodelo-app
brModeloWeb is a free open source entity-relationship database modeling tool. We try to make learning database modeling simple and accessible for everyone.
Stars: ✭ 289 (+13.33%)
Mutual labels:  relational-databases
hsdatalog
BDD-based implementation of Datalog
Stars: ✭ 30 (-88.24%)
Mutual labels:  relational-databases
spiced-final-project
Career explorer platform developed in React.js in 6 days.
Stars: ✭ 14 (-94.51%)
Mutual labels:  relational-databases
beam-nuggets
Collection of transforms for the Apache beam python SDK.
Stars: ✭ 64 (-74.9%)
Mutual labels:  relational-databases
generaptr
Generaptr is a node package that helps when starting up a project by generating boilerplate code for Express api.
Stars: ✭ 16 (-93.73%)
Mutual labels:  relational-databases
datajoint-python
Relational data pipelines for the science lab
Stars: ✭ 140 (-45.1%)
Mutual labels:  relational-databases
oesophagus
Enterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Stars: ✭ 12 (-95.29%)
Mutual labels:  relational-databases
BirDayBer
'BirDayBer' is an application made for irresponsible people with friends or family birthdays like me. So it allows you to add birthdays and other minimal information to a database to notify you to remember them.
Stars: ✭ 22 (-91.37%)
Mutual labels:  relational-databases
dm
Working with relational data models in R
Stars: ✭ 358 (+40.39%)
Mutual labels:  relational-databases
carrot
Autumn 2017. A simple implementation of relational database with query optimization as the course project of Principles and Design of Database System, Renmin University of China.
Stars: ✭ 15 (-94.12%)
Mutual labels:  relational-databases
AlgebraicRelations.jl
Relational Algebra, now with more algebra!
Stars: ✭ 31 (-87.84%)
Mutual labels:  relational-databases
pireal
Relational Algebra Interpreter writting in Python and Qt
Stars: ✭ 31 (-87.84%)
Mutual labels:  relational-databases
lighthouse
Easy clojure relational database queries, migrations and connection pooling
Stars: ✭ 19 (-92.55%)
Mutual labels:  relational-databases
jds
Jenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining
Stars: ✭ 17 (-93.33%)
Mutual labels:  relational-databases

dm

Lifecycle: maturing R build status Codecov test coverage CRAN status Launch rstudio.cloud

TL;DR

Are you using multiple data frames or database tables in R? Organize them with dm.

  • Use it today (if only like a list of tables).
  • Build data models tomorrow.
  • Deploy the data models to your organization’s RDBMS the day after.

Overview

dm bridges the gap in the data pipeline between individual data frames and relational databases. It’s a grammar of joined tables that provides a consistent set of verbs for consuming, creating, and deploying relational data models. For individual researchers, it broadens the scope of datasets they can work with and how they work with them. For organizations, it enables teams to quickly and efficiently create and share large, complex datasets.

dm objects encapsulate relational data models constructed from local data frames or lazy tables connected to an RDBMS. dm objects support the full suite of dplyr data manipulation verbs along with additional methods for constructing and verifying relational data models, including key selection, key creation, and rigorous constraint checking. Once a data model is complete, dm provides methods for deploying it to an RDBMS. This allows it to scale from datasets that fit in memory to databases with billions of rows.

Features

dm makes it easy to bring an existing relational data model into your R session. As the dm object behaves like a named list of tables it requires little change to incorporate it within existing workflows. The dm interface and behavior is modeled after dplyr, so you may already be familiar with many of its verbs. dm also offers:

  • visualization to help you understand relationships between entities represented by the tables
  • simpler joins that “know” how tables are related, including a “flatten” operation that automatically follows keys and performs column name disambiguation
  • consistency and constraint checks to help you understand (and fix) the limitations of your data

That’s just the tip of the iceberg. See Getting started to hit the ground running and explore all the features.

Installation

The latest stable version of the {dm} package can be obtained from CRAN with the command

install.packages("dm")

The latest development version of {dm} can be installed from GitHub.

# install.packages("devtools")
devtools::install_github("krlmlr/dm")

Usage

Create a dm object (see Getting started for details).

library(dm)
dm <- dm_nycflights13()
dm
#> ── Metadata ────────────────────────────────────────────────────────────────────
#> Tables: `airlines`, `airports`, `flights`, `planes`, `weather`
#> Columns: 53
#> Primary keys: 3
#> Foreign keys: 3

dm is a named list of tables:

names(dm)
#> [1] "airlines" "airports" "flights"  "planes"   "weather"
nrow(dm$airports)
#> [1] 1458
dm$flights %>%
  count(origin)
#> # A tibble: 3 x 2
#>   origin     n
#> * <chr>  <int>
#> 1 EWR     4043
#> 2 JFK     3661
#> 3 LGA     3523

Visualize relationships at any time:

dm %>%
  dm_draw()

Simple joins:

dm %>%
  dm_flatten_to_tbl(flights)
#> Renamed columns:
#> * year -> flights.year, planes.year
#> * name -> airlines.name, airports.name
#> # A tibble: 11,227 x 35
#>    flights.year month   day dep_time sched_dep_time dep_delay arr_time
#>           <int> <int> <int>    <int>          <int>     <dbl>    <int>
#>  1         2013     1    10        3           2359         4      426
#>  2         2013     1    10       16           2359        17      447
#>  3         2013     1    10      450            500       -10      634
#>  4         2013     1    10      520            525        -5      813
#>  5         2013     1    10      530            530         0      824
#>  6         2013     1    10      531            540        -9      832
#>  7         2013     1    10      535            540        -5     1015
#>  8         2013     1    10      546            600       -14      645
#>  9         2013     1    10      549            600       -11      652
#> 10         2013     1    10      550            600       -10      649
#> # … with 11,217 more rows, and 28 more variables: sched_arr_time <int>,
#> #   arr_delay <dbl>, carrier <chr>, flight <int>, tailnum <chr>, origin <chr>,
#> #   dest <chr>, air_time <dbl>, distance <dbl>, hour <dbl>, minute <dbl>,
#> #   time_hour <dttm>, airlines.name <chr>, airports.name <chr>, lat <dbl>,
#> #   lon <dbl>, alt <dbl>, tz <dbl>, dst <chr>, tzone <chr>, planes.year <int>,
#> #   type <chr>, manufacturer <chr>, model <chr>, engines <int>, seats <int>,
#> #   speed <int>, engine <chr>

Check consistency:

dm %>%
  dm_examine_constraints()
#> ! Unsatisfied constraints:
#> ● Table `flights`: foreign key tailnum into table `planes`: 1640 entries (14.6%) of `flights$tailnum` not in `planes$tailnum`: N722MQ (27), N725MQ (20), N520MQ (19), N723MQ (19), N508MQ (16), …

Learn more in the Getting started article.

Getting help

If you encounter a clear bug, please file an issue with a minimal reproducible example on GitHub. For questions and other discussion, please use community.rstudio.com.


License: MIT © cynkra GmbH.

Funded by:

energie360° cynkra


Please note that the ‘dm’ project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].