All Projects → UWQuickstep → quickstep

UWQuickstep / quickstep

Licence: Apache-2.0 license
Quickstep project

Programming Languages

C++
36643 projects - #6 most used programming language
CMake
9771 projects
c
50402 projects - #5 most used programming language
python
139335 projects - #7 most used programming language
shell
77523 projects
Makefile
30231 projects

Projects that are alternatives of or similar to quickstep

Ignite Book Code Samples
All code samples, scripts and more in-depth examples for the book high performance in-memory computing with Apache Ignite. Please use the repository "the-apache-ignite-book" for Ignite version 2.6 or above.
Stars: ✭ 86 (+290.91%)
Mutual labels:  high-performance, in-memory
Sharedhashfile
Share Hash Tables With Stable Key Hints Stored In Memory Mapped Files Between Arbitrary Processes
Stars: ✭ 380 (+1627.27%)
Mutual labels:  high-performance, in-memory
heurist
Core development repository. gitHub: Vsn 6 (2020 - ), Vsn 5 (2018 - 2020), Vsn 4 (2014-2017). Sourceforge: Vsn 3 (2009-2013), Vsn 1 & 2 (2005-2009)
Stars: ✭ 39 (+77.27%)
Mutual labels:  database-management
keshi
A better in-memory cache for Node and the browser
Stars: ✭ 76 (+245.45%)
Mutual labels:  in-memory
Papers4DataAchitect
Collect papers for data engineering such as OLTP/OLAP/ETL/DistributedStorage.
Stars: ✭ 17 (-22.73%)
Mutual labels:  olap
sched
⏳ a high performance reliable task scheduling package in Go.
Stars: ✭ 46 (+109.09%)
Mutual labels:  high-performance
core
Enterprise Grade #NodeJS Platform implementing Industry Standards & Patterns in order to provide Connectivity, Stability, High-Availability and High-Performance
Stars: ✭ 54 (+145.45%)
Mutual labels:  high-performance
torrentit
Telegram bot for downloading torrents without storage
Stars: ✭ 33 (+50%)
Mutual labels:  in-memory
sqlite-gui
Lightweight SQLite editor for Windows
Stars: ✭ 151 (+586.36%)
Mutual labels:  database-management
fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Stars: ✭ 123 (+459.09%)
Mutual labels:  high-performance
insightedge
InsightEdge Core
Stars: ✭ 22 (+0%)
Mutual labels:  in-memory
magedbm
💾 Magento 1.x Database Backup Manager
Stars: ✭ 38 (+72.73%)
Mutual labels:  database-management
exposed
High performance RPC framework
Stars: ✭ 28 (+27.27%)
Mutual labels:  high-performance
Clinic-Management-System-ASP.NET
👨‍⚕️ A fully featured Clinic Management System based on three tier architecture made using ASP.NET, C# with a well documented README.md file.
Stars: ✭ 82 (+272.73%)
Mutual labels:  database-management
0chain
0Chain is a decentralized blockchain-based storage platform with built-in privacy and security compliance. It provides high performance, enforceable SLAs, choice of providers for all enterprise grade applications.
Stars: ✭ 79 (+259.09%)
Mutual labels:  high-performance
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+10627.27%)
Mutual labels:  olap
HTAPBench
Benchmark suite to evaluate HTAP database engines
Stars: ✭ 15 (-31.82%)
Mutual labels:  olap
Ming
MongoDB ODM (Object Document Mapper) with Unit of Works, IdentityMap, Relations and Mongo-In-Memory implementation
Stars: ✭ 19 (-13.64%)
Mutual labels:  in-memory
maricutodb
PHP Flat File Database Manager
Stars: ✭ 23 (+4.55%)
Mutual labels:  database-management
netty-in-action-cn
Netty In Action 中文版
Stars: ✭ 1,389 (+6213.64%)
Mutual labels:  high-performance

Quickstep

Travis Widget

What is Quickstep?

Quickstep is high-performance database engine designed to exploit the full potential of hardware that is packed in modern computing boxes (servers and laptops). This version targets single-node in-memory environments. If your data spills overs the memory limit Quickstep will still work, so you don't have to obsessively worry about the in-memory part. Also, if your working set fits in memory then Quickstep will transparently and automatically figure that out, and cache that hot set to deliver in-memory performance.

Why Quickstep?

Did you know that the hardware that you have in your laptop was spread across a small cluster just a decade ago? (PS: Hopefully you are not using a very old laptop!) If you look at a high-end server box, then that packs compute and storage power that was a full rack about 5 years ago! And, the way hardware technology is going, that box is going to become even more powerful in the future. In fact, it is likely that the computing power in each box is going to grow faster than other hardware components (e.g. networking) in data centers. So, if you care about performance and/or total operating costs, paying attention to single box performance is likely to be super important in the long run.

In other words there is a small data center in an individual compute boxes today! Quickstep aims to allow you to fully exploit the potential of that data center that is hidden in each individual box today. We call this the scaling-in approach, and it complements a scaling-out approach. But without scaling-in, you are overpaying (by a lot!) when you run your data service.

What are the key ingredients?

Modern computing boxes contain a large number of computing cores and large main memory configuration. Quickstep allows you to fully exploit these hardware resources using novel data processing, data storage, and query processing methods that include:

  1. A unique decoupling of data-flow from control-flow for query execution that allows for unlimited intra and inter-query parallelism. Thus, using all the processing core effectively.

  2. A template meta-programming framework that provides fast vectorized query execution. Thus, using each processor cycle very efficiently.

  3. A hybrid data storage architecture that includes columnar and row-store. Yes, this may surprise some of you, but sometimes a row-store beats a column-store!

And, it is open source!

Giving it a spin

  1. Checkout the code: git clone https://github.com/UWQuickstep/quickstep.git
  2. Then, go to the code directory: cd quickstep
  3. Initialize the dependencies: git submodule init
  4. Checkout the dependencies: git submodule update
  5. Download additional third-party dependencies and apply patches:
    cd third_party && ./download_and_patch_prerequisites.sh && cd ../
  6. Go into the build directory: cd build
  7. Create the Makefile: cmake -D CMAKE_BUILD_TYPE=Release ..
  8. Build: make -j4. Note you may replace the 4 with the number of cores on your machine.
  9. Start quickstep: ./quickstep_cli_shell --initialize_db=true. You can now fire SQL queries. To quit, you can type in quit; Your data is stored in the directory qsstor. Note the next time you start Quickstep, you can omit the --initialize_db flag (as the database has already been initialized), and simply start Quickstep as: ./quickstep_cli_shell. There are also a number of optional flags that you can specify, and to see the full list, you can type in: ./quickstep_cli_shell --help
  10. Next let us load some data and fire some queries. A few points to note: The SQL surface of Quickstep is small (it will grow over time). The traditional SQL CREATE TABLE and SELECT statements work. The data types that are supported include INTEGER, FLOAT, DOUBLE, VARCHAR, CHAR, DATE, and DATETIME. Quickstep also does not have support for NULLS or keys (yet). Let us create two tables by typing into the Quickstep shell (which you opened in the step above), the following SQL command:
CREATE TABLE Weather (cid INTEGER, recordDate DATE, highTemperature FLOAT, lowTemperature FLOAT);

and then,

CREATE TABLE City (cid Integer, name VARCHAR(80), state CHAR(2));
  1. Next, let us insert some tuples in these two tables.

    INSERT INTO City VALUES (1, 'Madison', 'WI');
    INSERT INTO City VALUES (2, 'Palo Alto', 'CA');
    INSERT INTO Weather VALUES (1, '2015-11-1', 50, 30);
    INSERT INTO Weather VALUES (1, '2015-11-2', 51, 32);
    INSERT INTO Weather VALUES (2, '2015-11-1', 60, 50);
    
  2. We can now issue SQL queries such as: a. Find all weather records for California:

SELECT * FROM WEATHER W, City C WHERE C.cid = W.cid AND C.state = 'CA';

b. Find the min and max temperature for each city, printing the cid:

SELECT cid, MIN(lowTemperature), MAX(highTemperature) FROM Weather GROUP BY cid;

c. Find the min and max temperature for each city using a nested query, and printing thie city name:

SELECT * FROM City C, (SELECT cid, MIN(lowTemperature), MAX(highTemperature) FROM Weather GROUP BY cid) AS T WHERE C.cid = T.cid;
  1. Quickstep also supports a COPY TABLE command. If you want to try that, then from a separate shell file type in the following:

    echo "3|2015-11-3|49|29" > /tmp/tmp.tbl
    echo "3|2015-11-4|48|28" >> /tmp/tmp.tbl
    echo "3|2015-11-5|47|27" >> /tmp/tmp.tbl
    

    Then, load this new data by typing the following SQL in the Quickstep shell:

    COPY Weather FROM '/tmp/tmp.tbl' WITH (DELIMITER '|');
    

    Now, you have loaded three more tuples into the Weather table, and you can fire the SQL queries above again against this modified database.

    Remember, to quit Quickstep, you can type in quit; into the Quickstep shell.

Additional pointers

  1. For other build options, see the more comprehensive build guide.
  2. To get started as a developer, you should start with the code organization guide.

Licensing

Quickstep is licensed under the Apache License, Version 2.0. See LICENSE for the full license text.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].