umbrellio / sequel-batches

Licence: MIT license

Sequel extension for querying large datasets in batches

Programming Languages

ruby

36898 projects - #4 most used programming language

Projects that are alternatives of or similar to sequel-batches

sequel-seed

A Sequel extension to make seeds/fixtures manageable like migrations

Stars: ✭ 25 (+56.25%)

Mutual labels: sequel

sequel-pg advisory lock

Extension for ruby Sequel library that allows using PostgreSQL advisory locks for application-level mutexes.

Stars: ✭ 20 (+25%)

Mutual labels: sequel

Will paginate

Pagination library for Rails, Sinatra, Merb, DataMapper, and more

Stars: ✭ 5,621 (+35031.25%)

Mutual labels: sequel

Sequel Ace

MySQL/MariaDB database management for macOS

Stars: ✭ 3,989 (+24831.25%)

Mutual labels: sequel

discord-against-humanity

cards against humanity, integrated into discord

Stars: ✭ 21 (+31.25%)

Mutual labels: sequel

doorkeeper-sequel

Doorkeeper Sequel ORM

Stars: ✭ 12 (-25%)

Mutual labels: sequel

sinator

Sinatra application generator

Stars: ✭ 19 (+18.75%)

Mutual labels: sequel

sqlike

Golang Sequel ORM that supports Enum, JSON, Spatial, and many more

Stars: ✭ 18 (+12.5%)

Mutual labels: sequel

CopyAsMarkdownForSequelProBundle

Copy selected rows as Markdown. It's so easy to paste in GitHub issue.

Stars: ✭ 23 (+43.75%)

Mutual labels: sequel

Sequel::Batches

This dataset extension provides the #in_batches method. The method splits dataset in parts and yields it.

Note: currently only PostgreSQL database is supported.

Installation

Add this line to your application's Gemfile:

gem 'sequel-batches'

Usage

In order to use the feature you should enable the extension:

Sequel::DATABASES.first.extension :batches

After that the #in_batches method becomes available on dataset:

User.where(role: "admin").in_batches(of: 4) do |ds|
  ds.delete
end

Finally, here's an example including all the available options:

options = {
  of: 4,
  pk: [:project_id, :external_user_id],
  start: { project_id: 2, external_user_id: 3 },
  finish: { project_id: 5, external_user_id: 70 },
  order: :desc,
}

Event.where(type: "login").in_batches(options) do |ds|
  ds.delete
end

Options

You can set the following options:

pk

Overrides primary key of your dataset. This option is required in case your table doesn't have a real PK, otherwise you will get Sequel::Extensions::Batches::MissingPKError.

Note that you have to provide columns that don't contain NULL values, otherwise this may not work as intended. You will receive Sequel::Extensions::Batches::NullPKError in case batch processing detects a NULL value on it's way, but it's not guaranteed since it doesn't check all the rows for performance reasons.

of

Sets chunk size (1000 by default).

start

A hash { [column]: <start_value> } that represents frame start for batch processing. Note that you will get Sequel::Extensions::Batches::InvalidPKError in case you provide a hash with wrong keys (ordering matters as well).

finish

Same as start but represents the frame end.

order

Specifies the primary key order (can be :asc or :desc). Defaults to :asc.

Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/umbrellio/sequel-batches.

License

The gem is available as open source under the terms of the MIT License.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

umbrellio / sequel-batches

Programming Languages

Labels

Projects that are alternatives of or similar to sequel-batches

Sequel::Batches

Installation

Usage

Options

pk

of

start

finish

order

Contributing

License