/blind_index

Securely search encrypted database fields

Primary LanguageRubyMIT LicenseMIT

Blind Index

Securely search encrypted database fields

Designed for use with attr_encrypted

Here’s a full example of how to use it with Devise

Build Status

How It Works

We use this approach by Scott Arciszewski. To summarize, we compute a keyed hash of the sensitive data and store it in a column. To query, we apply the keyed hash function (PBKDF2-HMAC-SHA256 by default) to the value we’re searching and then perform a database search. This results in performant queries for equality operations, while keeping the data secure from those without the key.

Getting Started

Add these lines to your application’s Gemfile:

gem 'attr_encrypted'
gem 'blind_index'

Add columns for the encrypted data and the blind index

# encrypted data
add_column :users, :encrypted_email, :string
add_column :users, :encrypted_email_iv, :string

# blind index
add_column :users, :encrypted_email_bidx, :string
add_index :users, :encrypted_email_bidx

And add to your model

class User < ApplicationRecord
  attr_encrypted :email, key: [ENV["EMAIL_ENCRYPTION_KEY"]].pack("H*")
  blind_index :email, key: [ENV["EMAIL_BLIND_INDEX_KEY"]].pack("H*")
end

We use environment variables to store the keys as hex-encoded strings (dotenv is great for this). Here’s an explanation of why pack is used. Do not commit them to source control. Generate one key for encryption and one key for hashing. You can generate keys in the Rails console with:

SecureRandom.hex(32)

For development, you can use these:

EMAIL_ENCRYPTION_KEY=0000000000000000000000000000000000000000000000000000000000000000
EMAIL_BLIND_INDEX_KEY=ffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff

And query away

User.where(email: "test@example.org")

Validations

To prevent duplicates, use:

class User < ApplicationRecord
  validates :email, uniqueness: true
end

We also recommend adding a unique index to the blind index column through a database migration.

Expressions

You can apply expressions to attributes before indexing and searching. This gives you the the ability to perform case-insensitive searches and more.

class User < ApplicationRecord
  blind_index :email, expression: ->(v) { v.downcase } ...
end

Multiple Indexes

You may want multiple blind indexes for an attribute. To do this, add another column:

add_column :users, :encrypted_email_ci_bidx, :string
add_index :users, :encrypted_email_ci_bidx

And update your model

class User < ApplicationRecord
  blind_index :email, ...
  blind_index :email_ci, attribute: :email, expression: ->(v) { v.downcase } ...
end

Search with:

User.where(email_ci: "test@example.org")

Index Only

If you don’t need to store the original value (for instance, when just checking duplicates), use a virtual attribute:

class User < ApplicationRecord
  attribute :email
  blind_index :email, ...
end

Requires ActiveRecord 5.1+

Multiple Columns

You can also use virtual attributes to index data from multiple columns:

class User < ApplicationRecord
  attribute :initials

  # must come before blind_index method
  before_validation :set_initials, if: -> { changes.key?(:first_name) || changes.key?(:last_name) }
  blind_index :initials, ...

  def set_initials
    self.initials = "#{first_name[0]}#{last_name[0]}"
  end
end

Requires ActiveRecord 5.1+

Fixtures

You can use encrypted attributes and blind indexes in fixtures with:

test_user:
  encrypted_email: <%= User.encrypt_email("test@example.org", iv: Base64.decode64("0000000000000000")) %>
  encrypted_email_iv: "0000000000000000"
  encrypted_email_bidx: <%= User.compute_email_bidx("test@example.org").inspect %>

Be sure to include the inspect at the end, or it won’t be encoded properly in YAML.

Algorithms

PBKDF2-HMAC-SHA256

The default hashing algorithm. Key stretching increases the amount of time required to compute hashes, which slows down brute-force attacks. You can set the number of iterations with:

class User < ApplicationRecord
  blind_index :email, iterations: 1000000, ...
end

The default is 10000. Changing this value requires you to recompute the blind index.

scrypt

Add scrypt to your Gemfile and use:

class User < ApplicationRecord
  blind_index :email, algorithm: :scrypt, ...
end

Set the cost parameters with:

class User < ApplicationRecord
  blind_index :email, algorithm: :scrypt, cost: {n: 4096, r: 8, p: 1}, ...
end

Argon2

Add argon2 to your Gemfile and use:

class User < ApplicationRecord
  blind_index :email, algorithm: :argon2, ...
end

Set the cost parameters with:

class User < ApplicationRecord
  blind_index :email, algorithm: :argon2, cost: {t: 3, m: 12}, ...
end

Key Rotation

To rotate keys without downtime, add a new column:

add_column :users, :encrypted_email_v2_bidx, :string
add_index :users, :encrypted_email_v2_bidx

And add to your model

class User < ApplicationRecord
  blind_index :email, key: [ENV["EMAIL_BLIND_INDEX_KEY"]].pack("H*")
  blind_index :email_v2, attribute: :email, key: [ENV["EMAIL_V2_BLIND_INDEX_KEY"]].pack("H*")
end

Backfill the data

User.find_each do |user|
  user.compute_email_v2_bidx
  user.save!
end

Then update your model

class User < ApplicationRecord
  blind_index :email, bidx_attribute: :encrypted_email_v2_bidx, key: [ENV["EMAIL_V2_BLIND_INDEX_KEY"]].pack("H*")

  # remove this line after dropping column
  self.ignored_columns = ["encrypted_email_bidx"]
end

Finally, drop the old column.

Reference

By default, blind indexes are encoded in Base64. Set a different encoding with:

class User < ApplicationRecord
  blind_index :email, encode: ->(v) { [v].pack("H*") }
end

Alternatives

One alternative to blind indexing is to use a deterministic encryption scheme, like AES-SIV. In this approach, the encrypted data will be the same for matches.

Upgrading

0.3.0

This version introduces a breaking change to enforce secure key generation. An error is thrown if your blind index key isn’t both binary and 32 bytes.

We recommend rotating your key if it doesn’t meet this criteria. You can generate a new key in the Rails console with:

SecureRandom.hex(32)

Update your model to convert the hex key to binary.

class User < ApplicationRecord
  blind_index :email, key: [ENV["EMAIL_BLIND_INDEX_KEY"]].pack("H*")
end

And recompute the blind index.

User.find_each do |user|
  user.compute_email_bidx
  user.save!
end

To continue without rotating, set:

class User < ApplicationRecord
  blind_index :email, insecure_key: true, ...
end

History

View the changelog

Contributing

Everyone is encouraged to help improve this project. Here are a few ways you can help:

To get started with development and testing:

git clone https://github.com/ankane/blind_index.git
cd blind_index
bundle install
rake test