-
Requirements
- PHP 7.4^
-
Supports:
- A single Yaml configuration file
- Multiple Nodes (DB servers)
- Mysql
- Postgres
- Mysql & Postgres can be used together and hot swapped
- Multiple Geo Locations
- UUIDs bakes in all relevant data for tables and to which node it belongs
- Docker
- Kubernetes
- Fast Asynchronous DB queries (using a purpose built GoThreaded service https://github.com/jrsaunders/go-threaded | https://hub.docker.com/r/jrsaunders/gothreaded or PHP Forking for crons or dev work)
- Caching to File or to Redis or MemcacheD
- Unique table columns across nodes
- Table Grouping to ensure data is kept in the right shards so joins can be done
- Using popular ORM from Laravel ( though your project does not need be in Laravel ) https://laravel.com/docs/7.x
- Query building being database agnostic
- Efficient pagination system across Nodes using caching
- Raw SQL Queries
Once you have initiated it as outlined in the INSTALLATION section below - here are some quick examples of usage.
If you are familiar with the ORM in Laravel - this is just an extension of that.
- Creates Table across all appropriate Nodes (Mysql and Postgres simultaneously). This follows the guidance you have given in your Yaml Config file as to what tables belong on what nodes
use ShardMatrix\DB\Builder\Schema;
# Creates Table across all appropriate Nodes (Mysql and Postgres simultaneously).
# This follows the guidance you have given in your Yaml Config file as to what tables
# belong on what nodes
Schema::create( 'users',
function ( \Illuminate\Database\Schema\Blueprint $table ) {
$table->string( 'uuid', 50 )->primary();
$table->string('username',255)->unique();
$table->string('email',255)->unique();
$table->integer('something');
$table->dateTime( 'created' );
}
);
- Insert Data - the system will choose an appropriate shard node and create a UUID for it that will be attributed to an appropriate node
use ShardMatrix\DB\Builder\DB;
# Insert Data - the system will choose an appropriate shard node and create a UUID for it that will be attributed to an appropriate node
$uuid = DB::table( 'users' )->insert(
[
'username' => 'jack-malone',
'password' => 'poootpooty',
'created' => (new \DateTime())->format('Y-m-d H:i:s'),
'something' => 5,
'email' => 'jack.malone@yatti.com',
]
);
echo $uuid->toString();
# outputs 06a00233-1ea8af83-9b6f-6104-b465-444230303037
echo $uuid->getNode()->getName();
# outputs DB0007
echo $uuid->getTable()->getName();
# outputs users
Inserted Data
uuid 06a00233-1ea8af83-9b6f-6104-b465-444230303037
username jack-malone
password poootpooty
email jack.malone@yatti.com
created 2020-04-30 15:35:31.000000
something 5
- Any further inserts done in this php process will be inserted into the same shard, if in the correct table group
- Get the record directly from the correct node (shard)
- Manipulate the record
- Update the record
use ShardMatrix\DB\Builder\DB;
use ShardMatrix\DB\Interfaces\DBDataRowTransactionsInterface;
# Get the record directly from the correct node (shard)
$record = DB::getByUuid( '06a00233-1ea8af83-9b6f-6104-b465-444230303037' );
# Manipulate the record
if ( $record && $record instanceof DBDataRowTransactionsInterface) {
# As above you could run an additional check for the instance of the record returned, but it should always follow this interface through the query builder
echo $record->username;
# outputs jack-malone
echo $record->email;
# outputs jack.malone@yatti.com
# overwrite the email attribute
$record->email = 'anotheremail@yatti.com';
# Update the record
$record->save();
}
- Query all relevant nodes for the data
- Data returns as a Collection that can be iterated through
- Use data conditionally
- Manipulate the record and commit changes
use ShardMatrix\DB\Builder\DB;
use ShardMatrix\DB\Interfaces\DBDataRowTransactionsInterface;
# Query all relevant nodes for the data
$collection = DB::allNodesTable( 'users')->where('email','like','%yatti%')->limit(50)->get();
# Data returns as a Collection that can be iterated through
$collection->each( function(DBDataRowTransactionsInterface $record){
# Use data conditionally
if($record->username == 'a-bad-user'){
# Manipulate the record and commit changes
$record->delete();
}
});
use ShardMatrix\DB\Builder\DB;
use ShardMatrix\DB\Interfaces\DBDataRowTransactionsInterface;
$pagination = DB::allNodesTable( 'users' )
->orderBy( 'created', 'desc' )
->paginate();
$pagination->each( function ( DBDataRowTransactionsInterface $record) {
echo $record->username;
echo $record->getUuid();
});
echo $pagination->total();
echo $pagination->perPage();
echo $pagination->nextPageUrl();
echo $pagination->previousPageUrl();
use ShardMatrix\DB\Builder\DB;
use ShardMatrix\DB\Interfaces\DBDataRowTransactionsInterface;
$uuidFromCurrentUser = "06a00233-1ea8af83-d514-6a76-83ae-444230303037";
$pagination = DB::table( 'users' )
->uuidAsNodeReference($uuidFromCurrentUser)
->orderBy( 'created', 'desc' )
->paginate();
$pagination->each( function ( DBDataRowTransactionsInterface $record) {
echo $record->username;
echo $record->getUuid();
});
echo $pagination->total();
echo $pagination->perPage();
echo $pagination->nextPageUrl();
echo $pagination->previousPageUrl();
Use Composer to install ShardMatrix, or pull the repo from github.
composer require jrsaunders/shard-matrix
ShardMatrix needs to know how your tables and columns and databases interact, so this config file will define this in a simple yaml file.
- You will need your credentials for your databases, and access privileges setup. Reference Yaml file
This is a full example of how a configuration file should look.
version: 1
table_groups:
user:
- users
- payments
- offers
tracking:
- visitors
- sign_ups
published:
- published_offers
unique_columns:
users:
- email
- username
nodes:
DB0001:
dsn: mysql:dbname=shard;host=localhost:3301;user=root;password=password
docker_network: DB0001:3306
geo: UK
table_groups:
- user
- published
DB0002:
dsn: mysql:dbname=shard;host=localhost:3302;user=root;password=password
docker_network: DB0002:3306
geo: UK
table_groups:
- user
- published
DB0003:
dsn: mysql:dbname=shard;host=localhost:3303;user=root;password=password
docker_network: DB0003:3306
geo: UK
table_groups:
- user
- published
DB0004:
dsn: mysql:dbname=shard;host=localhost:3304;user=root;password=password
docker_network: DB0004:3306
geo: UK
table_groups:
- published
DB0005:
dsn: mysql:dbname=shard;host=localhost:3305;user=root;password=password
docker_network: DB0005:3306
table_groups:
- tracking
DB0006:
dsn: mysql:dbname=shard;host=localhost:3306;user=root;password=password
docker_network: DB0006:3306
geo: UK
insert_data: false
table_groups:
- tracking
DB0007:
dsn: pgsql:dbname=shard;host=localhost:5407;user=postgres;password=password
docker_network: DB0007:5432
geo: UK
table_groups:
- user
- tracking
Define the version. The most recent version is 1.
version: 1
Define the table groups. As you add tables to your Application you will need to explicitly add them here to.
The group name is only used in ShardMatrix.
The table names are attributed to the groups. A table can only be in one group at a time and once you have written to the Databases, it is best not to change any table assigned to a group.
- Denotes the table groups section on config
- Denotes the name of a group of tables
- Denotes the table name
# Denotes the table groups section on config
table_groups:
# Denotes the name of a group of tables
user:
# Denotes the table name
- users
This section as it may appear.
table_groups:
user:
- users
- payments
- offers
tracking:
- visitors
- sign_ups
published:
- published_offers
Unique Columns can be defined here. So in the users
table email
and username
must be unique across all Nodes (shard databases).
unique_columns:
users:
- email
- username
facebook_users:
- fb_id
This is where you define your database connections, credentials, and what table groups and geos the node maybe using.
Nodes can be extended and added to as you go.
Node names must remain the same though as must the table groups they correspond to.
The anatomy of the node section.
- Denotes the where the nodes are defined
- Node Name
- DSN for connection to DB
- *optional Docker service name and port number
- *optional Geo - if a geo is stated the application inserting data will use this to choose this node to write new inserts to it
- *optional Stop new data being written here, unless connected to an existing UUID from this node
- Table groups that use this node must be defined here
- Table group user (that consists of the users, offers, payments tables)
# Denotes the where the nodes are defined
nodes:
# Node Name
DBUK01:
# DSN for connection to DB
dsn: mysql:dbname=shard;host=localhost:3301;user=root;password=password
# *optional docker service name and port number
docker_network: DBUK:3306
# *optional Geo - if a geo is stated the application inserting data will use this to choose this node to write new inserts to it
geo: UK
# *optional Stop new data being written here, unless connected to an existing UUID from this node
insert_data: false
# Table groups that use this node must be defined here
table_groups:
# Table group user (that consists of the users, offers, payments tables)
- user
- published
The Node Section as it may appear in the config yaml.
nodes:
DBUK01:
dsn: mysql:dbname=shard;host=localhost:3301;user=root;password=password
docker_network: DBUK:3306
geo: UK
table_groups:
- user
- published
postg1:
dsn: pgsql:dbname=shard;host=localhost:5407;user=postgres;password=password
docker_network: postg1_db:5432
table_groups:
- tracking
DB0001:
dsn: mysql:dbname=shard;host=localhost:3304;user=root;password=password
docker_network: DB0001:3306
insert_data: false
table_groups:
- user
- published
Save the file to where the application is, in either a protected directory or externally inaccessible directory.
Alternatively it can be made into a Kubernetes Secret and given to your application that way.
In these examples we have saved our Config file as shard_matrix.yaml
and placed it in the same directory as our applications index php.
- Our config file
- Specifying a local directory to write db data to when it needs to
use ShardMatrix\ShardMatrix;
# Our config file
ShardMatrix::initFromYaml( __DIR__ . '/shard_matrix.yaml' );
# Specifying a local directory to write db data to when it needs to
ShardMatrix::setPdoCachePath( __DIR__ . '/shard_matrix_cache' );
- Our config file
- Changes the service from PHP forking for asynchronous queries to GoThreaded
- Uses GoThreaded for asynchronous DB calls when we have to query all relevant shards
- This overwrites the PdoCache Service that was using writing to file, and now instead uses Redis caching
use ShardMatrix\ShardMatrix;
# Our config file
ShardMatrix::initFromYaml( __DIR__ . '/shard_matrix.yaml' );
# Changes the service from PHP forking for asynchronous queries to GoThreaded
ShardMatrix::useGoThreadedForAsyncQueries();
# Uses GoThreaded for asynchronous DB calls when we have to query all relevant shards
ShardMatrix::setGoThreadedService( function () {
return new \ShardMatrix\GoThreaded\Client( '127.0.0.1', 1534, 'gothreaded', 'password' );
} );
# This overwrites the PdoCache Service that was used to write to file, and now instead uses Redis caching
ShardMatrix::setPdoCacheService( function () {
return new \ShardMatrix\PdoCacheRedis( new \Predis\Client( 'tcp://127.0.0.1:6379' ) );
} );