/RockSack

Makes Facebook/Meta RocksDB key/value store easier to use in Java as a true object oriented database.

Primary LanguageJava

RockSack

  • Makes it easier to use Facebook/Meta key/value database RocksDB/RocksJava in Java applications.
  • Allows Java objects to control database behavior.
  • Expands RocksDB low level byte-oriented API to use native Java object serialization.
  • Build Java database applications more quickly.
  • Gives RocksDB the power of a true object oriented database.
  • Makes it easier to manage transactions and organize databases by Java class.
  • Supports transaction and transactionless models.
  • Provides tooling to rapidly adapt existing classes to use Java serialization and inherent language Comparable compatability!
  • RocksDB

    https://github.com/facebook/rocksdb
    http://rocksdb.org/
    https://github.com/facebook/rocksdb/tree/main/java/src/main/java/org/rocksdb

    Why RockSack?

    A bag or sack is a computer science term for a structure to hold a large amount of data that is usually unstructured. RockSack can store Java objects so that they can be efficiently indexed, preserved, and retrieved in a manner that mirrors the java.util.Hashmap, java.util.TreeMap and java.util.TreeSet classes while providing the benefits of a full blown database. The amount of data the system can maintain far exceeds resident and even virtual memory. The implementation uses the RocksDB key/value store for performance and durability. The RocksDB K/V store is a Meta project used by numerous high volume online orgs and has lots of support. What it lacks is a first class Java object method indexing.

    Once we can index the stored objects via a built-in method, we add the capabilities of a full object-oriented database. Once we can use the objects in streams and lambda expressions, we create a truly 'functional' Java database.
    So to summarize; RockSack provides a way of using serialized first class Java objects as indexes for RocksDB, unlocking the power of object-oriented and functional paradigms in a high-performance environment.

    The assumption is that classes are obviously serializable, and for indexing implement the java.lang.Comparable.compareTo method, which most do by default, including String. So in addition to RockSack having the means to store a large number of objects, it adds the properties of recoverability, isolation, durability, atomicity, and concurrency and provides tools for the user to easily use a widely used and supported, high volume key/value store.

    Technical Details:

    RockSack is a Java persistence mechanism that provides key/value store functionality with a small footprint and native object storage capability. Just about any Java object, meaning Serializable objects implementing the java.lang.Comparable interface, can be stored. The Comparable interface is part of the standard Java Collections Framework and is implemented in the majority of built-in Java classes such as String.

    Whats the advantage? Whereas an agnostic K/V store only allows you to index the raw byte values of the key, a 'functional' index gives you the ability to index the data based on an arbitrarily complex arrangement of any of the fields in the class, or even through a strictly computational process since we are using the result of a method call to control the order in which the data are stored. RockSack abstracts away the details of using RocksDB and provides tools to easily adapt your classes for use in the environment.

    There are methods in the class com.neocoretechs.rocksack.DatabaseManager to organize the maps and sets on the basis of type. In this way a rudimentary schema can be maintained. A non-transactional BufferedMap can be obtained by the following methods:

    
    DatabaseManager.setTableSpaceDir(argv[0]);
    BufferedMap map = DatabaseManager.getMap(key.getClass());
    map.put(key, value);
    Comparable c = map.get(key);
    
    
    

    If a transaction context is desired, in other words one in which multiple operations can be committed or rolled back under the control of the application, the following methods can be used:

    
    DatabaseManager.setTableSpaceDir(argv[0]);
    TransactionId xid = DatabaseManager.getTransactionId();
    TransactionalMap map = DatabaseManager.getTransactionalMap(key.getClass(), xid);
    map.put(key, value);
    Comparable c = map.get(key);
    
    DatabaseManager.commitTransaction(xid); // Or
    DatabaseManager.rollbackTransaction(xid); // Or
    DatabaseManager.checkpointTransaction(xid); // establish intermediate checkpoint that can be rolled back to
    DatabaseManager.rollbackToCheckpoint(xid);
    DatabaseManager.clearAllOutstandingTransactions(); // roll back and close all outstanding transactions on all open databases
    
    Various reporting functions:
    List<String> s = DatabaseManager.getOutstandingTransactionState(); // get the status of all outstanding transactions
    List<Transaction> t = DatabaseManager.getOutstandingTransactions(database);
    List<Transaction> t = DatabaseManager.getOutstandingTransactionsById(xid); // gets all transactions under given transaction Id
    etc..
    
    

    In addition to the 'get', 'put', 'remove', 'contains', 'size', 'contains', 'containsKey', 'first', 'last', 'firstKey', 'lastKey' the full set of iterators can be obtained to retrieve subsets of the data for sets and maps:

    Sets:
    headSet
    headSetKV
    tailSet
    tailSetKV
    subSet
    subSetKV
    keySet
    entrySet

    Maps:
    headMap
    headMapKV (key and value)
    tailMap
    tailMapKV
    subMap
    subMapKV

    Streams are supported via corresponding methods to provide functional programming constructs such as lambda expressions, a feature not found in most database environments:
    headSetStream
    headSetKVStream
    tailSetStream
    tailSetKVStream
    subSetStream
    subSetKVStream
    headMapStream
    headMapKVStream
    tailMapStream
    tailMapKVStream
    subMapStream
    subMapKVStream
    entrySetStream
    keySetStream

    
    	// Basic retrieval format for sub map range:
    	
    	String sminx = "100";
    	String smaxx = "175";
    	BufferedMap map = DatabaseManager.getMap(sminx); // Get the map for classes of this instance
    	Iterator<?> itk = map.subMap(sminx, smaxx); // retrieve values 'from' inclusive, 'to' exclusive
    	while(itk.hasNext()) {
    		System.out.println(itk.next());
    	}
    		
    	// Demonstration of transactional stream retrieval functional lambda expressions:
    	
    	TransactionId xid = DatabaseManager.getTransactionId();
    	// Get a transactional map for a fully qualified class name delivered on the command line
    	TransactionalMap map = DatabaseManager.getTransactionalMap(Class.forName(argv[1]), xid);
    	Object o; // Object to hold stream retrieval items
    	int i = 0; // item counter
    	
    	// Functionally equivalent stream retrievals below:
    	
    	map.headSetStream((Comparable) map.lastKey()).forEach(System.out::println);
    
    	// Lamdba expressions:
    	
    	map.tailSetStream((Comparable) map.firstKey()).forEach(o -> {
    		System.out.println("["+(i++)+"]"+o);
    	});
    	
    	map.subSetStream((Comparable) map.firstKey(), (Comparable) map.lastKey()).forEach(o -> {
    		System.out.println("["+(i++)+"]"+o);
    	});
    		
    

    New capabilities include a ClassTool to rapidly adapt existing classes to use built-in Java functionality for serialization and class indexing. In order for Java classes to properly index in RocksDB, they must exhibit "total ordering". The serialized representation of stored bytes must be represented by a compareTo and equals method that respects the total ordering of those bytes as RockdsDB sees it, otherwise errors are incurred regarding out of order keys during the compaction process. The ClassTool will use proper unsigned primitive comparison and the Externalizable extension of Serialization to ensure total ordering.

    Starting with this:

    package com;
    
    import com.neocoretechs.rocksack.CompareAndSerialize;
    import com.neocoretechs.rocksack.ComparisonOrderField;
    import com.neocoretechs.rocksack.ComparisonOrderMethod;
    /**
     * Basic annotation tooling for RockSack to generate the necessary fields and methods for
     * storage and retrieval under the java.lang.Comparable interface as used throughout the language.
     * The ordering of the keys is defined here as the order in which they appear: i,j, and l. We
     * demonstrate method and field access and generate compareTo method and Externalizable interface
     * implementation with SerialUID. No modifications will affect the operation of the original class.
     * The original class will be backed up as TestTooling2.bak before modification.
     * {@link CompareAndSerialize} annotation to designate the class as toolable. The {@link ComparisonOrderField} and
     * {@link ComparisonOrderMethod}. {@link com.neocoretechs.rocksack.ClassTool}
     */
    @CompareAndSerialize
    public class TestTooling2{
    	@ComparisonOrderField
    	private int i;
    	@ComparisonOrderField
    	private String j;
    	private String l;
    	private double d;
    	@ComparisonOrderMethod
    	public String getL() {
    		return l;
    	}
    	@ComparisonOrderMethod
    	public double getD() {
    		return d;
    	}
    	public void setD(double d) {
    		this.d = d;
    	}
    	public void setL(String l) {
    		this.l = l;
    	}
    	public TestTooling2(int key1, String key2, String key3) {
    		this.i = key1;
    		this.j = key2;
    		this.l = key3;
    	}
    }
    
    
    
    

    The ClassTool runs in one command line to produce a fully instrumented version like this:

    package com;
    
    import com.neocoretechs.rocksack.CompareAndSerialize;
    import com.neocoretechs.rocksack.ComparisonOrderField;
    import com.neocoretechs.rocksack.ComparisonOrderMethod;
    /**
     * Basic annotation tooling for RockSack to generate the necessary fields and methods for
     * storage and retrieval under the java.lang.Comparable interface as used throughout the language.
     * The ordering of the keys is defined here as the order in which they appear: i,j, and l. We
     * demonstrate method and field access and generate compareTo method and Externalizable interface
     * implementation with SerialUID. No modifications will affect the operation of the original class.
     * The original class will be backed up as TestTooling2.bak before modification.
     * {@link CompareAndSerialize} annotation to designate the class as toolable. The {@link ComparisonOrderField} and
     * {@link ComparisonOrderMethod}. {@link com.neocoretechs.rocksack.ClassTool}
     */
    @CompareAndSerialize
    public class TestTooling2 implements java.io.Externalizable,java.lang.Comparable{
    	private static final long serialVersionUID = 1L;
    	@ComparisonOrderField
    	private int i;
    	@ComparisonOrderField
    	private String j;
    	private String l;
    	private double d;
    	@ComparisonOrderMethod
    	public String getL() {
    		return l;
    	}
    	@ComparisonOrderMethod
    	public double getD() {
    		return d;
    	}
    	public void setD(double d) {
    		this.d = d;
    	}
    	public void setL(String l) {
    		this.l = l;
    	}
    	public TestTooling2(int key1, String key2, String key3) {
    		this.i = key1;
    		this.j = key2;
    		this.l = key3;
    	}
    	@Override
    	public int compareTo(Object o) {
    		int n;
    		n=Integer.compareUnsigned(i,((TestTooling2)o).i);
    		if(n != 0) return n;
    		n = j.compareTo(((TestTooling2)o).j);
    		if(n != 0) return n;
    		n=Long.compareUnsigned(Double.doubleToRawLongBits(getD()),Double.doubleToRawLongBits(((TestTooling2)o).getD()));
    		if(n != 0) return n;
    		n = getL().compareTo(((TestTooling2)o).getL());
    		if(n != 0) return n;
    		return 0;
    	}
    	@Override
    	public boolean equals(Object o) {
    		int n;
    		boolean b;
    		n=Integer.compareUnsigned(i,((TestTooling2)o).i);
    		if(n != 0) return false;
    		b = j.equals(((TestTooling2)o).j);
    		if(!b) return b;
    		n=Long.compareUnsigned(Double.doubleToRawLongBits(getD()),Double.doubleToRawLongBits(((TestTooling2)o).getD()));
    		if(n != 0) return false;
    		b = getL().equals(((TestTooling2)o).getL());
    		if(!b) return b;
    		return true;
    	}
    	@Override
    	public int hashCode() {
    		int n=0;
    		n+=Integer.hashCode(i);
    		n+=j.hashCode();
    		n+=Double.hashCode(getD());
    		n+= getL().hashCode();
    		return n;
    	}
    	@Override
    	public void readExternal(java.io.ObjectInput in) throws java.io.IOException,ClassNotFoundException {
    		i=in.readInt();
    		j=(String)in.readObject();
    		setD(in.readDouble());
    		setL((String)in.readObject());
    	}
    	@Override
    	public void writeExternal(java.io.ObjectOutput out) throws java.io.IOException {
    		out.writeInt(i);
    		out.writeObject(j);
    		out.writeDouble(getD());
    		out.writeObject(getL());
    	}
    	public TestTooling2() {}
    }
    
    
    

    And the class is easily provisioned for indexing in the RocksDB database via the key order specified!

    Beginning in Java 9 an interactive session application called jshell was included with the JDK. We can perform ad-hoc database operations using jshell and RockSack. Beginning with this class:

    package com;
    import com.neocoretechs.rocksack.CompareAndSerialize;
    import com.neocoretechs.rocksack.ComparisonOrderField;
    import com.neocoretechs.rocksack.ComparisonOrderMethod;
    /**
    * Basic annotation tooling for RockSack to generate the necessary fields and methods for
    * storage and retrieval under the java.lang.Comparable interface as used throughout the language.
    * The ordering of the keys is defined here as by the annotation order field: j,i, and l. We
    * demonstrate method and field access and generate compareTo method and Externalizable interface
    * implementation with SerialUID. We also show how to wrap a custom object to give Comparable
    * functionality to any class. No modifications will affect the operation of the original class.
    * The original class will be backed up as TestTooling1.bak before modification.
    * {@link CompareAndSerialize} annotation to designate the class as toolable. The {@link ComparisonOrderField} and
    * {@link ComparisonOrderMethod}. {@link com.neocoretechs.rocksack.ClassTool}
    */
    @CompareAndSerialize
    public class TestTooling1{
    	@ComparisonOrderField(order=2)
    	private int i;
    	@ComparisonOrderField(order=1)
    	private String j;
    	private ByteObject l = new ByteObject();
    	@ComparisonOrderMethod(order=3)
    	public ByteObject getL() {
    		return l;
    	}
    	public TestTooling1(String key1, int key2) {
    		j = key1;
    		i = key2;	
    	}
    	static class ByteObject implements Comparable, java.io.Serializable {
    		byte[] bytes = new byte[] {10,9,8,7,6,5,4,3,2,1};
    		@Override
    		public int compareTo(Object o) {
    			ByteObject b = (ByteObject)o;
    			for(int i = 0; i < b.bytes.length; i++) {
    				if(bytes[i] > b.bytes[i])
    					return 1;
    				if(bytes[i] < b.bytes[i])
    					return -1;
    			}
    			return 0;
    		}
    		
    	}
    	
    	@Override
    	public String toString() {
    		return "Key1="+j+" key2="+i;
    	}
    }
    
    

    Our ClassTool instruments it as follows:

    package com;
    
    import com.neocoretechs.rocksack.CompareAndSerialize;
    import com.neocoretechs.rocksack.ComparisonOrderField;
    import com.neocoretechs.rocksack.ComparisonOrderMethod;
    /**
    * Basic annotation tooling for RockSack to generate the necessary fields and methods for
    * storage and retrieval under the java.lang.Comparable interface as used throughout the language.
    * The ordering of the keys is defined here as by the annotation order field: j,i, and l. We
    * demonstrate method and field access and generate compareTo method and Externalizable interface
    * implementation with SerialUID. We also show how to wrap a custom object to give Comparable
    * functionality to any class. No modifications will affect the operation of the original class.
    * The original class will be backed up as TestTooling1.bak before modification.
    * {@link CompareAndSerialize} annotation to designate the class as toolable. The {@link ComparisonOrderField} and
    * {@link ComparisonOrderMethod}. {@link com.neocoretechs.rocksack.ClassTool}
    */
    @CompareAndSerialize
    public class TestTooling1 implements java.io.Externalizable,java.lang.Comparable{
    	private static final long serialVersionUID = 1L;
    	@ComparisonOrderField(order=2)
    	private int i;
    	@ComparisonOrderField(order=1)
    	private String j;
    	private ByteObject l = new ByteObject();
    	@ComparisonOrderMethod(order=3)
    	public ByteObject getL() {
    		return l;
    	}
    	public void setL(ByteObject l) {
    		this.l = l;
    	}
    	public TestTooling1(String key1, int key2) {
    		j = key1;
    		i = key2;	
    	}
    	static class ByteObject implements Comparable, java.io.Serializable {
    		byte[] bytes = new byte[] {10,9,8,7,6,5,4,3,2,1};
    		@Override
    		public int compareTo(Object o) {
    			ByteObject b = (ByteObject)o;
    			for(int i = 0; i < b.bytes.length; i++) {
    				if(bytes[i] > b.bytes[i])
    					return 1;
    				if(bytes[i] < b.bytes[i])
    					return -1;
    			}
    			return 0;
    		}
    		
    	}
    	
    	@Override
    	public String toString() {
    		return "Key1="+j+" key2="+i;
    	}
    	@Override
    	public int compareTo(Object o) {
    		int n;
    		n = j.compareTo(((TestTooling1)o).j);
    		if(n != 0) return n;
    		n=Integer.compareUnsigned(i,((TestTooling1)o).i);
    		if(n != 0) return n;
    		n = getL().compareTo(((TestTooling1)o).getL());
    		if(n != 0) return n;
    		return 0;
    	}
    	@Override
    	public boolean equals(Object o) {
    		int n;
    		boolean b;
    		b = j.equals(((TestTooling1)o).j);
    		if(!b) return b;
    		n=Integer.compareUnsigned(i,((TestTooling1)o).i);
    		if(n != 0) return false;
    		b = getL().equals(((TestTooling1)o).getL());
    		if(!b) return b;
    		return true;
    	}
    	@Override
    	public int hashCode() {
    		int n=0;
    		n+=j.hashCode();
    		n+=Integer.hashCode(i);
    		n+= getL().hashCode();
    		return n;
    	}
    	@Override
    	public void readExternal(java.io.ObjectInput in) throws java.io.IOException,ClassNotFoundException {
    		j=(String)in.readObject();
    		i=in.readInt();
    		setL((ByteObject)in.readObject());
    	}
    	@Override
    	public void writeExternal(java.io.ObjectOutput out) throws java.io.IOException {
    		out.writeObject(j);
    		out.writeInt(i);
    		out.writeObject(getL());
    	}
    	public TestTooling1() {}
    }
    
    
    

    Then the interactive jshell session demonstrates the ease with which ad-hoc data management can be performed using the power of RockSack!

    D:\etc>jshell --class-path D:/etc/rocksdbjni-7.7.3-win64.jar;D:/etc/RockSack.jar;D:/etc D:/etc/rocksack.jshell --feedback RockSack
    ->DatabaseManager.setTableSpaceDir("D:/etc/db/test");
    ->com.TestTooling1 t1 = new com.TestTooling1("a1",1);
    ->BufferedMap map = DatabaseManager.getMap(t1.getClass());
    ->map.put(t1,new String("value1"));
    ->t1 = new com.TestTooling1("a2",2);
    ->map.put(t1,new String("value2"));
    ->t1 = new com.TestTooling1("a3",3);
    ->map.put(t1,new String("value3"));
    ->t1 = new com.TestTooling1("a4",4);
    ->map.put(t1,new String("value4"));
    ->t1 = new com.TestTooling1("a5",5);
    ->map.put(t1,new String("value5"));
    ->map.tailSetStream((Comparable) map.firstKey()).forEach(System.out::println);
    Key1=a1 key2=1
    Key1=a2 key2=2
    Key1=a3 key2=3
    Key1=a4 key2=4
    Key1=a5 key2=5
    ->map.tailSetKVStream((Comparable) map.firstKey()).forEach(System.out::println);
    KeyValuePair:[Key1=a1 key2=1,value1]
    KeyValuePair:[Key1=a2 key2=2,value2]
    KeyValuePair:[Key1=a3 key2=3,value3]
    KeyValuePair:[Key1=a4 key2=4,value4]
    KeyValuePair:[Key1=a5 key2=5,value5]
    ->t1 = new com.TestTooling1("a0",0);
    ->map.put(t1,new String("zero"));
    ->map.tailSetKVStream((Comparable) map.firstKey()).forEach(System.out::println);
    KeyValuePair:[Key1=a0 key2=0,zero]
    KeyValuePair:[Key1=a1 key2=1,value1]
    KeyValuePair:[Key1=a2 key2=2,value2]
    KeyValuePair:[Key1=a3 key2=3,value3]
    KeyValuePair:[Key1=a4 key2=4,value4]
    KeyValuePair:[Key1=a5 key2=5,value5]
    ->
    ->t1 = new com.TestTooling1("a0",99);
    ->map.put(t1,new String("key a-zero and 99"));
    ->map.tailSetKVStream((Comparable) map.firstKey()).forEach(System.out::println);
    KeyValuePair:[Key1=a0 key2=0,zero]
    KeyValuePair:[Key1=a0 key2=99,key a-zero and 99]
    KeyValuePair:[Key1=a1 key2=1,value1]
    KeyValuePair:[Key1=a2 key2=2,value2]
    KeyValuePair:[Key1=a3 key2=3,value3]
    KeyValuePair:[Key1=a4 key2=4,value4]
    KeyValuePair:[Key1=a5 key2=5,value5]
    ->