/mdarray

Multidimensional array similar to NumPy and NArray

Primary LanguageRubyOtherNOASSERTION

Announcement

MDArray version 0.5.5.2 has been released. MDArray is a multi dimensional array implemented for JRuby inspired by NumPy (www.numpy.org) and Masahiro Tanaka´s Narray (narray.rubyforge.org).
MDArray stands on the shoulders of Java-NetCDF and Parallel Colt. At this point MDArray has libraries for linear algebra, mathematical, trigonometric and descriptive statistics methods.

NetCDF-Java Library is a Java interface to NetCDF files, as well as to many other types of scientific data formats. It is developed and distributed by Unidata (http://www.unidata.ucar.edu).

Parallel Colt (https://sites.google.com/site/piotrwendykier/software/parallelcolt is a multithreaded version of Colt (http://acs.lbl.gov/software/colt/). Colt provides a set of Open Source Libraries for High Performance Scientific and Technical Computing in Java. Scientific and technical computing is characterized by demanding problem sizes and a need for high performance at reasonably small memory footprint.

What´s new:

Version 0.5.5.2 is a bug fix for a class StringArray. In Java-NetCDF when passing "string" as type an ObjectArray is created and not a StringArray. This version fix this issue and gets a StringArray when the "string" type is selected.

MDArray and SciRuby:

MDArray subscribes fully to the SciRuby Manifesto (http://sciruby.com/).

“Ruby has for some time had no equivalent to the beautifully constructed NumPy, SciPy, and matplotlib libraries for Python.

We believe that the time for a Ruby science and visualization package has come. Sometimes when a solution of sugar and water becomes super-saturated, from it precipitates a pure, delicious, and diabetes-inducing crystal of sweetness, induced by no more than the tap of a finger. So is occurring now, we believe, with numeric and visualization libraries for Ruby.”

MDArray main properties are:

  • Homogeneous multidimensional array, a table of elements (usually numbers), all of the same type, indexed by a tuple of positive integers;
  • Support for many linear algebra methods (see bellow);
  • Easy calculation for large numerical multi dimensional arrays;
  • Basic types are: boolean, byte, short, int, long, float, double, string, structure;
  • Based on JRuby, which allows importing Java libraries;
  • Operator: +,-,*,/,%,**, >, >=, etc.;
  • Functions: abs, ceil, floor, truncate, is_zero, square, cube, fourth;
  • Binary Operators: &, |, ^, ~ (binary_ones_complement), <<, >>;
  • Ruby Math functions: acos, acosh, asin, asinh, atan, atan2, atanh, cbrt, cos, erf, exp, gamma, hypot, ldexp, log, log10, log2, sin, sinh, sqrt, tan, tanh, neg;
  • Boolean operations on boolean arrays: and, or, not;
  • Fast descriptive statistics from Parallel Colt (complete list found bellow);
  • Easy manipulation of arrays: reshape, reduce dimension, permute, section, slice, etc.;
  • Support for reading and writing NetCDF-3 files;
  • Reading of two dimensional arrays from CSV files (mainly for debugging and simple testing purposes);
  • StatList: a list that can grow/shrink and that can compute Parallel Colt descriptive statistics;
  • Experimental lazy evaluation (still slower than eager evaluation).

Supported linear algebra methods:

  • backwardSolve: Solves the upper triangular system U*x=b;
  • chol: Constructs and returns the cholesky-decomposition of the given matrix.
  • cond: Returns the condition of matrix A, which is the ratio of largest to smallest singular value.
  • det: Returns the determinant of matrix A.
  • eig: Constructs and returns the Eigenvalue-decomposition of the given matrix.
  • forwardSolve: Solves the lower triangular system L*x=b;
  • inverse: Returns the inverse or pseudo-inverse of matrix A.
  • kron: Computes the Kronecker product of two real matrices.
  • lu: Constructs and returns the LU-decomposition of the given matrix.
  • mult: Inner product of two vectors; Sum(x[i] * y[i]).
  • mult: Linear algebraic matrix-vector multiplication; z = A * y.
  • mult: Linear algebraic matrix-matrix multiplication; C = A x B.
  • multOuter: Outer product of two vectors; Sets A[i,j] = x[i] * y[j].
  • norm1: Returns the one-norm of vector x, which is Sum(abs(x[i])).
  • norm1: Returns the one-norm of matrix A, which is the maximum absolute column sum.
  • norm2: Returns the two-norm (aka euclidean norm) of vector x; equivalent to Sqrt(mult(x,x)).
  • norm2: Returns the two-norm of matrix A, which is the maximum singular value; obtained from SVD.
  • normF: Returns the Frobenius norm of matrix A, which is Sqrt(Sum(A[i]2)).
  • normF: Returns the Frobenius norm of matrix A, which is Sqrt(Sum(A[i,j]2)).
  • normInfinity: Returns the infinity norm of vector x, which is Max(abs(x[i])).
  • normInfinity: Returns the infinity norm of matrix A, which is the maximum absolute row sum.
  • pow: Linear algebraic matrix power; B = Ak <==> B = AA...*A.
  • qr: Constructs and returns the QR-decomposition of the given matrix.
  • rank: Returns the effective numerical rank of matrix A, obtained from Singular Value Decomposition.
  • solve: Solves A*x = b.
  • solve: Solves A*X = B.
  • solveTranspose: Solves X*A = B, which is also A'*X' = B'.
  • svd: Constructs and returns the SingularValue-decomposition of the given matrix.
  • trace: Returns the sum of the diagonal elements of matrix A; Sum(A[i,i]).
  • trapezoidalLower: Modifies the matrix to be a lower trapezoidal matrix.
  • vectorNorm2: Returns the two-norm (aka euclidean norm) of vector X.vectorize();
  • xmultOuter: Outer product of two vectors; Returns a matrix with A[i,j] = x[i] * y[j].
  • xpowSlow: Linear algebraic matrix power; B = Ak <==> B = AA...*A.

Properties´ methods tested on matrices:

  • density: Returns the matrix's fraction of non-zero cells; A.cardinality() / A.size().
  • generate_non_singular!: Modifies the given square matrix A such that it is diagonally dominant by row and column, hence non-singular, hence invertible.
  • diagonal?: A matrix A is diagonal if A[i,j] == 0 whenever i != j.
  • diagonally_dominant_by_column?: A matrix A is diagonally dominant by column if the absolute value of each diagonal element is larger than the sum of the absolute values of the off-diagonal elements in the corresponding column.
  • diagonally_dominant_by_row?: A matrix A is diagonally dominant by row if the absolute value of each diagonal element is larger than the sum of the absolute values of the off-diagonal elements in the corresponding row.
  • identity?: A matrix A is an identity matrix if A[i,i] == 1 and all other cells are zero.
  • lower_bidiagonal?: A matrix A is lower bidiagonal if A[i,j]==0 unless i==j || i==j+1.
  • lower_triangular?: A matrix A is lower triangular if A[i,j]==0 whenever i < j.
  • nonnegative?: A matrix A is non-negative if A[i,j] >= 0 holds for all cells.
  • orthogonal?: A square matrix A is orthogonal if A*transpose(A) = I.
  • positive?: A matrix A is positive if A[i,j] > 0 holds for all cells.
  • singular?: A matrix A is singular if it has no inverse, that is, iff det(A)==0.
  • skew_symmetric?: A square matrix A is skew-symmetric if A = -transpose(A), that is A[i,j] == -A[j,i].
  • square?: A matrix A is square if it has the same number of rows and columns.
  • strictly_lower_triangular?: A matrix A is strictly lower triangular if A[i,j]==0 whenever i <= j.
  • strictly_triangular?: A matrix A is strictly triangular if it is triangular and its diagonal elements all equal 0.
  • strictly_upper_triangular?: A matrix A is strictly upper triangular if A[i,j]==0 whenever i >= j.
  • symmetric?: A matrix A is symmetric if A = tranpose(A), that is A[i,j] == A[j,i].
  • triangular?: A matrix A is triangular iff it is either upper or lower triangular.
  • tridiagonal?: A matrix A is tridiagonal if A[i,j]==0 whenever Math.abs(i-j) > 1.
  • unit_triangular?: A matrix A is unit triangular if it is triangular and its diagonal elements all equal 1.
  • upper_bidiagonal?: A matrix A is upper bidiagonal if A[i,j]==0 unless i==j || i==j-1.
  • upper_triangular?: A matrix A is upper triangular if A[i,j]==0 whenever i > j.
  • zero?: A matrix A is zero if all its cells are zero.
  • lower_bandwidth: The lower bandwidth of a square matrix A is the maximum i-j for which A[i,j] is nonzero and i > j.
  • semi_bandwidth: Returns the semi-bandwidth of the given square matrix A.
  • upper_bandwidth: The upper bandwidth of a square matrix A is the maximum j-i for which A[i,j] is nonzero and j > i.

Descriptive statistics methods imported from Parallel Colt:

  • auto_correlation, correlation, covariance, durbin_watson, frequencies, geometric_mean,
  • harmonic_mean, kurtosis, lag1, max, mean, mean_deviation, median, min, moment, moment3,
  • moment4, pooled_mean, pooled_variance, product, quantile, quantile_inverse,
  • rank_interpolated, rms, sample_covariance, sample_kurtosis, sample_kurtosis_standard_error,
  • sample_skew, sample_skew_standard_error, sample_standard_deviation, sample_variance,
  • sample_weighted_variance, skew, split, standard_deviation, standard_error, sum,
  • sum_of_inversions, sum_of_logarithms, sum_of_powers, sum_of_power_deviations,
  • sum_of_squares, sum_of_squared_deviations, trimmed_mean, variance, weighted_mean,
  • weighted_rms, weighted_sums, winsorized_mean.

Double and Float methods from Parallel Colt:

  • acos, asin, atan, atan2, ceil, cos, exp, floor, greater, IEEEremainder, inv, less, lg,
  • log, log2, rint, sin, sqrt, tan.

Double, Float, Long and Int methods from Parallel Colt:

  • abs, compare, div, divNeg, equals, isEqual (is_equal), isGreater (is_greater),
  • isles (is_less), max, min, minus, mod, mult, multNeg (mult_neg), multSquare (mult_square),
  • neg, plus (add), plusAbs (plus_abs), pow (power), sign, square.

Long and Int methods from Parallel Colt

  • and, dec, factorial, inc, not, or, shiftLeft (shift_left), shiftRightSigned (shift_right_signed), shiftRightUnsigned (shift_right_unsigned), xor.

MDArray installation and download:

  • Install Jruby
  • jruby –S gem install mdarray

MDArray Homepages:

Contributors:

Contributors are welcome.

MDArray History:

  • 30/Dec/2014: Version 0.5.5.2 - Fix for StringArray
  • 16/Nov/2014: Version 0.5.5.1 - Small bug fix
  • 14/Nov/2013: Version 0.5.5 - Support for linear algebra methods
  • 07/Aug/2013: Version 0.5.4 - Support for reading and writing NetCDF-3 files
  • 24/Jun/2013: Version 0.5.3 – Over 90% Performance improvements for methods imported from Parallel Colt and over 40% performance improvements for all other methods (implemented in Ruby);
  • 16/Mai/2013: Version 0.5.0 - All loops transferred to Java with over 50% performance improvements. Descriptive statistics from Parallel Colt;
  • 19/Apr/2013: Version 0.4.3 - Fixes a simple, but fatal bug in 0.4.2. No new features;
  • 17/Apr/2013: Version 0.4.2 - Adds simple statistics and boolean operators;
  • 05/Apr/2013: Version 0.4.0 – Initial release.