Scala-points

These notes complement:

Twitter's Effective Scala, and more:
Databricks Scala Style
Scala Best Practices
A list of code warts (if some parts look scary, treat them at least as warnings.)
Neophyte's Guide

Flavored:

www.javamex.com by Neil Coffey

1.val initialization order:

http://docs.scala-lang.org/tutorials/FAQ/initialization-order.html

When overriding hashCode, make sure every val involved is already initialized, i.e. not null.

2.pitfall of mapValues and filterNot etc.

https://issues.scala-lang.org/browse/SI-4776

Similar: Map.keySet, Map.values

Some methods do not create a new Map; they return a lazy view (a MapView) instead, so the supplied function is re-applied on every access.
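A minimal sketch of the pitfall, assuming Scala 2.12 or 2.13 (where mapValues still exists but is deprecated, so expect a warning). The names MapValuesPitfall, calls, prices and viewed are made up for illustration.

```scala
object MapValuesPitfall {
  var calls = 0 // counts how often the mapping function runs

  val prices = Map("a" -> 1, "b" -> 2)

  // mapValues only wraps the original map; nothing is computed here.
  val viewed = prices.mapValues { v => calls += 1; v * 2 }

  // Three lookups through the view run the function three times.
  def lookupThreeTimes(): Int = {
    calls = 0
    viewed("a"); viewed("a"); viewed("a")
    calls
  }

  // map builds a new Map right away: one call per entry, none per lookup.
  def strictCopy(): Int = {
    calls = 0
    val strict = prices.map { case (k, v) => calls += 1; k -> v * 2 }
    strict("a"); strict("a"); strict("a")
    calls
  }
}
```

With two entries in prices, the view costs one call per lookup while the strict copy costs one call per entry, no matter how often it is read.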

(Deprecated)3.Disable Mylyn in Eclipse to speed up Scala compilation.

Simply go to Preferences -> Mylyn and disable every Auto* option.

4.lazy argument

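f(x: => B): C takes x by name: x is evaluated only when f actually uses it, and again on every use. If the expensive computation should run at most once, assign x to a lazy val inside f.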
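A small sketch of how a by-name argument behaves, with a counter standing in for the expensive computation. ByNameDemo and all of its members are hypothetical names.

```scala
object ByNameDemo {
  var evaluations = 0

  def expensive(): Int = { evaluations += 1; 42 }

  // x is re-evaluated at every reference.
  def twice(x: => Int): Int = x + x

  // Caching the by-name argument in a lazy val evaluates it at most once.
  def twiceCached(x: => Int): Int = {
    lazy val cached = x
    cached + cached
  }

  // x is never referenced, so the argument is never evaluated.
  def ignore(x: => Int): Int = 0
}
```

Calling twice(expensive()) runs expensive twice, twiceCached(expensive()) runs it once, and ignore(expensive()) does not run it at all.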

5.Enum

a.(recommended)Use case objects as the enumeration, together with enumeratum

b.Override Enumeration (http://stackoverflow.com/questions/2507273/overriding-scala-enumeration-value) so that you can inherit your own Value creation method in a subclass or inner class
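A plain-Scala sketch of option a, without the enumeratum dependency. The values list and withName lookup are written by hand here; enumeratum exists to provide that kind of boilerplate for you, and its API is deliberately not shown. Color and its members are illustrative names.

```scala
sealed trait Color { def name: String }

object Color {
  case object Red extends Color { val name = "red" }
  case object Green extends Color { val name = "green" }
  case object Blue extends Color { val name = "blue" }

  // Maintained by hand in this sketch.
  val values: Seq[Color] = Seq(Red, Green, Blue)

  def withName(name: String): Option[Color] = values.find(_.name == name)

  // Because Color is sealed, the compiler can warn about a missing case here.
  def isWarm(c: Color): Boolean = c match {
    case Red          => true
    case Green | Blue => false
  }
}
```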

6.evaluation style

//BAD:
val fun=expensiveFunction.cheapFunction(_:T)  //every time fun is called, both functions are evaluated
//GOOD:
val result=expensiveFunction
val fun=result.cheapFunction(_:T)
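The difference can be observed with a counter. In this sketch expensiveFunction is a hypothetical def returning a String, and String.concat plays the role of cheapFunction.

```scala
object EvaluationStyle {
  var expensiveCalls = 0

  // Stand-in for the expensive computation.
  def expensiveFunction: String = { expensiveCalls += 1; "prefix-" }

  // The placeholder expands to (s: String) => expensiveFunction.concat(s),
  // so expensiveFunction runs on every call of fun.
  def callsWithBadStyle(n: Int): Int = {
    expensiveCalls = 0
    val fun = expensiveFunction.concat(_: String)
    (1 to n).foreach(i => fun(i.toString))
    expensiveCalls
  }

  // Evaluating once up front leaves only the cheap call inside the function.
  def callsWithGoodStyle(n: Int): Int = {
    expensiveCalls = 0
    val result = expensiveFunction
    val fun = result.concat(_: String)
    (1 to n).foreach(i => fun(i.toString))
    expensiveCalls
  }
}
```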

7.use .view properly

import com.typesafe.scalalogging.LazyLogging

object TempTest extends App with LazyLogging {
  def timer(f: () => Any): Unit = {
    val time1 = System.currentTimeMillis()
    f()
    val time2 = System.currentTimeMillis()
    logger.info(f.toString + " done, time consumed:" + (time2 - time1))
  }

  val f1 = () => {
    r(List(1, 2, 3, 4, 5),1000)
  }
  val f2 = () => {
    r(List(1, 2, 3, 4, 5).view,1000).view.force
  }
  
  def r(l: Seq[Int], n: Int):Seq[Int] = {
    if (n == 0) l
    else r(l.map(_ + 1), n - 1)
  }
  
  timer(f1)
  timer(f2)
}

f2 is faster than f1, but if you change the number of iterations from 1000 to 10000, the nested views overflow the stack (StackOverflowError) when forced.

8.if procedure control

//BAD:
if(condition1)
  val a=x
else
  val a=y
  
//GOOD:
val a=if(condition1) x else y

9.package sealed trait

package foo
sealed trait I
private[foo] trait I2 extends I

I is sealed, so it can only be extended in its own source file; I2 is private[foo], so it can be implemented within package foo, and only there.

10.abuse of closure

val map1:Map[Key1,Map[Key2,Value]]=someMap1
val map2:Map[Key1,Map[Key2,Function]]=someMap2

val mapWithProcessedValue=map1.map{
  e1=>
    (e1._1,e1._2.map{
      e2=>
        val fun=map2(e1._1)(e2._1) //Bad, but not worst
        (e2._1,fun(e2._2)) //apply fun to value
    })
}

val funToFind=map2(_:Key1)(_:Key2)
val mapWithProcessedValue=map1.map{
  e1=>
    val partialFun=funToFind(e1._1,_:Key2)
    (e1._1,e1._2.map{
      e2=>
        val fun=partialFun(e2._1) //You might expect this to be faster; in reality it is the slowest
        (e2._1,fun(e2._2)) //apply fun to value
    })
}

val mapWithProcessedValue=map1.map{
  e1=>
    val subMap2=map2(e1._1)
    (e1._1,e1._2.map{
      e2=>
        val fun=subMap2(e2._1) //Good.
        (e2._1,fun(e2._2)) //apply fun to value
    })
}

11.AOP in scala

trait Channel {
  def send(x: String): Unit = println(x)
}
object LogAspect {
  trait LogBefore extends Channel {
    // before advice: log first, then delegate
    abstract override def send(x: String): Unit = { log(); super.send(x) }
  }
  trait LogAfter extends Channel {
    // after advice: delegate first, then log
    abstract override def send(x: String): Unit = { super.send(x); log() }
  }
  def log(): Unit = println("logging!")
}
object AopDemo {
  def main(args: Array[String]): Unit = {
    val channel = new Channel with LogAspect.LogBefore
    channel.send("message")
  }
}

12.Be careful about iterator.map

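//not good (map on an Iterator is lazy and the Iterator can be traversed only once, so it may already have changed the next time you access it):
iterator.map
//better (materialize first; toList or toVector is the safest choice, since toSeq is lazy on some Scala versions):
iterator.toSeq.map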
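A short sketch of what goes wrong, using made-up names. The mapped iterator yields its elements once; a second traversal sees nothing, while a materialized Vector can be read as often as needed.

```scala
object IteratorPitfall {
  // Returns the sizes seen by a first and a second traversal of a mapped Iterator.
  def traverseTwice(): (Int, Int) = {
    val mapped = Iterator(1, 2, 3).map(_ + 1) // lazy: nothing is consumed yet
    val first = mapped.toList                 // consumes the iterator
    val second = mapped.toList                // nothing is left
    (first.size, second.size)
  }

  // Materializing first gives a stable collection.
  def materializedTwice(): (List[Int], List[Int]) = {
    val mapped = Iterator(1, 2, 3).toVector.map(_ + 1)
    (mapped.toList, mapped.toList)
  }
}
```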

13."Mixin composition"

Favor inheritance (trait mixin) over composition for immutable objects. The only thing trait mixin lacks compared with composition is the ability to swap components at run time, and immutability rules that out anyway.

trait A {def delegateDo()}
trait B extends A //B not necessary
trait B1 extends B{def delegateDo()=doSomeThing1}
trait B2 extends B{def delegateDo()=doSomeThing2}
val immutableA=condition match{
  case c1 => new A with B1
  case c2 => new A with B2
}

Use the condition to choose which trait to mix in, instead of injecting a composed component.

14.Recursive sequence traversal

Using an index cursor instead of shrinking the sequence has a slight performance advantage (Scala 2.11.7).
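The two styles side by side, as a sketch only: no benchmark is included, and the names are invented. Both functions compute the same sum; the second keeps the Vector intact and advances an index instead of calling tail on every step.

```scala
import scala.annotation.tailrec

object Traversal {
  // Shrinking style: every step passes a new, shorter sequence.
  @tailrec
  def sumShrinking(xs: Vector[Int], acc: Long = 0L): Long =
    if (xs.isEmpty) acc else sumShrinking(xs.tail, acc + xs.head)

  // Cursor style: the sequence stays as it is, only the index moves.
  def sumWithCursor(xs: Vector[Int]): Long = {
    @tailrec
    def loop(i: Int, acc: Long): Long =
      if (i == xs.length) acc else loop(i + 1, acc + xs(i))
    loop(0, 0L)
  }
}
```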

15.Type inference in generic method:

implicit class ExMap[A, B](val in: Map[A, B]) {
  def innerJoin[B2, C](that: Map[A, B2],f: (B, B2) => C): Map[A, C] //type B2 must be given explicitly at the call site
  def innerJoinInfer[B2, C](that: Map[A, B2])(f: (B, B2) => C): Map[A, C] //type B2 in f can be inferred from that: Map[A,B2]
}
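A runnable sketch of the two signatures with a made-up join body (collect over the keys present in both maps). With a single parameter list the lambda's parameter types are written out, which is what Scala 2 requires; newer compilers may infer more. With two parameter lists, B2 is fixed by the first list before f is type-checked.

```scala
object JoinSyntax {
  implicit class ExMap[A, B](val in: Map[A, B]) {
    // One parameter list: that and f are type-checked together.
    def innerJoin[B2, C](that: Map[A, B2], f: (B, B2) => C): Map[A, C] =
      in.collect { case (k, v) if that.contains(k) => k -> f(v, that(k)) }

    // Two parameter lists: B2 is known from `that` before f is looked at.
    def innerJoinInfer[B2, C](that: Map[A, B2])(f: (B, B2) => C): Map[A, C] =
      innerJoin(that, f)
  }

  val ages = Map("ann" -> 30, "bob" -> 25)
  val cities = Map("ann" -> "Oslo")

  // Parameter types spelled out.
  val joinedExplicit = ages.innerJoin(cities, (age: Int, city: String) => s"$age/$city")

  // Parameter types inferred.
  val joinedInferred = ages.innerJoinInfer(cities)((age, city) => s"$age/$city")
}
```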

16.String interpolation encountered with regex.

The s-interpolator doesn't recognize triple-quoted string parts as raw: escape sequences are still processed. See http://stackoverflow.com/a/25633978/5172925
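One way around it, sketched with the standard raw interpolator, which substitutes variables but leaves backslashes alone. RegexInterpolation, digits and matches are illustrative names.

```scala
object RegexInterpolation {
  val digits = 3

  // With the s-interpolator the \d escape would be rejected, even inside triple quotes.
  // raw keeps the backslash and still substitutes $digits.
  val pattern = raw"""\d{$digits}""".r

  def matches(s: String): Boolean = pattern.pattern.matcher(s).matches()
}
```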

17.Arguments order of constructor.

Arguments of the same type should not be placed next to each other when the order does not otherwise matter, because swapping them by accident still compiles.

//Sometimes not so good:
case class Person(name:String,motto:String,age:Int)
//Better:
case class Person(name:String,age:Int,motto:String)

18.Beware when using java API.

The conventions of Java APIs may deviate from Scala's, e.g. java.io.File.listFiles returns null when the directory cannot be read. Read the Javadoc carefully.
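A minimal sketch of guarding such a call, assuming an empty result is an acceptable answer for an unreadable directory. SafeListing is a made-up helper.

```scala
import java.io.File

object SafeListing {
  // Option(...) turns the null that listFiles may return into None.
  def listFiles(dir: File): Seq[File] =
    Option(dir.listFiles()).map(_.toSeq).getOrElse(Seq.empty)
}
```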

19.Duck typing.

See this link

20.Name tuples mapping(avoid calling ._N on Tuples):

Map(k->v).map(e=> e._1 ...  e._2 ...) //bad: low readability
Map(k->v).map{case (someKey,someValue)=> ...} //good

The same applies to Seq[TupleN].

def methodThatReturnsTuple():(A,B,C)

val abc = methodThatReturnsTuple()

//context: when consuming the tuple as a whole:
def consume1(abc:(A,B,C))
consume1(abc) //no problem

//context: when consuming part of the tuple:
def consume2(a:A,b:B)
consume2(abc._1,abc._2) //bad

val (a,b,_) = methodThatReturnsTuple()
consume2(a,b) //good.

21.Give public field explicit type signature.

See Incremental compilation

Qualified private fields such as private[packageName] are also considered public.

There are exceptions: when the type signature would make the code too unreadable or ugly, and you are sure the field won't change often. In that case, make it a val or final val.

22.Align overloaded methods (if you have to overload).

def method1(a:A)(b:B)
def method1(b:B)
//legal but bad. What if you want to add one more arg group:
def method1(a:A)(b:B)(implicit c:C)
def method1(b:B)(implicit c:C)

method1(aOrb)(bOrc) //What?

//Align them:
def method2(a:A)(b:B)(implicit c:C)
def method2()(b:B)(implicit c:C)

method2()(b) //clear.

23.Returning the "Current" Type in Scala (Restriction on return subtype.)

see: http://tpolecat.github.io/2015/04/29/f-bounds.html

24.Only use "return"s as guards.

see first: http://tpolecat.github.io/2014/05/09/return.html

However, when return is used as a guard on pre-conditions, much as require(condition,"exception msg") is:

//pseudocode:
def check(cred:Credential):Boolean = {
  if(pre-condition-check1) return false
  if(pre-condition-check2) return false
  
  //specific-check..
}

it is more readable than:

def check(cred:Credential):Boolean = {
  if(pre-condition-check1) false
  else if(pre-condition-check2) false
  else {
    //specific-check..
  } 
}

25.Use a helper recursive function to create recursive type evidence.

case class Structure(name: String, children: Seq[Structure] = Nil)
object Structure {
  implicit val evidence: TypeClass[Structure] = (a: Structure) => {
     //one cannot recursively use this evidence here.
  }
}

Use a helper function to help: example
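The linked example is not reproduced here; what follows is only a sketch of the idea, with a hypothetical Show type class standing in for TypeClass. The recursion lives in an ordinary private def, and the implicit evidence just delegates to it.

```scala
// Hypothetical type class, standing in for TypeClass above.
trait Show[A] { def show(a: A): String }

case class Structure(name: String, children: Seq[Structure] = Nil)

object Structure {
  // An ordinary recursive helper: no implicit lookup is needed to recurse.
  private def render(s: Structure): String =
    if (s.children.isEmpty) s.name
    else s.name + s.children.map(render).mkString("(", ", ", ")")

  // The evidence delegates to the helper.
  implicit val evidence: Show[Structure] = new Show[Structure] {
    def show(a: Structure): String = render(a)
  }
}
```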

26.Default empty generic parameter without Option.

Consider an interface method that you provide to client user:

def method[T](mandatory:String, optional:T = some-default-value) //how to make this optional

Of course, you can use an Option:

def method[T](mandatory:String, optional:Option[T] = None)
//client usage:
method("sth",Option(myValue)) //clients don't need to worry about null value of "myValue", good!

But some clients find Option(myValue) a bit verbose and would rather not use it. What would the method look like then?

def method[T](mandatory:String, optional:T = null) //able to compile but bad. null is explicitly used.
def method[T](mandatory:String, optional:T = ()) //not compile, Unit is not a type of T
def method[T](mandatory:String, optional:T = Nothing) //not compile, scala does not have an instance of Nothing

Use a helper def combined with a non-strict (call-by-name) parameter to provide the empty value:

private def Empty: Nothing = throw new IllegalArgumentException("Empty default value called.")
def method[T](mandatory:String, optional: => T = Empty) 
//yeah! The code compiles, and the client will not see null or know what happened.

But note that you need try/Try to check whether this optional was supplied.

We could check type T exhaustively to manually provide empty value:

def method[T: ClassTag](mandatory:String, optional: => T = Empty) 
private def Empty[T: ClassTag]: T ={
    val rc = implicitly[ClassTag[T]].runtimeClass
    val empty = rc match{
      case rc if rc == classOf[String] => ""
      case rc if rc == classOf[Int] => 0
      //etc...
      case rc => throw new AssertionError(s"$rc is not supported!")
    }
    empty.asInstanceOf[T]
}
//stupid, but does solve some of the problem.

Use a custom Option class with implicit conversion:

def method[T: ClassTag](mandatory:String, optional: OptionParam[T] = NoneParam) 

trait OptionParam[+T]
case class SomeParam[T](value:T) extends OptionParam[T]
case object NoneParam extends OptionParam[Nothing]
object OptionParam{
  implicit def enclose[T](v: T): OptionParam[T] = if(v == null) NoneParam else SomeParam(v)
                                                 //or: Option(v) match{...
}

Macro is the ultimate solution:

private def Empty: Nothing = throw new AssertionError("This should not be triggered.")
def method[T](mandatory:String, optional: => T = Empty) = macro MacroImpl.method
//method parameters are checked and manipulated at client's compile time.

27.Use explicit type or implicit conversion to guard toString.

def consume(v:String)  //An api that only accept String.

val myType = {...} //Return "MyType" rather than String, while MyType.toString makes sense.
consume(myType.toString) //Bad!

If myType ever stops being a MyType, you lose compile-time type checking, because toString is a method of Any and the call still compiles.

val myType:MyType = {...} //Good! Compile time type check occurs here.
consume(myType.toString)

val myType = {...}
consume(myType) //Succinct! When type is wrong, conversion will not apply.

private implicit def myType2String(in:MyType):String = in.toString

28. Ad hoc subclass check. (subclass check without TypeTag)

  trait SubClassGauge[A, B] {
    def A_isSubclassOf_B: Boolean
  }

  implicit class IsSubclassOps[A](a: A) {
    def isSubclassOf[B](implicit ev: SubClassGauge[A, B]): Boolean = ev.A_isSubclassOf_B
  }

  trait LowerLevelImplicits {
    implicit def defaultSubClassGauge[A, B] = new SubClassGauge[A, B] {
      override def A_isSubclassOf_B: Boolean = false
    }
  }

  object Implicits extends LowerLevelImplicits {
    implicit def subClassGauge[A <: B, B]: SubClassGauge[A, B] = new SubClassGauge[A, B] {
      override def A_isSubclassOf_B: Boolean = true
    }
  }

  trait Prime
  class NotSuper
  class Super extends Prime
  class Sub extends Super
  class NotSub

import Implicits._
@ (new Sub).isSubclassOf[NotSuper] 
res29: Boolean = false
@ (new Sub).isSubclassOf[Super] 
res30: Boolean = true
@ (new Sub).isSubclassOf[Prime] 
res31: Boolean = true
@ (new Super).isSubclassOf[Prime] 
res32: Boolean = true
@ (new Super).isSubclassOf[Sub] 
res33: Boolean = false
@ (new NotSub).isSubclassOf[Super] 
res34: Boolean = false

29. Ad hoc overloading. (Monkey patch method with different return type.)

object AdHocOverloading extends App {

  implicit class UnboxOps[T, R, B](b: B)(implicit ev: UnboxEv[T, B, R], ev1: B <:< Box[T]) {
    def value: R = ev.unbox(b)
  }

  val optional = Box(Some(3))
  val confident = new Box(Some("C")) with Confidence
  val otherType = Seq("bad")

  optional.value
  confident.value

  //otherType.value //compile time error

  println(optional.value)
  //Some(3)
  println(confident.value)
  //C
}

trait UnboxEv[+T, -B, +R] {
  def unbox(b: B): R
}

trait Confidence
case class Box[+T](v: Option[T]) //v could be private
trait LowLevelImplicitOfBox {
  this: Box.type =>
  implicit def optionEvidence[T]: UnboxEv[T, Box[T], Option[T]] =
    new UnboxEv[T, Box[T], Option[T]] {
      override def unbox(b: Box[T]): Option[T] = b.v
    }
}
object Box extends LowLevelImplicitOfBox {
  implicit def confidentEvidence[T]: UnboxEv[T, Box[T] with Confidence, T] =
    new UnboxEv[T, Box[T] with Confidence, T] {
      override def unbox(b: Box[T] with Confidence): T = b.v.get
    }
}

30. Use more general argument when pattern matching against trait mixin.

trait A
class B

@ new B 
res3: B = ammonite.$sess.cmd1$B@37f627d0
@ res3.asInstanceOf[B with A] 
res4: B with A = ammonite.$sess.cmd1$B@37f627d0
def consume(in:B with A):Unit = println(in) 

@ consume(res4) //it was fooled?
ammonite.$sess.cmd1$B@37f627d0
@ consume(res3) //compile error

What about pattern match:

def consume(in:B with A):Unit = {
  in match{
    case a: A => println("with A")
    case b => println("no A")
  }
}
@ consume(res4) //fooled again?
with A

Make parameter more general:

def consume(in:B):Unit = {
  in match{
    case a: A => println("with A")
    case b => println("no A")
  }
}
@ consume(res4) //correct
no A
@ new B with A 
res2: B with A = ammonite.$sess.cmd2$$anon$1@690df641
@ consume(res2) //correct
with A

Last, use asInstanceOf with great caution.

31.Use outer extractor to reuse value in pattern match guards.

see https://stackoverflow.com/a/3532681/5172925

32.A case class's "copy" method only works with constructor parameters.

copy does not copy internal variables.
copy does not copy mixins.
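Both points can be seen in a small sketch. Point, Tagged and note are illustrative names.

```scala
object CopyDemo {
  trait Tagged

  case class Point(x: Int, y: Int) {
    var note: String = "" // internal state, not a constructor parameter
  }

  val original = new Point(1, 2) with Tagged
  original.note = "checked"

  // copy calls the Point constructor with the given parameters and nothing else.
  val copied = original.copy(x = 10)
}
```

Here copied is a plain Point(10, 2): its note is back to the empty string and it is no longer Tagged.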

33.extend `Product with Serializable` to clean inferred type.

see http://underscore.io/blog/posts/2015/06/04/more-on-sealed.html

34.Define local defs outside the enclosing def where possible.

If the inner def is a pure function, define it outside to avoid accidentally capturing variables of the enclosing def (and other careless errors).

35.Use sequential data structures for performance-intensive tasks.

The CPU caches can load chunks of the array into the cache at a time, and have the CPU access the data directly in the CPU cache once loaded. This is not really possible with a linked list where elements are scattered all over the RAM.

Benchmarking Scala Collections

36.Beware of java8 Stream

java.util.stream.Stream has many methods that look like Scala's, e.g. flatMap, but a Java Stream behaves more like a view, and terminal methods such as count() consume the stream. Read the docs carefully. When reading a file with Files.lines, a Stream that wraps a BufferedReader is returned, and that Stream throws UncheckedIOException. When converting to a Scala stream, e.g. with stream.iterator.asScala, the properties of the java.util.stream.Stream are passed along to whatever Scala representation you end up with.

stream.iterator may not fetch all upstream elements, whereas stream.forEach can behave differently, depending on the implementation of the spliterator.
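A small sketch of the "terminal operation" point, using only java.util.stream.Stream.of and count(). JavaStreamDemo is a made-up name, and the Try wrapper is there only to observe the failure.

```scala
import java.util.stream.{Stream => JStream}
import scala.util.Try

object JavaStreamDemo {
  // Returns the first count and whether a second count on the same stream failed.
  def countTwice(): (Long, Boolean) = {
    val stream: JStream[String] = JStream.of("a", "b", "c")
    val first = stream.count()       // terminal operation: consumes the stream
    val second = Try(stream.count()) // the stream has already been operated upon
    (first, second.isFailure)
  }
}
```

Unlike a Scala collection, the same Java Stream cannot be traversed a second time.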

37.Use java interface in scala project

38.Scope limitation

We could use typeclass to limit the scope in which a mutable method can be called.

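trait Scope //marker type naming the scope (assumed here; the original leaves it undefined)
trait MutableContext[S]

class IamMutable{
  //...
  def mutate()(implicit mutableContext: MutableContext[Scope]): Unit = ()
}

{
  (new IamMutable).mutate() //compile time error: no implicit MutableContext[Scope] in scope
}

{
  implicit val mutableContext: MutableContext[Scope] = new MutableContext[Scope]{}
  (new IamMutable).mutate() //allowed
}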

39. Magnet pattern

http://spray.io/blog/2012-12-13-the-magnet-pattern/