/testcaffe

Architecture files for Caffe

Primary LanguageShell

Hardwares tested

  • CPU: Intel Xeon E5-4620 (Sandy-bridge) (2.2GHz, 16MB, 7.2GT/s)
  • GPU: GTX Titan (2688 cores, 6GB)
  • Files

    MNIST

  • `mnist_9850`: 'F:1000'
  • `mnist_9862`: 'F:600 - F:600'
  • `mnist_9863`: 'F:1000' w dropout
  • `mnist_9869`: 'F:600 - F:600' w dropout
  • `mnist_9890`: 'C:12@9x9 - P:2x2'
  • `mnist_9913`: 'C:20@9x9 - P:2x2 - F:100'
  • `mnist_9923`: 'C:20@5x5 - P:2x2 - C:40@5x5 - P:2x2 - F:1000 - F:1000'
  • `mnist_9925`: 'C:20@9x9 - P:2x2 - F:1000'
  • `mnist_9930`: 'C:20@9x9 - P:2x2 - F:1000' w dropout
  • `mnist_9931`: 'C:20@5x5 - P:2x2 - C:40@5x5 - P:2x2 - F:1000'
  • `mnist_9934`: 'C:20@5x5 - P:2x2 - C:40@5x5 - P:2x2 - F:1000' w dropout
  • `mnist_9936`: 'C:32@5x5 - P:2x2 - C:64@5x5 - P:2x2 - F:600'
  • `mnist_9943`: 'C:20@5x5 - P:2x2 - C:40@5x5 - P:2x2 - F:1000 - F:1000' w dropout
  • Results

    running times per 1 epoch (=600 iters)

    Architecture Caffe CPU Caffe GPU
    'F:1000' 21s 1s
    'F:600 - F:600' 22s 1s
    'C:12@9x9 - P:2x2' 36s 10s
    'C:20@9x9 - P:2x2
    - F:1000'
    1m 45s 11s
    'C:20@5x5 - P:2x2
    - C:40@5x5 - P:2x2
    - F:1000'
    2m 21s 18s
    'C:20@5x5 - P:2x2
    - C:40@5x5 - P:2x2
    - F:1000 - F:1000'
    2m 30s 19s
    'C:32@5x5 - P:2x2
    - C:64@5x5 - P:2x2
    - F:600'
    20s

    test accuracy (10 runs, 40 epochs, relu, SGD)

    Architecture w/o dropout w dropout w adv w adv & dropout
    'F:1000' 98.509 pm 0.042 98.633 pm 0.061
    'F:600 - F:600' 98.626 pm 0.061 98.690 pm 0.079
    'C:12@9x9 - P:2x2' 98.901 pm 0.086 -
    'C:20@9x9 - P:2x2
    - F:100'
    99.134 pm 0.043
    'C:20@9x9 - P:2x2
    - F:1000'
    99.249 pm 0.045 99.302 pm 0.044
    'C:20@5x5 - P:2x2
    - C:40@5x5 - P:2x2
    - F:1000'
    99.305 pm 0.056 99.341 pm 0.051
    'C:20@5x5 - P:2x2
    - C:40@5x5 - P:2x2
    - F:1000 - F:1000'
    99.238 pm 0.036 99.433 pm 0.072 99.360 pm 0.045 99.437 pm 0.048
    'C:32@5x5 - P:2x2
    - C:64@5x5 - P:2x2
    - F:600'
    99.363 pm 0.033

    Results in literatures

    Architecture Description Test Accuracy
    'F:800' Standard Neural Net, sigmoid 98.40
    'F:1024 - F:1024' Dropout 98.75
    'F:1024 - F:1024' Dropout + max-norm constraint 98.94
    'F:4096' Dropout + max-norm constraint 98.99
    'F - F' Maxout + Dropout 99.06
    'F:500 - F:500 - F:2000' DBN + Dropout finetuning, sigmoid 99.08
    'F:500 - F:500 - F:2000' DBM + Dropout finetuning, sigmoid 99.21
    'C - C - C - P' Maxout + Dropout 99.55
    'F:800 - F:800' DropConnect 98.80 pm 0.034
    'F:800 - F:800' x 5 + voting 98.88
    'C:32 - C:64 - F:150' DropConnect 99.37 pm 0.035
    'C:32 - C:64 - F:150' x 5 + voting 99.43
    'C:32 - C:64 - F:150' DropConnect + rotation scaling 99.72 pm 0.032
    'C:32 - C:64 - F:150' x 5 + voting 99.79

    references

  • DropConnect
  • Wan et al., *ICML*, 2013.
  • Dropout, max-norm constraint
  • Srivastava et al., *JMLR*, 2014.
  • Maxout
  • Goodfellow et al., *ICML*, 2013.
  • Standard Neural Net
  • Simard et al., *ICDAR*, 2003.