- "Towards Optimal One Pass Large Scale Learning with Averaged Stochastic Gradient Descent" Wei Xu (2011)
- "Large-scale Image Classification: Fast Feature Extraction and SVM Training" Yuanqing Lin, Fengjun Lv, Shenghuo Zhu, Ming Yang, Timothee Cour, Kai Yu, Liangliang Cao, and Thomas Huang (CVPR 2011)
- "Large-Scale Machine Learning with Stochastic Gradient Descent" Leon Bottou (2010)