1,718
Views
42
CrossRef citations to date
0
Altmetric
Original Article

Statistical transfer learning: A review and some extensions to statistical process control

, , &

References

  • Argyriou, A., T. Evgeniou, and M. Pontil. 2007a. Multi-task feature learning. In Advances in neural information processing systems, B., Schölkopf, J., Platt, T., Hoffman (Eds.), Vol. 19, pp. 41–48. Cambridge: MIT Press.
  • Blangiardo, M., and M. Cameletti. 2015. Spatial and spatio-temporal Bayesian models with R-INLA. John Wiley & Sons.
  • Blei, D. M., A. Kucukelbir, and J. D. McAuliffe. 2016. Variational inference: A review for statisticians. arXiv preprint arXiv 1601:00670.
  • Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993–1022.
  • Blitzer, J., M. Dredze, and F. Pereira. 2007, June. Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In ACL ( Vol. 7, pp. 440–447) Stroudsburg: Association for Computational Linguistics.
  • Blitzer, J., R. McDonald, and F. Pereira. 2006, July. Domain adaptation with structural correspondence learning. In Proceedings of the 2006 conference on empirical methods in natural language processing (pp. 120–128). Stroudsburg: Association for Computational Linguistics.
  • Bonilla, E. V., K. M. A. Chai, and C. K. Williams. 2007, December. Multi-task Gaussian process prediction. In NIPs ( Vol. 20, pp. 153–160). Vancouver: Neural Information Processing Systems.
  • Bouveyron, C., and J. Jacques. 2014. Adaptive mixtures of regressions: Improving predictive inference when population has changed. Communications in Statistics-Simulation and Computation 43 (10):2570–2592.
  • Breiman, L. 1996. Bagging predictors. Machine learning 24 (2):123–140.
  • Castagliola, P., and F. Tsung. 2005. Autocorrelated SPC for non‐normal situations. Quality and Reliability Engineering International 21 (2):131–161.
  • Cressie, N., and C. K. Wikle. 2015. Statistics for spatio-temporal data. Hoboken: John Wiley & Sons.
  • Cheng, L., F. Tsung, and A. Wang. 2017. A transfer learning perspective for modeling shape deviations in additive manufacturing. IEEE Robotics and Automation Letters 2 (4):1988–1993.
  • Dai, W., Q. Yang, G. R. Xue, and Y. Yu. 2008, July. Self-taught clustering. In Proceedings of the 25th International Conference on Machine Learning (pp. 200–207). New York: ACM.
  • Dai, W., Q. Yang, G. R. Xue, and Y. Yu. 2007, June. Boosting for transfer learning. In Proceedings of the 24th International Conference on Machine Learning (pp. 193–200). New York: ACM.
  • Dalla Valle, L., and R. S. Kenett. 2015. Official Statistics Data Integration for Enhanced Information Quality, Quality and Reliability Engineering International Vol. 31, No. 7:pp. 1281–1300.
  • Efron, B., and R. J. Tibshirani. 1994. An introduction to the bootstrap. CRC Press.
  • Evgeniou, A., and M. Pontil. 2007. Multi-task feature learning. Advances in Neural Information Processing Systems 19:41.
  • Evgeniou, T., and M. Pontil. 2004, August. Regularized multi–task learning. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 109–117). New York: ACM.
  • Freund, Y., and R. E. Schapire. 1995, March. A desicion-theoretic generalization of on-line learning and an application to boosting. In European conference on computational learning theory (pp. 23–37). Heidelberg: Springer Berlin Heidelberg.
  • Gilks, W. R., S. Richardson, and D. Spiegelhalter. Eds.).1995. Markov chain Monte Carlo in practice. Boca Raton: CRC Press.
  • Gonçalves, A. R., P. Das, S. Chatterjee, V. Sivakumar, F. J. Von Zuben, and A. Banerjee. 2014, November. Multi-task sparse structure learning. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (pp. 451–460). New York: ACM.
  • Gonçalves, A. R., F. J. Von Zuben, and A. Banerjee. 2016. Multi-task sparse structure learning with Gaussian copula models. Journal of Machine Learning Research 17 (33):1–30.
  • Huang, S., J. Li, K. Chen, T. Wu, J. Ye, X. Wu, and L. Yao. 2012. A transfer learning approach for network modeling. IIE Transactions 44 (11):915–931.
  • Huang, Q., H. Nouri, K. Xu, Y. Chen, S. Sosina, and T. Dasgupta. 2014. Statistical predictive modeling and compensation of geometric deviations of three-dimensional printed products. Journal of Manufacturing Science and Engineering 136 (6):061008.
  • Huang, Q., J. Zhang, A. Sabbaghi, and T. Dasgupta. 2015. Optimal offline compensation of shape shrinkage for three-dimensional printing processes. IIE Transactions 47 (5):431–441.
  • Jaiswal, S., J. Bunker, and L. Ferreira. 2010. Influence of platform walking on BRT station bus dwell time estimation: Australian analysis. Journal of Transportation Engineering, 136 (12), 1173–1179.
  • Jiang, J., and C. Zhai. 2007, June. Instance weighting for domain adaptation in NLP. In ACL ( Vol. 7, pp. 264–271). Stroudsburg: Association for Computational Linguistics.
  • Jin, O., N. N. Liu, K. Zhao, Y. Yu, and Q. Yang. 2011, October. Transferring topical knowledge from auxiliary long texts for short text clustering. In Proceedings of the 20th ACM international conference on Information and knowledge management (pp. 775–784). New York: ACM.
  • Kamishima, T., M. Hamasaki, and S. Akaho. 2009, December. TrBagg: A simple transfer learning method and its application to personalization in collaborative tagging. In Data Mining, 2009. ICDM'09. Ninth IEEE International Conference on (pp. 219–228). Piscataway: IEEE.
  • Kenett, R. S. 2016. On generating high InfoQ with Bayesian networks, Quality Technology and Quantitative Management 13 (3).
  • Kenett, R. S., and G.. Shmueli. 2016. Information quality: the potential of data and analytics to generate knowledge, Hoboken: John Wiley and Sons.
  • Lawrence, N. D., and J. C. Platt. 2004, July. Learning to learn with the informative vector machine. In Proceedings of the Twenty-First International Conference on Machine Learning (p. 65). New York: ACM.
  • Lee, S. I., V. Chatalbashev, D. Vickrey, and D. Koller. 2007, June. Learning a meta-level prior for feature relevance from multiple related tasks. In Proceedings of the 24th International Conference on Machine Learning (pp. 489–496). New York: ACM.
  • Li, J., C. Zou, and F. Tsung. 2009, August. Monitoring multivariate binomial data via log-linear models. Proceedings of the 1 st INFORMS International Conference on Service Science. Catonsville: INFORMS.
  • Liao, X., Y. Xue, and L. Carin. 2005, August. Logistic regression with an auxiliary data source. In Proceedings of the 22 nd International Conference on Machine Learning (pp. 505–512). New York: ACM.
  • Lin, D., X. An, and J. Zhang. 2013. Double-bootstrapping source data selection for instance-based transfer learning. Pattern Recognition Letters 34 (11):1279–1285.
  • Liu, J., S. Ji, and J. Ye. 2009, June. Multi-task feature learning via efficient l 2, 1-norm minimization. In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence (pp. 339–348). Corvallis: AUAI Press.
  • Liu, H., M. Palatucci, and J. Zhang. 2009, June. Blockwise coordinate descent procedures for the multi-task lasso, with applications to neural semantic basis discovery. In Proceedings of the 26th Annual International Conference on Machine Learning (pp. 649–656). New York: ACM.
  • Lu, J., V. Behbood, P. Hao, H. Zuo, S. Xue, and G. Zhang. 2015. Transfer learning using computational intelligence: a survey. Knowledge-Based Systems 80:14–23.
  • Luan, H., and Q. Huang. 2015, August. Predictive modeling of in-plane geometric deviation for 3D printed freeform products. In Automation Science and Engineering (CASE), 2015 IEEE International Conference on (pp. 912–917). Piscataway: IEEE.
  • Lütkepohl, H. 2005. New introduction to multiple time series analysis. Springer Science & Business Media.
  • Maurer, A., M. Pontil, and B. Romera-Paredes. 2013, January. Sparse coding for multitask and transfer learning. In ICML (2) (pp. 343–351). Princeton: The International Machine Learning Society.
  • Mei, Y. 2010. Efficient scalable schemes for monitoring a large number of data streams. Biometrika 97 (2):419–433.
  • Pan, S. J., and Q. Yang. 2010. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22 (10):1345–1359.
  • Pan, S. J., V. W. Zheng, Q. Yang, and D. H. Hu. 2008, July. Transfer learning for wifi-based indoor localization. In Association for the advancement of artificial intelligence (AAAI) workshop (p. 6). Palo Alto: The Association for the Advancement of Artificial Intelligence.
  • Pan, S. J., J. T. Kwok, and Q. Yang. 2008, July. Transfer Learning via Dimensionality Reduction. In AAAI ( Vol. 8, pp. 677–682). Palo Alto: The Association for the Advancement of Artificial Intelligence.
  • Pan, W., E. W. Xiang, N. N. Liu, and Q. Yang. 2010, July. Transfer Learning in Collaborative Filtering for Sparsity Reduction. In AAAI ( Vol. 10, pp. 230–235). Palo Alto: The Association for the Advancement of Artificial Intelligence.
  • Pu, F., J. Ma, D. Zeng, X. Xu, and N. Chen. 2015. Early warning of abrupt displacement change at the Yemaomian landslide of the Three Gorge Region, China. Natural Hazards Review 16 (4):04015004.
  • Psarakis, S., and G. E. A. Papaleonida. 2007. SPC procedures for monitoring autocorrelated processes. Quality Technology and Quantitative Management 4 (4):501–540.
  • Raina, R., A. Battle, H. Lee, B. Packer, and A. Y. Ng. 2007, June. Self-taught learning: transfer learning from unlabeled data. In Proceedings of the 24th International Conference on Machine Learning (pp. 759–766). New York: ACM.
  • Ren, H., J. Long, Z. Gao, and P. Orenstein. 2012. Passenger assignment model based on common route in congested transit networks. Journal of Transportation Engineering 138 (12):1484–1494.
  • Rubinstein, R. Y., and D. P. Kroese. 2016. Simulation and the Monte Carlo method. Hoboken: John Wiley and Sons.
  • Samarov, D. V., D. Allen, J. Hwang, Y. Lee, and M. Litorja. 2016. A Coordinate Descent based approach to solving the Sparse Group Elastic Net. Technometrics (just-accepted).
  • Samarov, D. V., J. Hwang, and M. Litorja. 2015. The spatial lasso with applications to unmixing hyperspectral biomedical images. Technometrics 57 (4):503–513.
  • Schwaighofer, A., V. Tresp, and K. Yu. 2004. Learning Gaussian process kernels via hierarchical Bayes. In Advances in Neural Information Processing Systems (pp. 1209–1216). Vancouver: Neural Information Processing Systems.
  • Setti, J. R., and B. G. Hutchinson. 1994. Passenger-terminal simulation model. Journal of Transportation Engineering 120 (4):517–535.
  • Shao, C., J. Ren, H. Wang, J. J. Jin, and S. J. Hu. 2017. Improving machined surface shape prediction by integrating multi-task learning with cutting force variation modeling. Journal of Manufacturing Science and Engineering 139 (1):011014.
  • Song, P., W. Zheng, J. Liu, J. Li, and X. Zhang. 2015, November. A novel speech emotion recognition method via transfer PCA and sparse coding. In Chinese Conference on Biometric Recognition (pp. 393–400). Berlin: Springer International Publishing.
  • Song, Z., K. Zhang, and F. Tsung. 2017. A Multi-task learning approach for improved forecast of passenger inflow rates into a URT System. Working paper.
  • Taylor, M. E., and P. Stone. 2009. Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research 10 (Jul):1633–1685.
  • Tseng, S. T., N. J. Hsu, and Y. C. Lin. 2016. Joint modeling of laboratory and field data with application to warranty prediction for highly reliable products. IIE Transactions 48 (8):710–719.
  • Wainwright, M. J., and M. I. Jordan. 2008. Graphical models, exponential families, and variational inference. Foundations and Trends® in Machine Learning 1 (1–2):1–305.
  • Wang, A., S. Song, Q. Huang, and F. Tsung. 2017. In-plane shape-deviation modeling and compensation for fused deposition modeling processes. IEEE Transactions on Automation Science and Engineering 14 (2):968–976.
  • Wei, Y., Y. Zheng, and Q. Yang. 2016, August. Transfer knowledge between cities. In Proceedings of the 22 nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1905–1914). New York: ACM.
  • Weiss, K., T. M. Khoshgoftaar, and D. Wang. 2016. Transfer Learning Techniques. In Big Data Technologies and Applications (pp. 53–99). Springer International Publishing.
  • Wu, P., and T. G. Dietterich. 2004, July. Improving SVM accuracy by training on auxiliary data sources. In Proceedings of the Twenty-first International Conference on Machine Learning (p. 110). New York: ACM.
  • Xu, Q., and Q. Yang. 2011. A survey of transfer and multitask learning in bioinformatics. Journal of Computing Science and Engineering 5 (3):257–268.
  • Yang, X., and L. Chen. 2010. Using multi-temporal remote sensor imagery to detect earthquake-triggered landslides. International Journal of Applied Earth Observation and Geoinformation 12 (6):487–495.
  • Yao, Y., and G. Doretto. 2010, June. Boosting for transfer learning with multiple sources. In Computer vision and pattern recognition (CVPR), 2010 IEEE conference on (pp. 1855–1862). Piscataway: IEEE.
  • Yu, K., V. Tresp, and A. Schwaighofer. 2005, August. Learning Gaussian processes from multiple tasks. In Proceedings of the 22 nd international conference on Machine learning (pp. 1012–1019). New York: ACM.
  • Zhai, C. 2008. Statistical language models for information retrieval. Synthesis Lectures on Human Language Technologies 1 (1):1–141.
  • Zhang, K., and F. Tsung. 2017. A Multi-task learning framework integrating non-contemporaneous autoregressive models. Working paper.
  • Zhang, Y., and D. Y. Yeung. 2014. A regularization approach to learning task relationships in multitask learning. ACM Transactions on Knowledge Discovery from Data (TKDD) 8 (3):12.
  • Zhang, Y., D. Y. Yeung, and Q. Xu. 2010. Probabilistic multi-task feature selection. In Advances in neural information processing systems (pp. 2559–2567). Vancouver: Neural Information Processing Systems.
  • Zhao, W. X., J. Jiang, J. Weng, J. He, E. P. Lim, H. Yan, and X. Li. 2011, April. Comparing twitter and traditional media using topic models. In European Conference on Information Retrieval (pp. 338–349). Heidelberg: Springer Berlin Heidelberg.
  • Zou, N., Y. Zhu, J. Zhu, M. Baydogan, W. Wang, and J. Li. 2015. A Transfer Learning Approach for Predictive Modeling of Degenerate Biological Systems. Technometrics 57 (3):362–373.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.