IISE Transactions Volume 52, 2020 - Issue 1

Quality & Reliability Engineering

Supervised subgraph augmented non-negative matrix factorization for interpretable manufacturing time series data analytics

Hongyue Sun, Department of Industrial and Systems Engineering, University at Buffalo, Buffalo, NY; Correspondence: [email protected]

http://orcid.org/0000-0003-2871-5502

Ran Jin, Grado Department of Industrial and Systems Engineering, Virginia Tech, Blacksburg, VA

http://orcid.org/0000-0003-3847-4538

Yuan Luo, Department of Preventive Medicine, Northwestern University, Chicago, IL

Pages 120-131 | Received 19 Jul 2018, Accepted 03 Feb 2019, Published online: 06 May 2019

https://doi.org/10.1080/24725854.2019.1581389

Abstract

Data analytics has been extensively used for manufacturing time series to reduce process variation and mitigate product defects. However, the majority of data analytics approaches are hard to understand for humans who do not have a data analysis background. Many manufacturing conditions, such as troubleshooting, need situation-dependent responses and are mainly handled by humans. Therefore, it is critical to discover insights from the time series and present them to a human operator in an interpretable format. We propose a novel Supervised Subgraph Augmented Non-negative Matrix Factorization (Super-SANMF) approach to represent and model manufacturing time series. We use a graph representation to approximate a human’s description of time series changing patterns and identify frequent subgraphs as common patterns. The appearances of the subgraphs in the time series are organized in a count matrix, in which each row corresponds to a time series and each column corresponds to a frequent subgraph. Super-SANMF then identifies groups of subgraphs as features that minimize the Kullback–Leibler divergence between the measured and approximated matrices. In case studies, the learned features yield prediction accuracy (normal or defective) comparable to that of the widely used basis expansion approaches (such as spline and wavelet), and are easy for humans to memorize and understand.
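To make the factorization step in the abstract more concrete, the following is a minimal sketch of plain, unsupervised non-negative matrix factorization that minimizes the generalized Kullback–Leibler divergence using the standard multiplicative updates. It is not the authors' Super-SANMF: the supervision term and the subgraph mining step are omitted, and the function names (`kl_divergence`, `nmf_kl`), the rank, and the toy count matrix are all illustrative assumptions rather than anything taken from the article.

```python
import numpy as np


def kl_divergence(V, WH, eps=1e-12):
    """Generalized KL divergence D(V || WH) between non-negative matrices."""
    V = np.asarray(V, dtype=float)
    return float(np.sum(V * np.log((V + eps) / (WH + eps)) - V + WH))


def nmf_kl(V, rank, n_iter=200, seed=0, eps=1e-12):
    """Plain NMF (no supervision) via multiplicative updates for the KL loss.

    V    : (n_series, n_subgraphs) non-negative count matrix
    rank : number of subgraph groups (latent features) to learn
    Returns W (series x groups), H (groups x subgraphs), and the loss history.
    """
    V = np.asarray(V, dtype=float)
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, rank)) + 0.1  # strictly positive initialization
    H = rng.random((rank, m)) + 0.1
    history = []
    for _ in range(n_iter):
        WH = W @ H + eps
        # Update W: weight each entry by the ratio V / (WH), normalized by row sums of H.
        W *= ((V / WH) @ H.T) / (H.sum(axis=1) + eps)
        WH = W @ H + eps
        # Update H: same idea, normalized by column sums of W.
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        history.append(kl_divergence(V, W @ H))
    return W, H, history


# Hypothetical count matrix: 5 time series (rows) x 4 frequent subgraphs (columns).
V = np.array([
    [3, 0, 2, 0],
    [2, 1, 3, 0],
    [0, 4, 0, 3],
    [0, 3, 1, 4],
    [1, 0, 2, 0],
])

W, H, history = nmf_kl(V, rank=2)
print("final KL divergence:", history[-1])
print("subgraph groups (rows of H):")
print(np.round(H, 2))
```

In this sketch, each row of `H` can be read as one group of subgraphs and each row of `W` as how strongly a time series expresses each group. A supervised variant would additionally tie `W` to the normal/defective labels, which is not attempted here.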

Keywords:

Interpretable data analytics
manufacturing time series
subgraph augmented matrix factorization

Additional information

Funding

Hongyue Sun is partially supported by the Sustainable Manufacturing and Advanced Robotics Technologies Community of Excellence (SMART CoE) at the University at Buffalo.

Notes on contributors

Hongyue Sun

Hongyue Sun received a B.E. degree in mechanical engineering and automation from the Beijing Institute of Technology, Beijing, China, in 2012, and an M.S. degree in statistics and a Ph.D. degree in industrial engineering from Virginia Tech, Blacksburg, VA, USA, in 2015 and 2017, respectively. He is an assistant professor with the Department of Industrial and Systems Engineering, University at Buffalo, Buffalo, NY, USA. His research interests are data analytics for advanced manufacturing processes and energy systems. He is a member of INFORMS, IISE, IEEE and ASME.

Ran Jin

Ran Jin received his Ph.D. degree in industrial engineering from Georgia Tech (2011), his master’s degrees in industrial engineering (2007) and in statistics (2009), both from the University of Michigan, and his bachelor’s degree in electronic engineering from Tsinghua University (2005). He is an assistant professor at the Grado Department of Industrial and Systems Engineering at Virginia Tech. His research interests are in engineering-driven data fusion for manufacturing system modeling and performance improvements, such as the integration of data mining methods and engineering domain knowledge for multistage system modeling and variation reduction, and sensing, modeling, and optimization based on spatially correlated responses. He is a member of INFORMS, IISE, ASME, TMS, SME and IEEE.

Yuan Luo

Yuan Luo is an assistant professor at the Department of Preventive Medicine, Division of Health & Biomedical Informatics (at Feinberg School of Medicine) with courtesy appointments in IEMS and EECS (both at McCormick School of Engineering). He earned his PhD degree from MIT EECS in 2015. His research interests include machine learning, natural language processing, time series analysis, computational genomics and big data analytics, with a focus on medical applications. He is a member of Association for the Advancement of Artificial Intelligence (AAAI), American Association for the Advancement of Science (AAAS) and American Medical Informatics Association (AMIA). He was also a member of the Student Editorial Board for Journal of the American Medical Informatics Association.
