摘要(英) |
Advances in technology have allowed us to quickly collect streaming time-series data using sensors in the machine. Data mining can help us find useful information from thousands of data and assist managers to make appropriate decisions.
In data mining, anomaly detection is one of the popular technologies. In the past research, point anomalies and subsequence anomalies have been widely discussed, and the whole time series of time series anomaly detection has been rarely mentioned.
In the semiconductor industry, in the process of the cutting silicon ingot, the occurrence of anomalies will damage the product and cause a lot of money and delay in delivery.
Therefore, it is necessary to find the anomalies as soon as possible and let the operators perform maintenance and stop.
Our research data is taken from the machine data of the semiconductor industry factory. In this study, we define the label presented in binary and probability. When the label presented in binary, we find anomaly point with the help of a classification model. Accumulation of sliding windows is performed through anomaly points. We proposed a new alarm rule, which is used to design a monitoring scheme. In the label presented in probability, we define the label as the non-anomaly probability. With the help of the regression model, we observe the difference between the normal and abnormal in the past, propose a corresponding monitoring scheme, and use it as issuing an alarm.
Finally, we using cross validation to evaluate the performance and robustness of the system by catching rate and false alarm. |
參考文獻 |
[1] Ahmada Subutai, Lavina Alexander, Purdya Scott, Aghaab Zuha "Unsupervised real-time anomaly detection for streaming data." Neurocomputing, 2017.
[2] Akcay Samet , Atapour-Abarghouei Amir , Breckon Toby. "GANomaly: Semi-supervised anomaly detection via adversarial training." Asian Conference on Computer Vision, 2018
[3] Aggarwal Charu "Outlier Analysis (2 ed.)." Springer, 2016.
[4] Benkabou Seif-Eddine, Benabdeslem Khalid and Canitia Bruno. 2018. "Unsupervised outlier detection for time series by entropy and dynamic time warping." Knowledge
and Information Systems 54, 2 2018, 463–486.
[5] Breiman Leo, Friedman Jerome, Stone Charles J., Olshen R.A. "Classification and regression trees." Monterey, CA: Wadsworth & Brooks/Cole Advanced Books & Software, 1984.
[6] Bengio Yoshua, Courville Aaron, and Vincent Pascal. "Representation Learning: A Review and New Perspectives." IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013.
[7] Breiman, L. "Arcing The Edge" . Technical Report 486. Statistics Department, 1997.
[8] Chawla Nitesh V., Japkowicz Nathalie,Japkowicz Nathalie, Kotcz Aleksander. "Editorial: special issue on learning from imbalanced data sets. " SIGKDD Explorations 6, 1-6. 2004.
[9] Chalapath Raghavendra, Chawla Sanjay. "Deep Learning for Anomaly Detection: A Survey. " arXiv:1901.03407v2 23. 2019.
[10] Cortes Corinna and Vapnik Vladimir. "Support-vector networks". Machine Learning, 1995.
[11] Dasgupta Dipankar and Majumdar N. S. "Anomaly detection in multidimensional data using negative selection algorithm." In Proceedings of the IEEE Conference on Evolutionary Computation. 2002.
[12] Fox A. J. "Outliers in Time Series. " Journal of the Royal Statistical Society: Series B (Methodological) 34, 3, 1972.
[13] Friedman, J. H. "Greedy Function Approximation: A Gradient Boosting Machine". The Annals of Statistics, 1999.
[14] Friedman, J. H. "Stochastic Gradient Boosting". Computational Statistics & Data Analysis, 1999.
[15] Georgoulas George, Karvelis Petros, Loutas Theodoros, Styliosa Chrysostomos. "Rolling element bearings diagnostics using the Symbolic Aggregate approximation." Mech.Syst.SignalProcess. 2015.
[16] Gers Felix A., Schmidhuber Jurgen, Cummins Fred. "Learning to forget: Continual prediction with LSTM". 9th International Conference on Artificial Neural Networks, 1999.
[17] García Ane, Conde Angel, Mori Usue, Lozano Jose. "A review on outlier/anomaly detection in time series data." ACM Computing Surveys. 2020.
[18] Hochreiter Sepp and Schmidhuber Jürgen. "Long short-term memory". Neural Computation. 1997.
[19] Hsu Chia-Yu, Chien Chen-Fu and Chen Pei-Nong. "Manufacturing intelligence for early warning of key equipment excursion for advanced equipment control in semiconductor manufacturing. " Journal of the Chinese Institute of Industrial Engineers, 2012.
[20] Jones Michael, Nikovski Daniel, Imamura Makoto and Hirata Takahisa. "Exemplar learning for extremely efficient anomaly detection in real-valued time series. " Data Mining and Knowledge Discovery, 2016
[21] Lin Jessica, Keogh Eamonn, Lonardi Stefano and Chiu Bill, "A Symbolic Representation of Time Series, with Implications for Streaming Algorithms." Data Mining and Knowledge Discovery. 2003.
[22] Lindemann Benjamin, Fesenmyr Fabian, Jazdi Nasser, Weyrich Michael. "Anomaly detection in discrete manufacturing using self-learning approaches. " Procedia CIRP, 2018.
[23] Munir Mohsin, Siddiqui Shoaib Ahmed, Dengel Andreas and Ahmed Sheraz. "DeepAnT: A Deep Learning Approach for Unsupervised Anomaly Detection in Time Series." IEEE Access, 2019.
[24] Malhotra Pankaj, Vig Lovekesh, Shroff Gautam, Agarwal Puneet. "Long Short Term Memory Networks for Anomaly Detection in Time Series." Computational Intelligence and Machine Learning, 2015.
[25] Pearl Judea. "Bayesian Networks: A Model of Self-Activated Memory for Evidential Reasoning." Proceedings of the 7th Conference of the Cognitive Science Society, 1985.
[26] Rebbapragada Umaa, Protopapas Pavlos, Brodley Carla, Alcock Charles. "Finding anomalous periodic time series: An application to catalogs of periodic variable stars. " Machine Learning, 2009.
[27] Quinlan Ross. "Induction of Decision Trees." Mach. Learn,1986
[28] Quinlan Ross. "C4.5: Programs for Machine Learning." Morgan Kaufmann Publishers, 1993.
[29] Song Shaoxu, Zhang Aoqian, Wang Jianmin, Yu Philip. "SCREEN: Stream Data Cleaning under Speed Constraints." In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, 2015.
[30] Tsay Ruey S. "Outliers, Level Shifts, and Variance Changes in Time Series. " Journal of Forecasting 7" ,1988.
[31] Tsay Ruey, Peña Daniel and Pankratz Alan. "Outliers in multivariate time series. Biometrika",2000.
[32] Yan Weizhong and Yu Lijie. "On Accurate and Reliable Anomaly Detection for Gas Turbine Combustors: A Deep Learning Approach." Annual Conference of the Prognostics and Health Management Society, 2015. |