Review paper on Outlier Detection using Machine Learning Technique

Authors

  • Siddarth Pagariya, Dr. Rachna Kulhare

Keywords:

Outlier Detection (OD), Data Mining, Machine Learning (ML)

Abstract

OD is a Data Mining Application. Anomaly contains boisterous information which is explored in different areas. The different strategies are now being explored that is more conventional. We reviewed on different procedures and uses of OD that gives an original methodology that is more helpful for the novices. Machine learning methods are widely used for prediction and classification tasks in medical diagnosis. The classification of a disease with greater precision and efficiency for disease diagnosis are the goals of ML methods. The life support equipment and systems for patients are expanding gradually. Human life expectancy rises as a result of this growth. Yet, these medical care frameworks face the few difficulties and issues like deceiving patients' data, protection of information, absence of exact information, absence of medico data, classifiers for expectation and some more. Numerous disease diagnosis and prediction systems, including expert systems, clinical prediction systems, decision support systems, and personal health record systems, have been developed to address these issues. The objective of the proposed system is to assist physicians in making accurate diagnoses of heart and diabetes conditions.

References

Abdallah Abdellatif, Hamdan Abdellatef, Jeevan Kanesan, Chee-Onn Chow, Joon Huang Chuah and Hassan Muwafaq Gheni, “An Effective Heart Disease Detection and Severity Level Classification Model Using Machine Learning and Hyper parameter Optimization Methods”, IEEE Access, Vol. 10, 2022.

H. B. Kibria and A. Matin, “The severity prediction of the binary and multi-class cardiovascular disease machine learning-based fusion approach”, Comput. Biol. Chem., vol. 98, Art. no. 107672, 2022.

G. Magesh and P. Swarnalatha, “Optimal feature selection through a cluster-based DT learning (CDTL) in heart disease prediction”, Evol. Intell., vol. 14, no. 2, pp. 583-593, Jun. 2021.

Kangqing Yu, Wei Shi and Nicola Santoro, “Designing a Streaming Algorithm for Outlier Detection in Data Mining—An Incremental Approach”, Sensor, MDPI 2020.

Yu, K.; Shi, W.; Santoro, N.; Ma, X., “Real-time Outlier Detection over Streaming Data”, In Proceedings of the IEEE Smart World Congress, Leicester, UK, 19–23 August 2019.

N. Khateeb and M. Usman, “Efficient heart disease prediction system using K-nearest neighbor classification technique”, in Proc. Int. Conf. Big Data Internet Thing (BDIOT), pp. 21-26, 2017.

Nonso Nnamoko and Ioannis Korkontzelos, “Efficient treatment of outliers and class imbalance for diabetes prediction”, Artificial Intelligence in Medicine, Volume 104, pp. 01-12, 2020.

Muhammad Fazal Ijaz, Ganjar Alfian, Muhammad Syafrudin and Jongtae Rhee, “Hybrid Prediction Model for Type 2 Diabetes and Hypertension Using DBSCAN-Based Outlier Detection, Synthetic Minority Over Sampling Technique (SMOTE), and Random Forest”, Applied Science, Vol. 8, pp. 01-22, 2018.

Wang, Q.; Luo, Z.; Huang, J.; Feng, Y.; Liu, Z., “A novel ensemble method for imbalanced data learning: Bagging of extrapolation-SMOTE SVM”, Comput. Intell. Neurosci., 2017.

Hanifah, F.; Wijayanto, H.; Kurnia, A., “SMOTE Bagging Algorithm for Imbalanced Dataset in Logistic Regression Analysis”, Appl. Math. Sci., 9, pp. 6857–6865, 2015.

Tantithamthavorn, C.; Hassan, A.; Matsumoto, K., “The impact of class rebalancing techniques on the performance and interpretation of defect prediction models”, arXiv, 2018.

Sadik, S.; Gruenwald, L., “DBOD-DS: Distance based outlier detection for data streams”, In Proceedings of the 21st International Conference on Database and Expert Systems Applications (DEXA), Bilbao, Spain, 30 August–3 September, pp. 122–136, 2010.

Sadik, S.; Gruenwald, L., “Online Outlier Detection for Data Streams”, In Proceedings of the 15th Symposium on International Database Engineering & Applications Symposium (IDEAS), Lisboa, Portugal, 21–23, pp. 88–96, 2011.

Sadik, S.; Gruenwald, L.; Leal, E., “In pursuit of outliers in multi-dimensional data streams”, In Proceedings of the 4th IEEE International Conference on Big Data (Big Data), Washington, DC, USA, 5–8 December 2016; pp. 512–521.

Forrest, S., Esponda, F., and Helman, P., “A formal framework for positive and negative detection schemes”, In IEEE Transactions on Systems, Man and Cybernetics, Part B. IEEE, pp. 357–373, 2004.

Gonzalez, F. A. and Dasgupta, D., “Outlier detection using real-valued negative selection”, Genetic Programming and Evolvable Machines, Vol. 4, No. 4, pp. 383–403, 2003.

Lee, W., Stolfo, S. J., and Mok, K. W., “Adaptive intrusion detection: A data mining approach”, Artificial Intelligence Review, Vol. 14, No. 6, pp. 533–567, 2000.

Gwadera, R., Atallah, M. J., and Szpankowski, W., “Reliable detection of episodes in event sequences”, Knowledge and Information Systems, Vol. 7, No. 4, pp. 415–437, 2005.

Chow, C. and Yeung, D.-Y., “Parzenwin down network intrusion detectors”, In Proceedings of the 16th International Conference on Pattern Recognition, Vol. 4, IEEE Computer Society, Washington, DC, USA, 40385, 2002.

Lin, J., Keogh, E., Fu, A., and Herle, H. V., “Approximations to magic: Finding unusual medical time series”, In Proceedings of the 18th IEEE Symposium on Computer-Based Medical Systems, IEEE Computer Society, Washington, DC, USA, pp. 329–334, 2005.

Adeli, E, Thung, K. H., An, L., Wu, G., Shi, F., Wang, T. & Shen, D., “Semi-supervised discriminative classification robust to sample-outliers and feature noises”, IEEE Transactions on Pattern Analysis and Machine Intelligence (Early Access), pp. 1-1, 2018.

Ahmad, S., Lavin, A., Purdy, S. & Agha, Z., “Unsupervised real-time anomaly detection for streaming data”, Neurocomputing, vol. 262, pp. 134–147, 2017.

Alcalá-Fdez, J., Alcalá, R., Gacto, M. J. & Herrera, F., “Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms”, Fuzzy Sets and Systems, vol. 160, no. 7, pp. 905–921, 2009.

Au, W. H. & Chan, K. C., “Mining fuzzy association rules in a bank account database”, IEEE Transactions on Fuzzy Systems, vol. 11, no. 2, pp. 238–248, 2003.

Cai, R., Liu, M., Hu, Y., Melton, B. L., Matheny, M. E., Xu, H., Duan, L. & Waitman, L. R., “Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports”, Artificial Intelligence in Medicine, vol. 76, pp. 7–15, 2017.

Downloads

How to Cite

Siddarth Pagariya, Dr. Rachna Kulhare. (2023). Review paper on Outlier Detection using Machine Learning Technique. International Journal of Research & Technology, 11(4), 104–109. Retrieved from https://ijrt.org/j/article/view/219

Similar Articles

1 2 3 4 5 6 7 8 > >> 

You may also start an advanced similarity search for this article.