PriMera Scientific Engineering (ISSN: 2834-2550)

Research Article

Volume 6 Issue 1

Predictive Modeling for Breast Cancer Prognosis: A Machine Learning Paradigm

Sourav Mishra* and Vijay K Chaurasiya

December 19, 2024

DOI : 10.56831/PSEN-06-174

Abstract

The menace of breast cancer poses a formidable challenge to global public health, particularly affecting women across diverse regions. Timely identification and precise prognosis are imperative for efficacious treatment and enhanced patient outcomes. Conventional diagnostic methods, such as mammography and biopsy, though widely employed, can be invasive and occasionally yield imprecise results. Within this context, machine learning (ML) algorithms have emerged as a promising avenue for breast cancer prediction. These algorithms demonstrate proficiency in scrutinizing extensive datasets, discerning intricate patterns, and subsequently formulating predictions based on the analyzed information. The research presented in this paper is dedicated to the formulation of a sophisticated predictive model for breast cancer utilizing ML algorithms. The dataset utilized encompasses comprehensive clinical and imaging data from patients diagnosed with breast cancer. Subsequent to the extraction of pertinent features from the dataset, rigorous preprocessing procedures will precede the training and testing phases of the ML models. The primary objective of this study is to identify the most accurate algorithm for predicting breast cancer. A comprehensive evaluation of various ML algorithms, including logistic regression, decision trees, random forests, and neural networks, will be undertaken to assess their efficacy in breast cancer prediction. Logistic regression, a statistical method adept at analyzing datasets with one or more independent variables and a binary outcome variable, will be employed in discerning crucial factors such as age, family history, and prior cancer diagnoses in predicting breast cancer. Decision trees, an alternative ML algorithm for classification tasks, leverage a hierarchical structure to classify data based on a sequence of decisions derived from input features. Random forests, an extension of decision trees, employ multiple trees to enhance model accuracy, each trained on a random subset of the dataset. Neural networks, inspired by the intricate architecture of the human brain, comprise interconnected layers of nodes processing input data to generate predictions. The learning mechanism involves adjusting the weights of inter-node connections based on training data. The evaluation of ML algorithm performance will be based on standard metrics including accuracy, precision, recall, and F1-score. These metrics serve as robust indicators of the model’s effectiveness in accurately predicting breast cancer. The identification of pivotal features contributing to breast cancer prediction within this study is anticipated to yield insights into the potential applications of ML algorithms in this domain, contributing significantly to the development of precise prediction models for breast cancer. In summary, this research endeavor, focusing on the prediction of breast cancer using ML algorithms, holds promise for enhancing both diagnosis and treatment of this debilitating condition. The creation of precise prediction models employing clinical and imaging data can empower healthcare providers to identify individuals at elevated risk promptly and initiate appropriate interventions. The outcomes of this study may play a pivotal role in advancing more effective breast cancer screening programs and ultimately improving patient outcomes.

Keywords: Breast cancer; Machine learning; Predictive model; Clinical data; Diagnosis

References

  1. M Brown., et al. “Screening mammogra phy in community practice”. Amer. J. Roentgen 165 (1995).
  2. M Alhadidi, M Al-Gawagzeh and B Alsaaidah. “Solving A Mam mography Problems of Breast Cancer Detection Using Artificial Neural Networks and Image Processing techniques”. Indian Journal of Science and Technology 5.4 (2012).
  3. ED Ubeyli. “Implementing automated diagnostic systems for breast cancer detection”. in Expert systems with applications, Elsevier 33 (2007).
  4. D Kulkarni, S Bhagyashree and G Udupi. “Texture analysis of mam mographic images”. International Journal of Computer Applications 5 (2010).
  5. T Acharya and A Ray. “Image processing: principles and applications”. Hoboken new jersey: Wiley-Interscience (2005).
  6. Y Tsehay., et al. “Biopsy-guided learning with deep convolutional neural networks for Prostate Cancer detection on multiparametric MRI”. 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), Melbourne, VIC (2017): 642-645.
  7. MR Al-Hadidi, A Alarabeyyat and M Alhanahnah. “Breast Cancer Detection Using K-Nearest Neighbor Machine Learning Algorithm”. 2016 9th International Conference on Developments in Systems Engineering (DeSE), Liverpool (2016): 35-39.
  8. HR Mhaske and DA Phalke. “Melanoma skin cancer detection and classification based on supervised and unsupervised learning”. 2013 International conference on Circuits, Controls and Communications (CCUBE), Bglore (2013): 1-5.
  9. Sri Hari Nallamala, Siva Kumar Pathuri and Suvarna Vani Koneru. “A Literature Survey on Data Mining Approach to Effectively Handle Cancer Treatment”. International Journal of Engineering Technology (IJET) (UAE) 7. 2.7 (2018): 729-732.
  10. Sri Hari Nallamala, Pragnyaban Mishra and Suvarna Vani Koneru. “Qualitative Metrics on Breast Cancer Diagnosis with Neuro Fuzzy Inference Systems”. International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) 8.2 (2019): 259 -264.