Analysis and Prediction of Breast Cancer using AzureML Platform

Khaldoon Alshouiliy, Abhishek Shivanna, Sujan Ray, Ali Alghamdi, Dharma P. Agrawal

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Nowadays, healthcare sector starts relying on the datasets that are collected by clinics or some organizations to help doctors in predicting and analyzing the patient's status in early stage. There are many dangerous diseases around the world that people suffer from them, but one of the most dangerous diseases is cancer. Recent research shows that about 12% US women over the course of their life, develop invasive breast cancer. Thus, in this case, the breast cancer (BC) is categorized as a dangerous type among all cancer types. This study focuses on BC by using a well-known dataset titled Breast Cancer Wisconsin (Diagnostic) Data Set. It has 32 attributes and 569 instances. Some of those attributes have missing values and others are not necessary for our work. So, we removed the ID column and any instance that has a missing value. Our aims in this research is analyzing BC dataset and understand its features. Then, we upload it to Microsoft Azure machine learning (AzureML) platform for building our model. We use two classes Decision Jungle and two Classes Decision machine learning algorithms to predicate whether the patient diagnose is Benign or Malignant. We assess the performance of each algorithms in terms of different measures like Accuracy, Precision, Recall, F1 and AUC. The results of our study in this paper show that the accuracy of Decision Jungle is approximately 97%. On the other hand, the accuracy of Decision tree is approximately 95%.

Original languageEnglish
Title of host publication2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2019
EditorsSatyajit Chakrabarti, Himadri Nath Saha
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages212-218
Number of pages7
ISBN (Electronic)9781728125305
DOIs
StatePublished - Oct 2019
Externally publishedYes
Event10th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2019 - Vancouver, Canada
Duration: Oct 17 2019Oct 19 2019

Publication series

Name2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2019

Conference

Conference10th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2019
Country/TerritoryCanada
CityVancouver
Period10/17/1910/19/19

Keywords

  • Analysis
  • AzureML
  • Breast Cancer
  • Decision Tree
  • Jungle Tree
  • Machine Learning
  • Prediction UCI dataset

Fingerprint

Dive into the research topics of 'Analysis and Prediction of Breast Cancer using AzureML Platform'. Together they form a unique fingerprint.

Cite this