  <?xml version="1.0"?>
<journal>
 <journal_metadata>
  <full_title>Journal of Intelligent Systems and Internet of Things</full_title>
  <abbrev_title>JISIoT</abbrev_title>
  <issn media_type="print">2690-6791</issn>
  <issn media_type="electronic">2769-786X</issn>
  <doi_data>
   <doi>10.54216/JISIoT</doi>
   <resource>https://www.americaspg.com/journals/show/2524</resource>
  </doi_data>
 </journal_metadata>
 <journal_issue>
  <publication_date media_type="print">
   <year>2019</year>
  </publication_date>
  <publication_date media_type="online">
   <year>2019</year>
  </publication_date>
 </journal_issue>
 <journal_article publication_type="full_text">
  <titles>
   <title>Outlier Management and its Impact on Diabetes Prediction: A Voting Ensemble Study</title>
  </titles>
  <contributors>
   <organization sequence="first" contributor_role="author">Department of CSE, PVP Siddhartha Institute of Technology, Vijayawada, A.P, India</organization>
   <person_name sequence="first" contributor_role="author">
    <given_name>Aditi</given_name>
    <surname>Aditi</surname>
   </person_name>
   <organization sequence="first" contributor_role="author">Department of IT, Dhanekula Institute of Engineering &amp;Technology, Vijayawada, A.P, India</organization>
   <person_name sequence="additional" contributor_role="author">
    <given_name>Kotte</given_name>
    <surname>Sandeep</surname>
   </person_name>
   <organization sequence="first" contributor_role="author">Department of CSE, Koneru Lakshmaiah Education Foundation, Vaddeswaram, AP, India</organization>
   <person_name sequence="additional" contributor_role="author">
    <given_name>N. Raghavendra</given_name>
    <surname>Sai</surname>
   </person_name>
   <organization sequence="first" contributor_role="author">Department of Computer Science and Engineering, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India; IEEE Senior Member, Symbiosis International (Deemed University), Pune, India</organization>
   <person_name sequence="additional" contributor_role="author">
    <given_name>Aditi</given_name>
    <surname>Sharma</surname>
   </person_name>
   <organization sequence="first" contributor_role="author">Middle East College, Knowledge Oasis Muscat, Oman</organization>
   <person_name sequence="additional" contributor_role="author">
    <given_name>Jitendra</given_name>
    <surname>Pandey</surname>
   </person_name>
   <organization sequence="first" contributor_role="author">Canadian Institute for Cybersecurity, University of New Brunswick, Canada</organization>
   <person_name sequence="additional" contributor_role="author">
    <given_name>Vikas</given_name>
    <surname>Chouhan</surname>
   </person_name>
  </contributors>
  <jats:abstract xml:lang="en">
   <jats:p>The chronic metabolic disorder known as diabetes mellitus, which is defined by hyperglycemia, poses a significant threat to the health of people all over the world. The categorization is broken down into two primary categories: Type 1 and Type 2, with each category having its own unique causes and approaches to treatment. It is very necessary for the effective management of illnesses to have both the prompt detection and the exact prediction of outcomes. The applications of machine learning and data mining are becoming increasingly important as tools in this setting. The current research study analyses the usage of machine learning models, specifically Voting Ensembles, for the goal of predicting diabetes. Specifically, the researchers were interested in how accurate these models were. Using GridSearchCV, the Voting Ensemble, which consists of LightGBM, XGBoost, and AdaBoost, is fine-tuned to manage outliers. This may be done with or without the Interquartile Range (IQR) pre-processing. The results of a comparative analysis of performance, which is carried out, illustrate the benefits that are linked with outlier management. According to the findings, the Voting Ensemble model, when paired with IQR pre-processing, possesses greater accuracy, precision, and AUC score, which makes it more acceptable for predicting diabetes. Despite this, the strategy that does not use the IQR continues to be a workable and reasonable alternative. The current study emphasizes both the significance of outlier management within the area of healthcare analytics and the effect of data preparation procedures on the accuracy of prediction models. Both of these topics are brought up because of the relevance of the current work.</jats:p>
  </jats:abstract>
  <publication_date media_type="print">
   <year>2024</year>
  </publication_date>
  <publication_date media_type="online">
   <year>2024</year>
  </publication_date>
  <pages>
   <first_page>08</first_page>
   <last_page>19</last_page>
  </pages>
  <doi_data>
   <doi>10.54216/JISIoT.120101</doi>
   <resource>https://www.americaspg.com/articleinfo/18/show/2524</resource>
  </doi_data>
 </journal_article>
</journal>
