European Journal of Computer Science and Information Technology (EJCSIT)

EA Journals

Logistic regression

Integrated Machine Learning Model for Comprehensive Heart Disease Risk Assessment Based on Multi-Dimensional Health Factors (Published)

Cardiovascular diseases (CVDs) have long been one of the leading causes of death globally. New technologies such as Machine Learning (ML) algorithms can help with the early detection and prevention of CVDs. This study focuses on using different ML models to determine a person's risk of developing CVDs from their personal lifestyle factors. The study extracted and processed 438,693 records from the 2021 Behavioral Risk Factor Surveillance System (BRFSS) from World Health Organization (WHO). The data were then partitioned into training and testing sets with a ratio of 0.8:0.2, so that unseen data were available to evaluate the trained model. One problem the study faced was imbalance among the classes, which was addressed by using sampling techniques to balance the data for the ML models. The performance of the ML models was evaluated using stratified 10-fold cross-validation, and the best model was Logistic Regression (LR), with an F1 score of 0.32564. The LR model was then subjected to hyperparameter tuning and achieved its best score of 0.3257 with C = 0.1. Feature importance was also generated from the LR model; the features with the most impact were Sex, Diabetes, and the General Health of an individual. The final LR model was then evaluated on the testing data and achieved an F1 score of 0.33. A confusion matrix was also used to better visualize the performance: the LR model correctly classified 79.18% of people with CVDs and 73.46% of healthy people. The ROC curve was also used as a performance metric, and the LR model achieved an AUC of 0.837. The LR model can be used in the medical field and could be made more useful by adding medical attributes to the data. Overall, this study provides insight that can help in predicting the risk of CVDs using only the personal attributes of an individual.
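The data-handling steps in this abstract (a stratified 0.8:0.2 split, rebalancing by sampling, and metrics read off a confusion matrix) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the abstract does not say which sampling technique was used, so random oversampling of the minority class is assumed here, and the function names, toy labels, and confusion-matrix counts are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_ratio=0.2, seed=0):
    """Return (train_idx, test_idx), splitting each class separately so both
    parts keep roughly the original class proportions."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    train_idx, test_idx = [], []
    for idxs in by_class.values():
        rng.shuffle(idxs)
        n_test = round(len(idxs) * test_ratio)
        test_idx.extend(idxs[:n_test])
        train_idx.extend(idxs[n_test:])
    return train_idx, test_idx


def random_oversample(indices, labels, seed=0):
    """Assumed sampling technique: duplicate randomly chosen indices of the
    smaller classes until every class matches the largest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i in indices:
        by_class[labels[i]].append(i)
    target = max(len(idxs) for idxs in by_class.values())
    balanced = []
    for idxs in by_class.values():
        balanced.extend(idxs)
        balanced.extend(rng.choices(idxs, k=target - len(idxs)))
    return balanced


def binary_metrics(tp, fp, fn, tn):
    """Recall, specificity, precision and F1 from confusion-matrix counts."""
    recall = tp / (tp + fn)            # share of actual positives found
    specificity = tn / (tn + fp)       # share of actual negatives found
    precision = tp / (tp + fp)         # share of predicted positives that are right
    f1 = 2 * precision * recall / (precision + recall)
    return {"recall": recall, "specificity": specificity,
            "precision": precision, "f1": f1}


# Hypothetical toy labels: 1 = CVD, 0 = healthy, with 10% positives.
labels = [1] * 100 + [0] * 900
train_idx, test_idx = stratified_split(labels)
# Only the training part is rebalanced; the test part keeps its original mix.
balanced_train = random_oversample(train_idx, labels)

# Hypothetical test-set counts: high recall and specificity, yet a modest F1,
# because false positives outnumber true positives when positives are rare.
metrics = binary_metrics(tp=16, fp=48, fn=4, tn=132)
```

With these made-up counts, recall is 0.80 and specificity about 0.73, while precision is only 0.25, which pulls F1 down to about 0.38. The same mechanism is one plausible reading of how an F1 score near 0.33 can coexist with the per-class accuracies reported above on imbalanced data.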

Keywords: Logistic regression, cardiovascular diseases, hyper-parameter tuning, imbalance classification, machine learning algorithms

Predicting Student University Admission Using Logistic Regression (Published)

The primary purpose of this study is to discuss the prediction of student admission to university from numerous factors using logistic regression. Many prospective students apply for Master’s programs. The admission decision depends on criteria set by the particular college or degree program. The independent variables in this study will be measured statistically to predict graduate school admission. Data exploration and analysis, if successful, would yield predictive models that allow better prioritization in the screening of applicants to Master’s degree programmes, which in turn helps grant admission to the right candidates.

Keywords: Logistic regression, college admission, data analytics, predictive analysis

AN INTEGRATED APPROACH TOWARDS A PENETRATION TESTING FOR CYBERSPACES (Published)

Attacks on computer systems with the intention of finding security weaknesses, and potentially gaining access to the system, its functionality and its data, are becoming increasingly frequent and ever more sophisticated. Organizations wishing to ensure the security of their systems may look towards adopting appropriate tests to protect themselves against potential security breaches. One such test is to hire the services of penetration testers (or “pen-testers”) to find vulnerabilities, here in the case study of the “Cairo Cleaning and Beautification Agency”, and to provide recommendations on how best to mitigate such risks. The work uses a series of standards built on the application of data mining methods, specifically the decision trees model, logistic regression, the association rules model, and Bayesian networks, to provide a reference for penetration testers. This paper discusses the definition and role of the modern pen-tester and summarises current standards and professional qualifications. The paper further identifies issues arising from pen-testers, whose motivation is to improve security.

Keywords: Bayesian network, Logistic regression, Penetration testing, association rules model, cyber security, vulnerability assessments, decision trees model
