Predicting water potability using machine learning classification methods

Authors

  • Abdelaziz SAHRAOUI University of ABBES Laghrour Khenchela, Algeria
  • Roufaida SOUAKRI University of Shahid Mustapha Benboulaid Batna 2, Algeria

Keywords:

Water Potability, Machine Learning, Classification,Logistic Regression, Support Vector Machines (SVM)

Abstract

Ensuring access to safe drinking water remains a critical global challenge, making the ability to assess water potability both efficiently and accurately increasingly important. This study explores the use of supervised machine learning techniques to predict the potability of water based on a set of physicochemical attributes, including pH, hardness, turbidity, dissolved solids, and various chemical concentrations. Two classification models are examined: logistic regression, which provides a linear and interpretable decision boundary, and Support Vector Machines (SVM), which offer greater flexibility by capturing complex, non-linear relationships within the data. A systematic evaluation framework is employed to compare both models using key performance indicators such as accuracy, recall, precision, and F1-score. These metrics allow for a comprehensive understanding of each model’s strengths, limitations, and capacity to generalize to unseen samples. The analysis aims not only to identify the more effective algorithm but also to highlight the potential of machine learning as a reliable tool for environmental monitoring and water quality assessment. The findings contribute to ongoing efforts to automate and enhance potability prediction, thereby supporting informed decision-making in public health and resource management.

References

Bishop, C. M. (2006). Pattern recognition and machine learning. Springer.

Cortes, C., &Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297. https://doi.org/10.1007/BF00994018

Hosmer, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (3rd ed.). Wiley.

Kowsari, K., Brown, D. E., Heidarysafa, M., Meimandi, K. J., Gerber, M. S., & Barnes, L. E. (2019). Text classification algorithms: A survey. Information, 10(4), 150.https://doi.org/10.3390/info10040150

Sahu, A. K., & Kumar, D. (2020). Water quality prediction using machine learning techniques. Environmental Science and Pollution Research, 27(7), 7500–7511. https://doi.org/10.1007/s11356-019-06969-4

World Health Organization. (2017). Guidelines for drinking-water quality (4th ed.). WHO Press.

Chakraborty, S., & Sinha, S. (2021). Predicting water quality using machine learning: A case study. Journal of Environmental Management, 281, 111898. https://doi.org/10.1016/j.jenvman.2020.111898

Pham, T. T., Nguyen, H. T., & Le, T. H. (2020). Comparative analysis of machine learning algorithms for water quality prediction. Environmental Monitoring and Assessment, 192(10), 650. https://doi.org/10.1007/s10661-020-08440-3

Tiwari, S., & Mishra, S. (2019). Machine learning approaches for water quality prediction. Sustainable Water Resources Management, 5(4), 1689–1701. https://doi.org/10.1007/s40899-019-00367-2

Raj, M., & Prasad, K. (2022). Application of SVM and logistic regression in environmental data analysis. Environmental Data Science, 4(2), 45–56.

Li, X., Zhou, Q., & Wang, L. (2018). Assessment of drinking water quality using supervised machine learning algorithms. Journal of Cleaner Production, 198, 1232–1242. https://doi.org/10.1016/j.jclepro.2018.07.056

Published

2025-12-12

How to Cite

SAHRAOUI, A., & SOUAKRI, R. (2025). Predicting water potability using machine learning classification methods. International Journal of Economic Perspectives, 19(12), 19–28. Retrieved from http://www.ijeponline.org/index.php/journal/article/view/1236

Issue

Section

Peer Review Articles