Classification of Environmental Attitudes with Artificial Intelligence Algorithms

Nigmet KOKLU a,* ORCID.org, Suleyman Alpaslan SULAK b ORCID.org

a Vocational School of Technical Sciences, Konya Technical University, Konya, Türkiye

b Ahmet Kelesoglu Educational Faculty, Necmettin Erbakan University, Konya, Türkiye

 ABSTRACT

The study aims to examine people's attitudes towards the environment. Environmental education provides the necessary awareness to effectively address environmental issues. It is stated that attitudes towards the environment are very important and negative attitudes can worsen environmental problems. For this purpose, a dataset was obtained by using a scale consisting of 37 variables to a participant group consisting of 384 people. With this dataset, attitudes towards the environment have been analyzed using various classification algorithms. Logistic Regression (LR), Support Vector Machine (SVM) and Decision Tree (DT) models were used in the research. The LR, SVM, and DT models achieved 94.53%, 92.96%, and 82.55% classification success, respectively It is seen that the classification achievements of the models are at an acceptable level compared to the literature. As a result, the research sheds light on people's attitudes towards the environment through classification processes. Despite the acceptable classification achievements, alternative artificial Intelligence approaches can also be used to improve performance.

  1. INTRODUCTION

Environment is the whole of natural resources and ecosystems that constitute the common existence of humans and are necessary for their survival. Today, environmental problems have reached dangerous dimensions and have become a global crisis [1]. The environment is the totality of values that make up the common existence of people, and environmental problems have reached serious dimensions today. Among the main reasons for this are people's unconscious and selfish attitudes, unconscious practices that have emerged with the development of technology and industry [2]. Rapidly advancing with the 20th century, science and technology have increased the living standards of individuals, while at the same time causing great changes and losses in the environment. Environmental pollution has become an even more serious problem with the increase of the world population in the 21th century [3]. Global cooperation, sustainable development and the adoption of life styles that respect nature are necessary for the solution of environmental problems [4].

Environmental degradation causes negative effects that threaten the quality of life of living beings. External interventions to the ecological system lead to the deterioration of the natural balance and the emergence of various environmental problems [5]. For this reason, it is of great importance to train teachers who have a high environmental sensitivity, sufficient ecological knowledge, and who can successfully conduct theoretical and applied environmental studies. Environmental education plays an important role in this regard and aims to raise conscious individuals in the solution of environmental problems [6].

The main purpose of environmental education is to make individuals aware of their environment and to provide them with knowledge, skills, values and experience to solve environmental problems [7-9]. This education aims to provide individuals with multidimensional life skills related to themselves and their natural environment and aims to raise environmentally conscious people. Especially raising young generations with environmental awareness is critical for a sustainable future [10]. Environmental education aims to provide individuals with multidimensional life skills related to themselves and their natural environment and aims to raise people who are sensitive to the environment. The international community has begun to recognize the need for individuals to receive an effective lifelong environmental education related to the environment and environmental problems [11]. In this context, it is aimed to raise conscious individuals who will contribute to the solution of environmental problems by disseminating environmental education programs in schools and society [12].

With the increasing importance of environmental education, the development of positive attitudes and behaviors towards the environment has also come to the fore. Attitudes are the indirect observation of the integrity of an individual's feelings, thoughts and behavior towards an object [13]. Attitudes towards the environment play an important role in determining the environmental awareness and responsible behavior of individuals towards the environment. Individuals who have a negative attitude towards the environment remain insensitive to environmental problems and cause these problems to continue [14, 15].

Environmental education is necessary in order to ensure the formation of positive attitudes and values towards the environment [11, 16]. Environmental education programs should focus on developing positive attitudes towards the environment in individuals. Studies show that individuals who have received environmental education are more sensitive and solution-oriented towards environmental problems [17].  One of the most important factors that determine people's behavior is their attitude and behavior. The solution of environmental problems is possible not by technology or laws, but by changing individual behaviors. This change can be achieved through environmental education [18, 19].

Environmental education plays a critical role in developing positive attitudes and behaviors towards the environment of individuals. Individuals raised with environmental education will make important contributions to society as individuals who are sensitive to environmental problems, take an active role in solving these problems and have an awareness of protecting the environment [20]. Therefore, environmental education should be disseminated and supported at all levels and in all age groups. Research shows that environmental education programs increase individuals' awareness of the environment and help them adopt sustainable lifestyles [21, 22].

Environmental education provided from an early age creates permanent behavioral changes in individuals and enables raising an environmentally sensitive generation [23]. In addition, environmental education programs for all segments of society promote environmental awareness and active participation of large masses in environmental protection efforts [24]. In this context, the development and implementation of environmental education policies is vital for a sustainable future [25].

  1. Material and Methods

In this study, numerical data related to environmental attitude behavior was analyzed. Generally, the focus is on applying and analyzing various classification algorithms on the data. The flow model demonstrating the operation of the study is given in Figure 1. The dataset used, the classification algorithms employed in the study, the artificial intelligence (AI) of the algorithms, and other relevant details are listed in this section under headings

metin, diyagram, yazı tipi, tasarım içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 1. Schematic representation of the working process

  1. Environmental Attitude Dataset

With the literature review and knowledge accumulation conducted in the relevant field, the parametric factors necessary to create the dataset were determined. Later, a questionnaire template was created with variables that will determine the scales of environmentally responsible behavior. The environmental attitude dataset was obtained on the Internet by means of a questionnaire applied by the authors to a total of 384 people living in Turkey. The created dataset has 37 variables that can determine the state of environmentally responsible behavior. The environmental attitude dataset includes a group of 177 men and 207 women, whose ages vary between 18 and 45, as shown in Figure 2. The environmental attitude dataset included 177 men and 207 women aged between 18 and 45 years old. The fact that the study consisted of 384 people and 37 variables can be seen as a limitation.

metin, ekran görüntüsü, tasarım içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 2. Information belonging to the working group that created the dataset

  1. Performance Measures

The confusion matrix is used to calculate various performance metrics of a classification model, such as accuracy, accuracy, recall, and F1 score, as shown in Table 1. It is a useful tool for analyzing the strengths and weaknesses of a artificial Intelligence model and can help identify areas for improvement [26-28].

Table 1. Evaluation criteria affecting the two-class Confusion matrix

Evaluation Criteria

Definition

True Positive (TP)

TP refers to the cases where the model correctly predicts the class and the actual class is also positive.

False Positive (FP)

FP refers to the cases where the model predicts the class as positive, but the actual class is negative.

True Negative (TN)

TN refers to the cases where the model correctly predicts the class as negative and the actual class is also negative.

False Negative (FN)

FN refers to the cases where the model predicts the class as negative, but the actual class is positive.

In order to evaluate the trained model, Precision (P), Recall (R), F-measure (F) and Accuracy (AC) metrics are used. The calculation of the metrics is performed using the four different measurements given in Figure 3 [29, 30].

metin, ekran görüntüsü, diyagram, sayı, numara içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 3. Complexity matrix used in the classification

The preferred performance criteria in artificial intelligence algorithms are the metrics used to evaluate the performance of their models. These criteria are used to analyze how well a artificial intelligence algorithms performs a task, evaluate it, and determine the areas that need to be improved [31, 32]. There are several commonly used performance criteria:

Accuracy (AC): It measures the ratio of correctly classified examples in a dataset. It is a commonly used performance metric for classification problems [27].

(1)

Precision (P): It measures the ratio of true positives among the examples predicted as positive. It is a commonly used performance metric in cases where false positives are costly [27, 33].

(2)

Recall (R): t measures the rate at which true positives are correctly identified. It is a performance metric commonly used in situations where false negatives are costly [28].

(3)

F1-Score (F): It combines sensitivity and recall rate into a single metric. It is often used in situations where both sensitivity and recall are important [28, 31].

(4)

  1. Cross Validation

Cross-validation is a statistical technique used to evaluate the performance of artificial intelligence algorithms. It allows splitting the available data into two parts: a training set and a validation set. The model is then trained on the training set and tested on the validation set. The performance of the model is evaluated based on how well it performs on the validation set. The most commonly used form of cross-validation is k-fold cross-validation. The available data is divided into k equal-sized parts, as seen in Figure 4 [34, 35]. The classification model is trained on k-1 parts and tested on the remaining part. This process is repeated k times, and each part serves as the validation set once. The results obtained after each iteration are summed, and the average is taken to provide a general estimate of the model's performance. Cross-validation provides a more accurate estimate of how well the model can be applied to new data [36-40].

metin, ekran görüntüsü, yazı tipi içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 4. The complexity matrix used in the classification of the dataset

  1. Development of Artificial Intelligence Modelling
  1. Logistic Regression (LR)

LR model tree is a classification model formed by combining DT and LR learning, with an associated supervised learning algorithm. It predicts a numerical value for a sample defined over a fixed set of numerical or nominal attributes. It resembles ordinary regression trees but differs in that it constructs a piecewise linear approximation to the target function [41-43]. After all nominal attributes are replaced with binary ones, an unpruned regression tree is grown using variance reduction as the splitting criterion. Once the linear models are fitted, sub-trees are accepted for replacement based on the final error prediction for each linear model. A schematic diagram of the LR model is provided in Figure 5 [42, 44, 45].

metin, ekran görüntüsü, diyagram, tasarım içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 5. LR AI Model

  1. Support Vector Machine (SVM)

SVM is a supervised learning algorithm commonly used to recognize patterns in complex datasets and resolve complexities [27]. It performs discriminative classification by learning through examples, as seen in Figure 6, to predict classifications of previously unseen data [46]. It is particularly employed in various fields such as text classification, image recognition, and handwritten digit recognition. The algorithm is applied with three basic input parameters: a training dataset consisting of a series of class labels for training data, and a test dataset. There are several factors contributing to the popularity of the SVM algorithm. It scales well to relatively large datasets and utilizes a wide range of function classes through what are called kernel functions [27, 47-49].

metin, ekran görüntüsü, diyagram, yazı tipi içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 6. SVM AI Model.

  1. Decision Tree (DT)

DT induction algorithm has become increasingly popular in recent years. Reviews of the algorithm were conducted by Murthy in 1998 prior to its modeling. Due to its ease of implementation and relatively easy understanding compared to other classification algorithms, it is one of the commonly used algorithms for classification. Since it is constructed with given data, the value and characteristics of the data play a significant role. Before developing the DT, a target variable is determined for the suitability to be achieved by the user [50-53].

The amount of data to be used for classification, as seen in Figure 7, will also affect its outcome by altering the structure of tree construction. Typically, training data are present in a large portion of the dataset and are used to build the tree structure. The more training data collected, the higher the accuracy of the results. Depending on the test result, the data is divided into two or more subsets [54-56].

diyagram içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 7. DT AI Model.

  1. Results and Discussion

In this section, the classification successes and performance evaluations of LR, SVM, DT models, which are shown in Figure 1, obtained as a result of training with artificial intelligence algorithms are presented. The classification results of these models are compared. The accuracy rates of 3 different models obtained depending on the output classes of the models are given in the graph in Figure 8.

In the tables given to express the result evaluation metrics of the classification processes, the value “1” is the output classes named as positive and the value “0” is the output classes named as negative. Under the headings, confusion matrix structures are given and the relevant data are interpreted before and after each table.

  1. Classification Results Made with LR Model

When Table 3 is examined, it is seen that 158 true positive data belonging to the class labeled with the value “1” and 205 true negative data belonging to the class labeled with the value “0” are correctly classified. When TP, TN, FP and FN data are analyzed, it is seen that the classification prediction success of the model is high. According to the table, 11 data belonging to the “1” class and 10 data belonging to the “0” class were misclassified by the artificial intelligence algorithm.  This is most likely due to the number of features in the input parameters. The number of incorrectly predicted data is quite low compared to the number of correctly predicted data. It can also be said that the classes are confused with each other due to the fact that the data of the classes are very close to each other.

Table 3. The confusion matrix table of the classification made by the LR model

 

Actual Class

1

0

Predicted Class

1

158

11

0

10

205

The accuracy rates of the LR model, whose classification values are given in Table 3, were calculated according to the confusion matrix. When Figure 8 is examined, it is seen that it has a classification success rate of 94.53% accuracy. When the comparison is made by considering the literature, it can be said that the classification success of the model is high.

  1. Classification Results Made with SVM Model

When Table 4 is examined, it is seen that 152 real positive data belonging to the class named with the value “1” and 205 real negative data belonging to the class named with the value “0” are classified correctly. When the TP, TN, FP and FN data are examined, it is seen that the classification prediction success of the model is high. According to the table, 17 data belonging to the “1” class and 10 data belonging to the “0” class were misclassified by the artificial intelligence algorithm. This is most likely due to the number of features in the input parameters. The number of incorrectly predicted data is low compared to the number of correctly predicted data. It can also be said that the classes are confused with each other due to the fact that the data of the classes are very close to each other.

Table 4. The confusion matrix table of the classification made by the SVM model

 

Actual Class

1

0

Predicted Class

1

152

17

0

10

205

The accuracy rates of the SVM model, whose classification values are given in Table 4, were calculated according to the confusion matrix. When Figure 8 is examined, it is seen that it has a classification success rate of 92.96% accuracy. When the comparison is made by considering the literature, it can be said that the classification success of the model is also high.

  1. Classification Results Made with Decision Tree Model

When Table 5 is examined, it is seen that 131 true positive data belonging to the class labeled with the value “1” and 186 true negative data belonging to the class labeled with the value “0” are correctly classified. When TP, TN, FP and FN data are analyzed, it is seen that the classification prediction success of the model is not very high. According to the table, 38 data belonging to the “1” class and 29 data belonging to the “0” class were misclassified by the artificial intelligence algorithm.  This is due to the number of features in the input parameters. Although the number of incorrectly predicted data is low compared to the number of correctly predicted data, it is desirable to have a lower number of incorrect predictions. In addition, it can also be said that the classes are confused with each other due to the close proximity of the data of the classes and the structure of the model.

Table 5. The confusion matrix table of the classification made by the DT model

 

Actual Class

1

0

Predicted Class

1

131

38

0

29

186

The accuracy rates of the Bagging model, whose classification values are given in Table 5, were calculated according to the confusion matrix. When Figure 8 is examined, it is seen that it has a classification success rate of 82.55% accuracy. When a comparison is made taking into account the literature, it can be said that the classification success of the model is at an acceptable level, but not at a very high level.

  1. Classification Results and Evaluation Graphs of Models

The accuracy, precision, recall and F1 Score values of each model were obtained by using the confusion matrix data of all models. The classification result values are shown graphically in Table 6 and the data belonging to the models are shown graphically in Figure 8.

Table 6. Performance rates tables of the study

Algorithm

Accuracy (%)

Precision (%)

Recall (%)

F1-Score (%)

LR

94.53

94.50

94.50

94.50

SVM

92.96

93.00

93.00

93.00

DT

82.55

82.50

82.60

82.50

metin, ekran görüntüsü, silindir içeren bir resim

Açıklama otomatik olarak oluşturuldu

Figure 8. Performance rates and graphs of the study.

When the results in Figure 8 are analyzed, it is seen that the LR model has the highest classification success. The model with the lowest classification success is DT. Again according to Table 6, in parallel with the classification success values, the model with the highest values in metrics such as precision, recall and f1-score is the LR model. Again, the model with the lowest values in these metrics is the DT model.

  1. Conclusion

In this study, classification processes were carried out with different artificial intelligence algorithms in order to determine the sensory tendencies of people towards the environment. There are 37 features and two classes in the dataset used. Using the data obtained by us from 384 people, classification processes were performed with LR, SVM and DT algorithms. Cross validation algorithms was used for training and testing. According to the accuracy values in Figure 8, the LR model achieved 94.53% classification success, SVM model achieved 92.96% classification success and DT model achieved 82.55% classification success. It is seen that the classification success of the models is acceptable when compared with the literature.

In order to achieve higher classification success, the number of data in the dataset can be increased. In addition, equal distribution of data between classes will also increase the classification success. In addition, the use of different artificial intelligence algorithms will provide changes in classification success.

This study contributes significantly to the existing body of knowledge in several aspects. Methodologically, it demonstrates the effective utilization of artificial intelligence algorithms in assessing environmental sensitivities. The performance of different algorithms has been compared to determine their suitability. The obtained high classification success rates strongly advocate for the effective application of such methods in domains such as environmental sensitivity analysis.

These findings hold potential implications for the development of environmental policies and educational programs. Specifically, they can inform the design of strategies aimed at enhancing environmental awareness and sustainability practices.

Moreover, the results underscore that increasing the size of datasets and ensuring balanced class distributions can notably improve classification success rates. This suggests avenues for future research to explore how the choice of artificial intelligence algorithms influences classification outcomes.

In conclusion, this research highlights the promise of artificial intelligence techniques in the analysis of environmental sensitivities and underscores their relevance across various research domains.

Data Availability

The data used to support the findings of this study are available on the
https://nigmetkoklu.com/dataset/environmental_attitude_dataset.zip

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Ethics statement

This is to confirm that the relevant informed consent was obtained from the respondents. During the distribution of questionnaire, respondents were explicitly informed of the voluntary nature of their participation. They were also made aware that the data provided would be exclusively used for academic purposes, specifically for the dissemination of findings through publication in scholarly journals. Furthermore, prior to initiating the data collection process, ethical approval from the Necmettin Erbakan University Social and Behavioral Sciences Institutional Review Board Ethics Committee was granted. The ethical approval bears the protocol number 2023-02.

References

  1. Singh, R.L., Singh, P.K. (2017). Global Environmental Problems. In: Singh, R. (eds) Principles and Applications of Environmental Biotechnology for a Sustainable Future. Applied Environmental Science and Engineering for a Sustainable Future. Springer, Singapore. https://doi.org/10.1007/978-981-10-1866-4_2
  2. Kılıç, Ç. (2018). 2-MEV (iki faktörlü çevresel değerler modeli) tutum ölçeğinin Türkçeye adaptasyonu. Yüksek Lisans Tezi, Gazi Üniversitesi Eğitim Bilimleri Enstitüsü, Ankara
  3. Bradshaw, C. J., & Brook, B. W. (2014). Human population reduction is not a quick fix for environmental problems. Proceedings of the National Academy of Sciences111(46), 16610-16615. https://doi.org/10.1073/pnas.1410465111
  4. Kaypak, Ş. (2011). Küreselleşme sürecinde sürdürülebilir bir kalkınma için sürdürülebilir bir çevre. Karamanoğlu Mehmetbey Üniversitesi Sosyal ve Ekonomik Araştırmalar Dergisi, 2011(1), 19-33.
  5. Schlüter, M., Haider, L. J., Lade, S. J., Lindkvist, E., Martin, R., Orach, K., ... & Folke, C. (2019). Capturing emergent phenomena in social-ecological systems. Ecology and Society24(3). https://www.jstor.org/stable/26796977
  6. Doğan, F., & Keleş, Y. (2020). Environmental awareness and environmental behavior in secondary and high school students. NEÜ Ereğli Eğitim Fakültesi Dergisi, 2(1), 80-90.
  7. Vaughan, C., Gack, J., Solorazano, H. & Ray, R. (2003). The effect on environmental education on schoolchildren, their parents, and community members: a study of intergenerational and intercommunity learning. The Journal of Environmental Education. 34(3), 12-21 https://doi.org/10.1080/00958960309603489
  8. Esteban Ibáñez M, Musitu Ferrer D, Amador Muñoz LV, Claros FM, Olmedo Ruiz FJ. University as Change Manager of Attitudes towards Environment (The Importance of Environmental Education). Sustainability. 2020; 12(11):4568. https://doi.org/10.3390/su12114568
  9. Öztürk, S., Tekin, N. & Yılmaz, S. (2023). The Opinions of Eighth Grade Students on Renewable Energy Sources in their Environment: A Local Perspective. Ahmet Keleşoğlu Eğitim Fakültesi Dergisi (AKEF) Dergisi, 5(2), 474-487. https://doi.org/10.38151/akef.65
  10. Nurwidodo, N., Amin, M., Ibrohim, I., & Sueb, S. (2020). The role of eco-school program (adiwiyata) towards environmental literacy of high school students. European Journal of Educational Research, 9(3), 1089-1103. https://doi.org/10.12973/eu-jer.9.3.1089
  11. Uralovich, K. S., Toshmamatovich, T. U., Kubayevich, K. F., Sapaev, I., Saylaubaevna, S. S., Beknazarova, Z., & Khurramov, A. (2023). A primary factor in sustainable development and environmental sustainability is environmental education. Caspian Journal of Environmental Sciences21(4), 965-975. doi: 10.22124/cjes.2023.7155
  12. Stevenson, R. B. (2007). Schooling and environmental education: contradictions in purpose and practice. Environmental Education Research, 13(2), 139–153. https://doi.org/10.1080/13504620701295726
  13. Greenwald, A. G., & Banaji, M. R. (1995). Implicit social cognition: Attitudes, self-esteem, and stereotypes. Psychological Review, 102(1), 4–27. https://doi.org/10.1037/0033-295X.102.1.4
  14. Oskamp, S., & Schultz, P.W. (2005). Attitudes and Opinions (3rd ed.). Psychology Press. https://doi.org/10.4324/9781410611963
  15. Uzun, N. ve Sağlam, N. (2006). “Ortaöğretim Öğrencileri İçin Çevresel Tutum Ölçeği Geliştirme ve Geçerliliği.” Hacettepe Üniversitesi Eğitim Fakültesi Dergisi, 30, 240-250.
  16. Akçay, S. & Şengül, H. (2023). A Study on Environmental Literacy of Middle School Students. Ahmet Keleşoğlu Eğitim Fakültesi Dergisi (AKEF) Dergisi, 5(1), 139-169. https://doi.org/10.38151/akef.2023.48
  17. Rickinson, M. (2001). Learners and Learning in Environmental Education: A critical review of the evidence. Environmental Education Research, 7(3), 207–320. https://doi.org/10.1080/13504620120065230
  18. Erten, S. (2005). Okul öncesi öğretmen adaylarında çevre dostu davranışların araştırılması. Hacettepe Üniversitesi Eğitim Fakültesi Dergisi, 28, 91-100
  19. Ajzen, I., Fishbein, M., Lohmann, S., & Albarracín, D. (2018). The influence of attitudes on behavior. The handbook of attitudes, volume 1: Basic principles, 197-255. https://doi.org/10.4324/9781315178103
  20. Sukma, E., Ramadhan, S., & Indriyani, V. (2020, March). Integration of environmental education in elementary schools. In Journal of Physics: Conference Series (Vol. 1481, No. 1, p. 012136). IOP Publishing. https://doi.org/10.1088/1742-6596/1481/1/012136
  21. Bergman, B. G. (2015). Assessing impacts of locally designed environmental education projects on students’ environmental attitudes, awareness, and intention to act. Environmental Education Research, 22(4), 480–503. https://doi.org/10.1080/13504622.2014.999225
  22. Athman, J. A., & Monroe, M. C. (2001). Elements of Effective Environmental Education Programs.
  23. Aydoğan, Y., Okyay, Ö., & Aksut, P. How to Raise the Environmental Awareness in Children. Environment and Ecology at the Beginning of 21st Century, 700.
  24. Palmer, J. (2002). Environmental education in the 21st century: Theory, practice, progress and promise. Routledge.
  25. Yadav, S. K., Banerjee, A., Jhariya, M. K., Meena, R. S., Raj, A., Khan, N., ... & Sheoran, S. (2022). Environmental education for sustainable development. In Natural Resources Conservation and Advances for Sustainability (pp. 415-431). Elsevier. https://doi.org/10.1016/B978-0-12-822976-7.00010-7
  26. Taspinar, Y. S., Cinar, I., & Koklu, M. (2021). Prediction of computer type using benchmark scores of hardware units. Selcuk University Journal of Engineering Sciences, 20(1), 11-17.
  27. Vapnik, V. (1999). The nature of statistical learning theory. Springer science & business media. https://doi.org/10.1007/978-1-4757-3264-1
  28. Powers, D. M. (2020). Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv preprint arXiv:2010.16061. https://doi.org/10.48550/arXiv.2010.16061
  29. Sun, A., & Lim, E.-P. (2001). Hierarchical text classification and evaluation. Proceedings 2001 IEEE International Conference on Data Mining, https://doi.org/10.1109/ICDM.2001.989560
  30. Tang, T. A., Mhamdi, L., McLernon, D., Zaidi, S. A. R., & Ghogho, M. (2018). Deep recurrent neural network for intrusion detection in sdn-based networks. 2018 4th IEEE Conference on Network Softwarization and Workshops (NetSoft), https://doi.org/10.1109/NETSOFT.2018.8460090
  31. Sokolova, M., Japkowicz, N., Szpakowicz, S. (2006). Beyond Accuracy, F-Score and ROC: A Family of Discriminant Measures for Performance Evaluation. In: Sattar, A., Kang, Bh. (eds) AI 2006: Advances in Artificial Intelligence. AI 2006. Lecture Notes in Computer Science(), vol 4304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11941439_114
  32. Ferri, C., Hernández-Orallo, J., & Modroiu, R. (2009). An experimental comparison of performance measures for classification. Pattern recognition letters, 30(1), 27-38. https://doi.org/10.1016/j.patrec.2008.08.010
  33. Dwivedi, A. K. (2018). Performance evaluation of different machine learning techniques for prediction of heart disease. Neural Computing and Applications, 29, 685-693. https://doi.org/10.1007/s00521-016-2604-1
  34. Browne, M. W. (2000). Cross-validation methods. Journal of mathematical psychology, 44(1), 108-132. https://doi.org/10.1006/jmps.1999.1279
  35. Anguita, D., Ghelardoni, L., Ghio, A., Oneto, L., & Ridella, S. (2012). The'K'in K-fold Cross Validation. ESANN,
  36. Berrar, D. (2019). Cross-Validation. . https://doi.org/10.1016/B978-0-12-809633-8.20349-X
  37. Kohavi, R., & John, G. H. (1997). Wrappers for feature subset selection. Artificial intelligence, 97(1-2), 273-324. https://doi.org/10.1016/S0004-3702(97)00043-X
  38. Shao, J. (1993). Linear Model Selection by Cross-validation. Journal of the American Statistical Association, 88(422), 486–494. https://doi.org/10.1080/01621459.1993.10476299
  39. Raschka, S. (2018). Model evaluation, model selection, and algorithm selection in machine learning. arXiv preprint arXiv:1811.12808. https://doi.org/10.48550/arXiv.1811.12808
  40. Ramezan C, A. Warner T, E. Maxwell A. Evaluation of Sampling and Cross-Validation Tuning Strategies for Regional-Scale Machine Learning Classification. Remote Sensing. 2019; 11(2):185. https://doi.org/10.3390/rs11020185
  41. Frank, E., Wang, Y., Inglis, S., Holmes, G., & Witten, I. H. (1998). Using model trees for classification. Machine learning, 32, 63-76. https://doi.org/10.1023/A:1007421302149
  42. Landwehr, N., Hall, M., & Frank, E. (2005). Logistic model trees. Machine learning, 59, 161-205. https://doi.org/10.1007/s10994-005-0466-3
  43. Sharma, P. (2014). Comparative analysis of various decision tree classification algorithms using WEKA. International Journal on Recent and Innovation Trends in Computing and Communication, 3(2), 684-690.
  44. Chan, K.-Y., & Loh, W.-Y. (2004). LOTUS: An algorithm for building accurate and comprehensible logistic regression trees. Journal of Computational and Graphical Statistics, 13(4), 826-852. https://doi.org/10.1198/106186004X13064
  45. Friedman, J., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). The annals of statistics, 28(2), 337-407. https://doi.org/10.1214/aos/1016218223
  46. Cristianini, N., & Shawe-Taylor, J. (2000). An introduction to support vector machines and other kernel-based learning methods. Cambridge university press.
  47. Cervantes, J., Garcia-Lamont, F., Rodríguez-Mazahua, L., & Lopez, A. (2020). A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing, 408, 189-215. https://doi.org/10.1016/j.neucom.2019.10.118
  48. Pavlidis, P., Wapinski, I., & Noble, W. S. (2004). Support vector machine classification on the web. Bioinformatics, 20(4), 586-587. https://doi.org/10.1093/bioinformatics/btg461
  49. Platt, J. (1998). Sequential minimal optimization: A fast algorithm for training support vector machines.
  50. Murthy, S. K. (1998). Automatic construction of decision trees from data: A multi-disciplinary survey. Data mining and knowledge discovery, 2, 345-389. https://doi.org/10.1023/A:1009744630224
  51. Pal, M., & Mather, P. M. (2003). An assessment of the effectiveness of decision tree methods for land cover classification. Remote sensing of environment, 86(4), 554-565. https://doi.org/10.1016/S0034-4257(03)00132-9
  52. Tu, M. C., Shin, D., & Shin, D. (2009). A comparative study of medical data classification methods based on decision tree and bagging algorithms. 2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing, https://doi.org/10.1109/DASC.2009.40
  53. Bansal, M., Goyal, A., & Choudhary, A. (2022). A comparative analysis of K-nearest neighbor, genetic, support vector machine, decision tree, and long short term memory algorithms in machine learning. Decision Analytics Journal, 3, 100071. https://doi.org/10.1016/j.dajour.2022.100071
  54. Aitkenhead, M. J. (2008). A co-evolving decision tree classification method. Expert Systems with Applications, 34(1), 18-25. https://doi.org/10.1016/j.eswa.2006.08.00
  55. Kotsiantis, S. B. (2013). Decision trees: a recent overview. Artificial Intelligence Review, 39, 261-283. https://doi.org/10.1007/s10462-011-9272-4
  56. Yadav, S. K., & Pal, S. (2012). Data mining: A prediction for performance improvement of engineering students using classification. arXiv preprint arXiv:1203.3832.
    https://doi.org/10.48550/arXiv.1203.3832