Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction

dc.contributor.authorRajya Lakshmi, L.
dc.date.accessioned2024-05-07T09:58:46Z
dc.date.available2024-05-07T09:58:46Z
dc.date.issued2023
dc.description.abstractIn this paper, we review various k-Nearest-Neighbor (k-NN) based models and their accuracies to develop a better model to predict concentrations of air pollutants. The proposed model splits the range of target variable values into a number of buckets first. Then, a hybrid k-NN model, which is a combination of weighted attribute k-NN and distance-weighted k-NN, and where the weights are assigned by calculating Information Gain, is used for each attribute, to calculate the target variable value of each test case. The proposed model decreases the root mean square error (RMSE) of predicted NO, NO 2 and NO x values by 28.29%, 29.44%, and 16.51% respectively, compared to the state-of the-art. Similarly, the mean absolute error (MAE) values for NO, NO 2 , and NO x are decreased by 18.26%, 33.67%, and 14.54%, compared to the state-of the-art. This model gives good results when the size of each bucket is nearly equal.en_US
dc.identifier.urihttps://ieeexplore.ieee.org/document/10216028
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/14746
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectComputer Scienceen_US
dc.subjectk-Nearest-Neighbor (k-NN)en_US
dc.subjectRandom forestsen_US
dc.subjectAir Pollution Data Analysisen_US
dc.titleImproved k-NN Regression Model Using Random Forests for Air Pollution Predictionen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: