
Please use this identifier to cite or link to this item:
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/14746
Title: | Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction |
Authors: | Rajya Lakshmi, L. |
Keywords: | Computer Science; k-Nearest-Neighbor (k-NN); Random forests; Air Pollution Data Analysis |
Issue Date: | 2023 |
Publisher: | IEEE |
Abstract: | In this paper, we review various k-Nearest-Neighbor (k-NN) based models and their accuracies in order to develop a better model for predicting concentrations of air pollutants. The proposed model first splits the range of target variable values into a number of buckets. Then, a hybrid k-NN model, which combines weighted-attribute k-NN and distance-weighted k-NN and in which the weights are assigned by calculating the Information Gain of each attribute, is used to calculate the target variable value of each test case. The proposed model decreases the root mean square error (RMSE) of predicted NO, NO₂, and NOₓ values by 28.29%, 29.44%, and 16.51%, respectively, compared to the state of the art. Similarly, the mean absolute error (MAE) values for NO, NO₂, and NOₓ are decreased by 18.26%, 33.67%, and 14.54%, respectively, compared to the state of the art. The model gives good results when the buckets are nearly equal in size. |
URI: | https://ieeexplore.ieee.org/document/10216028 http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/14746 |
Appears in Collections: | Department of Computer Science and Information Systems |
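
The pipeline outlined in the abstract (bucket the target range, weight attributes by Information Gain, then combine attribute weighting with distance weighting in k-NN) can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no algorithmic detail, so the function names, the equal-frequency bucketing, the binning of attributes before computing Information Gain, the use of bucket labels as the class variable, and the inverse-distance vote are all assumptions made for illustration. The Random Forests component named in the title is not described in the abstract and is left out.

```python
import numpy as np

def equal_frequency_buckets(values, n_buckets):
    """Assign bucket indices so each bucket holds a nearly equal number of values."""
    edges = np.quantile(values, np.linspace(0, 1, n_buckets + 1)[1:-1])
    return np.searchsorted(edges, values, side="right")

def entropy(labels):
    """Shannon entropy (in bits) of a discrete label array."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

def information_gain(attribute, labels, n_bins=4):
    """Information Gain of one attribute (binned, an assumption) w.r.t. bucket labels."""
    bins = equal_frequency_buckets(attribute, n_bins)
    conditional = sum(
        (bins == b).mean() * entropy(labels[bins == b]) for b in np.unique(bins)
    )
    return entropy(labels) - conditional

def hybrid_knn_predict(X_train, y_train, X_test, k=3, n_buckets=4, eps=1e-9):
    """Attribute-weighted distance plus inverse-distance-weighted neighbor average."""
    # Step 1 (assumed form): bucket the target values and use buckets as classes.
    buckets = equal_frequency_buckets(y_train, n_buckets)
    # Step 2: one weight per attribute, proportional to its Information Gain.
    w = np.array([information_gain(X_train[:, j], buckets)
                  for j in range(X_train.shape[1])])
    w = w / w.sum() if w.sum() > 0 else np.full(len(w), 1.0 / len(w))
    preds = []
    for x in X_test:
        # Weighted Euclidean distance: informative attributes count for more.
        d = np.sqrt((((X_train - x) ** 2) * w).sum(axis=1))
        nearest = np.argsort(d)[:k]
        # Step 3: closer neighbors contribute more to the predicted value.
        inv = 1.0 / (d[nearest] + eps)
        preds.append(float((inv * y_train[nearest]).sum() / inv.sum()))
    return np.array(preds)

def rmse(y_true, y_pred):
    """Root mean square error."""
    return float(np.sqrt(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2)))

def mae(y_true, y_pred):
    """Mean absolute error."""
    return float(np.mean(np.abs(np.asarray(y_true) - np.asarray(y_pred))))

# Toy data (made up for the example): column 0 drives the target, column 1 is noise.
X_train = np.array([[0, 5], [1, 3], [2, 9], [3, 1],
                    [4, 7], [5, 2], [6, 8], [7, 4]], dtype=float)
y_train = 2.0 * X_train[:, 0]
predictions = hybrid_knn_predict(X_train, y_train, X_train[2:3])
```

Equal-frequency bucketing is used here because the abstract notes that results are good when the buckets are nearly equal in size; the `rmse` and `mae` helpers correspond to the two error measures reported there.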