DSpace logo

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/14746
Title: Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction
Authors: Rajya Lakshmi, L.
Keywords: Computer Science
k-Nearest-Neighbor (k-NN)
Random forests
Air Pollution Data Analysis
Issue Date: 2023
Publisher: IEEE
Abstract: In this paper, we review various k-Nearest-Neighbor (k-NN) based models and their accuracies to develop a better model to predict concentrations of air pollutants. The proposed model splits the range of target variable values into a number of buckets first. Then, a hybrid k-NN model, which is a combination of weighted attribute k-NN and distance-weighted k-NN, and where the weights are assigned by calculating Information Gain, is used for each attribute, to calculate the target variable value of each test case. The proposed model decreases the root mean square error (RMSE) of predicted NO, NO 2 and NO x values by 28.29%, 29.44%, and 16.51% respectively, compared to the state-of the-art. Similarly, the mean absolute error (MAE) values for NO, NO 2 , and NO x are decreased by 18.26%, 33.67%, and 14.54%, compared to the state-of the-art. This model gives good results when the size of each bucket is nearly equal.
URI: https://ieeexplore.ieee.org/document/10216028
http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/14746
Appears in Collections:Department of Computer Science and Information Systems

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.