BITS Faculty Publications

Permanent URI for this communityhttp://localhost:4000/handle/123456789/1867

Browse

Search Results

Now showing 1 - 10 of 24
  • Item
    Medical image segmentation using advanced UNETt: VMSE-Unet and VM-Unet CBAM+
    (2025-07) Chalapathi, G.S.S.
    In this paper, we present the VMSE U-Net and VM-Unet CBAM+ model, two cutting-edge deep learning architectures designed to enhance medical image segmentation. Our approach integrates Squeeze-and-Excitation (SE) and Convolutional Block Attention Module (CBAM) techniques into the traditional VM U-Net framework, significantly improving segmentation accuracy, feature localization, and computational efficiency. Both models show superior performance compared to the baseline VM-Unet across multiple datasets. Notably, VMSEUnet achieves the highest accuracy, IoU, precision, and recall while maintaining low loss values. It also exhibits exceptional computational efficiency with faster inference times and lower memory usage on both GPU and CPU. Overall, the study suggests that the enhanced architecture VMSE-Unet is a valuable tool for medical image analysis. These findings highlight its potential for real-world clinical applications, emphasizing the importance of further research to optimize accuracy, robustness, and computational efficiency.
  • Item
    Transformers for vision: a survey on innovative methods for computer vision
    (IEEE, 2025-05) Kumar, Dhruv; Chalapathi, G.S.S.
    Transformers have emerged as a groundbreaking architecture in the field of computer vision, offering a compelling alternative to traditional convolutional neural networks (CNNs) by enabling the modeling of long-range dependencies and global context through self-attention mechanisms. Originally developed for natural language processing, transformers have now been successfully adapted for a wide range of vision tasks, leading to significant improvements in performance and generalization. This survey provides a comprehensive overview of the fundamental principles of transformer architectures, highlighting the core mechanisms such as self-attention, multi-head attention, and positional encoding that distinguish them from CNNs. We delve into the theoretical adaptations required to apply transformers to visual data, including image tokenization and the integration of positional embeddings. A detailed analysis of key transformer-based vision architectures such as ViT, DeiT, Swin Transformer, PVT, Twins, and CrossViT are presented, alongside their practical applications in image classification, object detection, video understanding, medical imaging, and cross-modal tasks. The paper further compares the performance of vision transformers with CNNs, examining their respective strengths, limitations, and the emergence of hybrid models. Finally, current challenges in deploying ViTs, such as computational cost, data efficiency, and interpretability, and explore recent advancements and future research directions including efficient architectures, self-supervised learning, and multimodal integration are discussed.
  • Item
    Macro and micronutrient based soil fertility zonation using fuzzy logic and geospatial techniques
    (Springer Nature, 2025-07) Srinivas, Rallapalli; Chalapathi, G.S.S.; Singh, Amit Rajnarayan
    Modeling the spatial variability and uncertainty of soil fertility parameters is crucial for sustainable agriculture but remains a challenge due to complex interactions between soil properties. Traditional models often assess individual parameters, such as pH or nitrogen (N), without considering their combined influence and uncertainty. This study develops a fuzzy logic and geoinformatics-based approach to simultaneously assess multiple soil fertility parameters. The model integrates 80 fuzzy rules to evaluate macro- and micronutrients, incorporating 250 soil samples analyzed using the PUSA Soil Test and Fertilizer Recommendation Meter (STFR). Experimental results showed soil fertility parameter ranges: pH (7.46–8.26), ECe (0.267–0.807 dS m−1), organic carbon (0.24–0.56%), N (85.56–146.32 kg ha−1), P (21.99–34.28 kg ha−1), K (116.41–156.16 kg ha−1), S (5.60–20.86 mg kg−1), Fe (1.065–5.095 mg kg−1), Mn (2.058–2.637 mg kg−1), Zn (0.748–1.105 mg kg−1), B (0.372–0.530 mg kg−1), and Cu (0.230–0.788 mg kg−1). The fuzzy model-derived fertility scores ranged from 41.55 to 52.60, with pH, organic carbon, nitrogen, phosphorus, potassium, and iron as critical parameters influencing fertility. Geostatistical kriging interpolation estimated fertility values at unsampled locations, generating a continuous, high-resolution soil fertility map for precision agriculture. Validation with crop yield data ranked suitability as: Pearl millet (0.919) > Mustard (0.890) > Wheat (0.863) > Barley (0.861). Multi-criteria decision analysis confirmed pearl millet as the most suitable crop based on fertility and yield potential. The study categorizes soil into low and moderate fertility zones across Jhunjhunu, Rajasthan, ensuring a systematic assessment for optimal nutrient management. By integrating fuzzy logic with GIS-based spatial modeling, this study enhances soil fertility classification, site-specific nutrient recommendations, and sustainable crop planning, reinforcing the role of fuzzy-GIS frameworks in precision agriculture.
  • Item
    Enhancing data privacy: a comprehensive survey of privacy-enabling technologies
    (IEEE, 2025-02) Chalapathi, G.S.S.
    Privacy is a fundamental human right, especially crucial in our modern digital age. With the rapid advancement of technology, ensuring individuals’ privacy has become increasingly complex. Our survey paper aims to shed light on various privacy engineering technologies that play a crucial role in protecting personal data. We delve into four key areas: data anonymization, data encryption, synthetic data generation, and differential privacy. These technologies serve as essential tools in safeguarding online privacy. Data anonymization, for instance, includes removing or modifying identifiable information from datasets to protect individuals’ identities. Encryption secures data by converting it into a code that can only be decoded by authorized parties. Synthetic data generation creates artificial data that closely resembles real data but doesn’t contain any identifiable information. Differential privacy adds a small amount of controlled noise to protect sensitive information. Throughout our exploration, we not only explain the principles and techniques behind these technologies but also the tools used for each of these techniques and evaluation criteria and also examine their practical applications. By understanding their strengths, limitations, and real-world implementations, we gain valuable insights into how they contribute to the broader goal of ensuring privacy in our digital world.
  • Item
    From information overload to lucidity: a survey on leveraging gpts for systematic summarization of medical and biomedical artifacts
    (IEEE, 2024-12) Chalapathi, G.S.S.; Singh, Amit Rajnarayan
    In medical research, the rapid proliferation of condition-specific studies has led to an information overload, making it challenging for researchers and practitioners to stay abreast of the latest findings. This paper presents a comprehensive survey on leveraging Generative Pretrained Transformers (GPTs) to summarize medical and biomedical artifacts systematically. We delve into the current applications of GPTs in this domain, discussing their role in understanding and summarizing research papers, medical dialogues, and medical records. Through a comparative analysis of recent studies and methodologies, we highlight the effectiveness of GPTs in distilling complex medical information into concise, understandable summaries. Our survey underscores the potential of GPTs as a tool for navigating the information overload in medical research and bringing clarity to healthcare professionals. This transformation will enhance patient care and outcomes, such as improving the accessibility and comprehensibility of medical research, assisting in rapid information retrieval, and facilitating the summarization of complex medical studies for broader audiences.
  • Item
    Hardware-Based Implementation of Target Tracking in Unmanned Aerial Vehicles (UAVs)
    (IEEE, 2023) Chalapathi, G.S.S.
    Unmanned Aerial Vehicles (UAVs) have gained sig-nificant attention in various fields, including surveillance, search and rescue, and monitoring applications. One important application for UAV s is target tracking, which requires detecting and tracking a specific object of interest in real time. This paper surveys work done so far in the area of target-tracking in UAV s. It then presents a comprehensive hardware-based framework for target tracking in UAV s. This work utilizes the State-of-the-Art YOLOv8 (You Only Look Once) algorithm for target detection, an efficient high-speed target tracking model, and a Proportional Derivative (PD) control algorithm for precise drone movement control. YOLOv8 provides fast, accurate, and real-time detection of the object of interest, allowing the UAV to detect and identify the target object quickly and reliably. Subsequently, a robust tracking algorithm tracks the identified object across consecutive frames, ensuring accurate localization and trajectory estimation. Furthermore, a PD control algorithm is integrated into the system to enable precise and smooth drone movement. The proposed framework is integrated and used for target tracking in UAV s. Further, this framework is implemented on a UAV. The results demonstrate the effectiveness and robustness of the proposed framework, showcasing its potential for real-world applications.
  • Item
    Singaporean Conversational English-Malay Code-Switching Speech: An Analysis Based on Code-switching Points and Part -of-Speech
    (IEEE, 2023) Chalapathi, G.S.S.
    This paper investigates various code-switching prop-erties of conversational speech from bilingual English-Malay Singaporean speakers with data obtained from the National Speech Corpus (NSC) and provides baseline language models for various combinations between English-Malay monolingual and codeswitching transcripts. Specifically, the study analyzed the correlation between code-switching patterns and (i) trigger words and code-switched word pairs at code-switching points, and (ii) wordwise POS and pairwise POS tags. Our analysis shows there is a certain set of words that frequently “triggered” code-switching behavior, and speakers tend to code-switch more frequently around nouns. Additionally, we provide perplexities for language models built on the selected datasets. These perplexities could serve as baselines for future language models for Singaporean speech, especially, English-Malay code-switch speech.
  • Item
    Non-Fungible Tokens (NFTs)—Survey of Current Applications, Evolution, and Future Directions
    (IEEE, 2023-12) Chalapathi, G.S.S.
    Non-fungible tokens (NFTs) have become an exciting technology that provides a fresh perspective on asset ownership, provenance, and value exchange. NFTs, a blockchain-based technology, are distinct and indivisible cryptographic tokens used to confirm and record the ownership of digital and physical assets in an immutable and transparent way. The fundamental block of NFT is a smart contract built on a blockchain network. This contract contains specific information about the asset it represents, such as its unique identifier, metadata, and ownership details. The information is kept private and tamper-proof due to the decentralized and distributed structure of the blockchain, boosting faith in the token’s authenticity. The NFT is gaining popularity, but it is still in the developing stage. There is a need for a comprehensive survey to guide future research and development in NFTs. Thus, this paper presents the technical components of NFTs, their features, and the minting process. Further, this survey paper describes different token standards for NFTs. It presents various applications of NFTs in healthcare, supply chain, gaming, identity verification, agriculture, intellectual property, smart cities, charity and donation, and education. The article also emphasizes the significant difficulties faced currently in implementing NFT technology from the viewpoints of ownership, governance, and property rights, as well as security, privacy, and environmental effects. This work also elucidates the future directions to overcome the challenges in adopting NFTs in various applications.
  • Item
    LoRa-Based Wireless Sensor Network Testbed for Precision Agriculture Application
    (IEEE, 2024) Chalapathi, G.S.S.
    Over the past few years, Wireless Sensor Network (WSN) has seen many improvements, and there are various applications of WSNs in various domains. Most communication technologies have a trade-off between distance and power consumption, i.e., to reach a longer distance, high power is consumed. LoRa overcomes this problem by consuming less power and transmitting small data packets for a long distance. The objective of this research work is to use LoRa technology in Precision Agriculture applications to help the stakeholders in better decision-making. A small experimental testbed is set up for precision agriculture applications. This testbed had sensors to monitor pH, soil moisture, soil temperature, and NPK parameters. An Automatic Weather Station (AWS) is set up to monitor ambient weather parameters-temperature, humidity, rainfall, wind speed and direction, barometric pressure, solar radiation, and leaf wetness. These sensor parameters were collected at the LoRa Gateway and forwarded to a network server hosting the The Things Network (TTN) LoRa stack. Transmission statistics are collected and analyzed for this application for remote monitoring of agricultural farms for quick and efficient decision-making.
  • Item
    Multi-Objective MDP-Based Routing in UAV Networks for Search-Based Operations
    (IEEE, 2024-05) Chalapathi, G.S.S.; Chamola, Vinay
    Unmanned aerial vehicle (UAV) systems have gained widespread recognition due to their versatility and autonomy. Their deployment for disaster mitigation and management operations is seen as one of their most important applications over the past decade. In such UAV networks, routing plays a crucial role in determining network performance parameters such as network lifetime, data transmission latency, and packet delivery ratio. This paper presents a novel routing mechanism - Multi-Objective Markov Decision Based Routing (MOBMDP) for UAV networks carrying out search-based operations. MOBMDP models routing decisions in a Markov Decision Process (MDP) framework and uses Q-learning to take decisions. It compares routing paths using three metrics, viz., Remaining Energy of the Minimum Energy Node (REMEN), Power Distance ratio (PD), and Expected Delay (ED). Amongst these metrics, PD is a novel metric proposed by this work. PD simultaneously optimizes the energy efficiency and energy distribution in the network. Further, MOBMDP uses a novel reinforcement learning inspired method to estimate transmission delay in a given path. Intensive simulation studies compare MOBMDP to four state-of-the-art routing protocols. Results show a significant improvement in network lifetime, packet delivery ratio, energy efficiency, average data transmission delay, and error in delay estimation.