Please use this identifier to cite or link to this item:
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/20516Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Basu, Sushmita | - |
| dc.date.accessioned | 2026-01-09T11:34:26Z | - |
| dc.date.available | 2026-01-09T11:34:26Z | - |
| dc.date.issued | 2025-09 | - |
| dc.identifier.uri | https://www.sciencedirect.com/science/article/pii/S002228362500302X | - |
| dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/20516 | - |
| dc.description.abstract | Interactions between proteins and lipids are crucial for numerous cellular processes. Some of the lipid interacting segments in protein sequences are intrinsically disordered regions (IDRs), which may gain secondary structures upon binding. We collected experimentally annotated lipid-interacting IDRs, named membrane molecular recognition features (MemMoRFs). We used this dataset to develop and test an accurate and relatively fast sequence-based MemMoRF predictor, pLMMoRF, thereby supporting tedious and costly experimental identification of MemMoRFs. Our predictor utilizes a protein language model (pLM) which we processed to generate inputs to a deep convolutional neural network. We considered various pLMs (ESM-2, ProstT5, ProtT5 and Ankh) and applied feature selection to reduce their outputs, creating a more compact neural network model. pLMMoRF leverages the Ankh-based model, selected for its higher accuracy compared to our other models. Tests on low similarity test datasets demonstrate that pLMMoRF is more accurate than the sole current predictor of MemMoRFs, CoMemMoRFPred. Moreover, pLMMoRF has a relatively small computational footprint because of the compact network size and use of dedicated GPU nodes. This allowed us to make MemMoRF predictions for the human proteome. We analyzed these predictions and made them publicly available, facilitating an improved understanding of functions of membrane-coupled proteins. Our work underscores the importance of selecting key embedding features to enhance predictive performance and reduce computational footprint of sequence-based predictors of protein functions. The web server for the pLMMoRF predictor and the predictions for human proteins | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | Elsevier | en_US |
| dc.subject | Biology | en_US |
| dc.subject | Membrane interacting molecular recognition feature | en_US |
| dc.subject | Intrinsically disordered protein regions | en_US |
| dc.subject | Machine learning (ML) | en_US |
| dc.subject | Protein language model | en_US |
| dc.title | pLMMoRF: A web server that accurately predicts membrane-interacting molecular recognition features by employing a protein language model | en_US |
| dc.type | Article | en_US |
| Appears in Collections: | Department of Biological Sciences | |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.