Uncategorized

When Protein Structure Embedding Meets Large Language Models



In this study, we propose a novel method for designing numerical embeddings in Euclidean space for proteins by leveraging 3D structure information, specifically employing the concept of contact maps. These embeddings are synergistically combined with features extracted from LLMs and traditional feature engineering techniques to enhance the performance of embeddings in supervised protein analysis. Experimental results on benchmark datasets, including PDB Bind and STCRDAB, demonstrate the superior performance of the proposed method for protein function prediction.

Source: Genes Category: Genetics & Stem Cells Authors: Tags: Article Source Type: research



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *