TY - JOUR
T1 - Novel “extended sequons” of human N-glycosylation sites improve the precision of qualitative predictions
T2 - an alignment-free study of pattern recognition using ProtDCal protein features
AU - Ruiz-Blanco, Yasser B.
AU - Marrero-Ponce, Yovani
AU - García-Hernández, Enrique
AU - Green, James
N1 - Publisher Copyright:
© 2016, Springer-Verlag Wien.
PY - 2017/2/1
Y1 - 2017/2/1
N2 - N-Glycosylation is a common post-translational modification that plays an important role in the proper folding and function of many proteins. This modification is largely dependent on the presence of a sequence motif called a “sequon” defined as Asn-Xxx-Ser/Thr. However, evidence has shown that the presence of such a “sequon” is insufficient to determine the occurrence of N-glycosylation with high precision. This study aims to elucidate patterns that can more accurately predict N-glycosylation sites in human proteins. The novel motifs are evaluated using benchmarking data from 188 organisms. Performance is largely sustained compared to the human data, which validates the robustness of the novel extracted “extended sequons”. We, therefore, introduce new knowledge about sequence-related factors that control N-glycosylation.
AB - N-Glycosylation is a common post-translational modification that plays an important role in the proper folding and function of many proteins. This modification is largely dependent on the presence of a sequence motif called a “sequon” defined as Asn-Xxx-Ser/Thr. However, evidence has shown that the presence of such a “sequon” is insufficient to determine the occurrence of N-glycosylation with high precision. This study aims to elucidate patterns that can more accurately predict N-glycosylation sites in human proteins. The novel motifs are evaluated using benchmarking data from 188 organisms. Performance is largely sustained compared to the human data, which validates the robustness of the novel extracted “extended sequons”. We, therefore, introduce new knowledge about sequence-related factors that control N-glycosylation.
KW - Glycosylation motif
KW - PTM
KW - Post-translational modification
KW - Protein descriptor
UR - http://www.scopus.com/inward/record.url?scp=84997693898&partnerID=8YFLogxK
U2 - 10.1007/s00726-016-2362-5
DO - 10.1007/s00726-016-2362-5
M3 - Artículo
C2 - 27896447
AN - SCOPUS:84997693898
SN - 0939-4451
VL - 49
SP - 317
EP - 325
JO - Amino Acids
JF - Amino Acids
IS - 2
ER -