Protein

Protein accession
A0A125V6U0 [UniProt]
Representative
1lyG
Source
UniProt (cluster: phalp2_1125)
Protein name
Cell surface protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRTIRRVLALGLTLAIFLINVPNVDALTSDTIKGNNIYETAGLIADKKSYDTAIMVNMDNSIADGLSASGLAGAVDAPILLAQKNKIPNETKQRLKNVKKIYIIGKELSISKSVETELKNTGAQVTRLGGDDRIKTSYSVAKEVNGIKKVDEVILTNAYKGEADTISAAPVSVRDIAPIVLTDGKSVPFSTSNVKTYAVGGSISMSTSLVNKTNAKRLGGSDRYDTNKKVIKEFYPDASEFYLSDGYDLVNALTGSTIAKENPIVLVSESSDKSILAGADKITRLGSISDSVYNKCVSAAQNNGDSSTKGESPMKNETSILGQPTASLEACLKWAKSKKANDLFIELIPILYDTAVQEGVNPVLAVAQSAKETGFCNFGGVLDASFKNPCGLKTSVGGSDTDKNAHSRFDTWEEGILAQIQHLCLYAGQDGYPLSNPVDPRHEKSLFGKAKTVESLSNNWAGGQYGQDLVRMMGEIEATK
Physico‐chemical properties
protein length: 480 AA
molecular weight: 51315,6 Da
isoelectric point: 6,86
hydropathy: -0,23
Representative Protein Details
Accession
1lyG
Protein name
1lyG
Sequence length
394 AA
Molecular weight
42703,80870 Da
Isoelectric point
8,55452
Sequence
MPDITQKLLTKGAAHGRTGEPLSAVGVVIHYVGNPGSSAIANRNYFENGSGGNYVSAHYVVGLNGEIIQCVPENERAQHAGKSYAPQYKETAKLNNARYLGIENCHPDSGGKFSDITRKSLVALSADICFRYNFELSAVFRHYDVTGKSCPMYYVNNSGEWTKLKNDIASGVIALMGKTKIVAKLGQSQASVAKSLEEWAKGKNATALFVSLAEKYVKYAPTCGGVNPVVAYCQAAKETAFGRFGGVLNESFKNPCGMKTAAGGGDFDKNAHQKFDSWDDGIKAQLDHLALYAGAEGYPRKDTTDPRHFPEIKGTAATVEALGGKWAGSLYYGQDVVKLSEGIRLAEVITVEDKLLELVKGSSINSPQYWINALKDFKYFDGFVGAMYEKYCGK
Other Proteins in cluster: phalp2_1125
Total (incl. this protein): 9
Avg length: 441,3
Avg pI: 8,16

Protein ID Length (AA) pI
1lyG 394 8,55452
14uhJ 530 9,15311
3d6zE 354 7,28655
3epi1 522 9,48145
3erkf 503 9,42897
4m1ua 529 9,20185
88Ju 350 8,42655
8mIZG 310 5,02027
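The cluster summary can be cross-checked against the member table. The sketch below is only an illustration: it assumes the averages are plain arithmetic means over the eight listed members plus this protein (A0A125V6U0, 480 AA, pI 6,86), and the function name `cluster_summary` is hypothetical.

```python
# Lengths (AA) and pI values copied from the cluster table, plus this protein
# (A0A125V6U0), which the page counts in the total of 9 but does not list.
members = {
    "1lyG": (394, 8.55452),
    "14uhJ": (530, 9.15311),
    "3d6zE": (354, 7.28655),
    "3epi1": (522, 9.48145),
    "3erkf": (503, 9.42897),
    "4m1ua": (529, 9.20185),
    "88Ju": (350, 8.42655),
    "8mIZG": (310, 5.02027),
    "A0A125V6U0": (480, 6.86),
}


def cluster_summary(entries):
    """Return (count, mean length, mean pI) for a dict of id -> (length, pI)."""
    n = len(entries)
    mean_len = sum(length for length, _ in entries.values()) / n
    mean_pi = sum(pi for _, pi in entries.values()) / n
    return n, mean_len, mean_pi


count, avg_len, avg_pi = cluster_summary(members)
print(count, round(avg_len, 1), round(avg_pi, 2))  # 9 441.3 8.16
```

Under that assumption the result agrees with the stated totals (9 members, average length 441,3, average pI 8,16).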
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3828 (6v3ib) 489 29,1% 353 3.799E-42
2 phalp2_5406 (2FYAK) 2 27,4% 332 2.716E-40
3 phalp2_6744 (29vRH) 5 29,7% 366 2.301E-34
4 phalp2_10091 (5D9X) 97 24,7% 360 8.698E-26
5 phalp2_20231 (8lqkG) 12 26,0% 276 3.659E-19
6 phalp2_9221 (6Q2zg) 10 21,6% 300 6.474E-07

Domains

Domains [InterPro]
Representative sequence (used for alignment): 1lyG (394 AA)
Member sequence: A0A125V6U0 (480 AA)
Domain positions follow the representative sequence above (axis: 1 to 394 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510, PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Peptoclostridium phage phiCDIF1296T [NCBI] 1677909 No lineage information
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
CP011968 [NCBI]
CDS location
range 2393609 -> 2395051
strand -
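The CDS range can be checked against the protein length. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the range includes the stop codon; the helper names are hypothetical.

```python
def cds_length(start, end):
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return abs(end - start) + 1


def encoded_residues(nt_length, has_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the final codon is
    treated as a stop codon when has_stop is True.
    """
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    return codons - 1 if has_stop else codons


nt = cds_length(2393609, 2395051)
print(nt, encoded_residues(nt))  # 1443 480
```

Under these assumptions the range spans 1443 nt, i.e. 480 codons plus a stop, which matches the stated protein length of 480 AA.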
CDS
ATGAGAACAATTAGAAGAGTTTTAGCATTAGGTTTAACATTAGCTATATTTCTAATAAATGTACCTAATGTAGATGCATTAACATCAGATACAATTAAAGGTAACAATATATATGAGACGGCTGGATTAATAGCAGATAAAAAGAGTTATGATACAGCAATTATGGTAAATATGGATAATTCTATAGCAGATGGTCTTTCAGCAAGTGGTCTTGCAGGTGCTGTTGATGCTCCAATTTTGCTTGCACAAAAAAATAAGATACCAAATGAAACAAAACAAAGATTAAAAAATGTTAAAAAGATATACATAATAGGTAAAGAATTATCAATAAGTAAGTCAGTAGAAACTGAACTAAAAAACACAGGGGCACAGGTAACTAGATTGGGTGGAGATGACAGAATAAAGACTAGTTATAGTGTTGCTAAAGAGGTAAATGGTATCAAAAAAGTTGATGAAGTGATATTAACTAATGCATATAAAGGAGAAGCAGATACAATAAGTGCTGCACCTGTATCAGTAAGAGATATAGCTCCTATAGTACTTACAGATGGAAAGAGTGTGCCTTTTTCAACAAGTAATGTAAAAACATATGCAGTAGGTGGGAGTATATCAATGAGCACTAGTTTAGTTAATAAAACTAATGCTAAAAGACTTGGTGGTTCAGATAGATATGATACAAATAAGAAAGTTATAAAAGAGTTTTATCCAGATGCATCAGAATTTTATTTGAGTGATGGATATGATTTAGTAAATGCACTTACAGGTTCTACAATTGCTAAGGAAAATCCAATTGTATTGGTGTCAGAAAGTAGTGATAAGTCTATATTAGCAGGAGCAGATAAGATTACTAGATTAGGTTCAATAAGTGACAGTGTGTATAATAAGTGTGTTTCTGCTGCACAAAATAATGGCGATTCGTCTACAAAAGGTGAGTCACCTATGAAAAATGAGACAAGCATATTAGGTCAGCCAACAGCTAGTTTAGAAGCGTGCTTAAAATGGGCAAAATCTAAAAAAGCTAATGATTTATTTATAGAGTTAATACCAATATTATATGATACTGCTGTTCAGGAAGGTGTTAATCCTGTTTTGGCAGTAGCTCAATCTGCAAAAGAAACTGGTTTTTGTAATTTTGGTGGAGTATTAGATGCATCATTTAAGAATCCTTGTGGACTTAAAACTTCTGTAGGTGGTTCTGATACTGATAAGAATGCCCATTCAAGATTTGATACTTGGGAAGAAGGAATATTAGCTCAAATTCAACATTTATGCTTATATGCAGGACAAGATGGTTACCCACTTTCAAATCCAGTAGACCCTAGACATGAAAAATCTTTATTTGGTAAGGCTAAAACAGTTGAAAGTTTATCTAATAATTGGGCTGGAGGTCAATATGGACAAGATTTAGTAAGAATGATGGGAGAAATTGAGGCAACAAAATAA
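The relation between the CDS and the protein sequence can be illustrated with a small translation sketch. It assumes the standard codon assignments and that the CDS is shown in coding orientation (it starts with ATG although it lies on the minus strand). Only the first 30 and last 15 nucleotides of the CDS are used, to keep the example short; `translate` is a hypothetical helper, not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAGAACAATTAGAAGAGTTTTAGCATTA"  # first 30 nt of the CDS
cds_end = "GAGGCAACAAAATAA"                   # last 15 nt of the CDS

print(translate(cds_start))  # MRTIRRVLAL
print(translate(cds_end))    # EATK
```

Both fragments agree with the ends of the protein sequence listed for A0A125V6U0 (MRTIRRVLAL... and ...EATK).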

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
A0A125V6U0
Method AlphaFoldDB
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
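The confidence bands in the legend can be expressed as a simple threshold lookup. This is a sketch only: the legend does not say which band the exact boundary values (90, 70, 50) belong to, so placing them in the lower band here is an assumption, and `plddt_band` is a hypothetical name.

```python
def plddt_band(plddt):
    """Map a per-residue pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70 or 50) fall into the lower band;
    this is an assumption, since the legend leaves them unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 80.0, 60.5, 30.0):
    print(score, plddt_band(score))
```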
PDB ID
upi00006dca48_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -

The structures below correspond to the cluster representative (1lyG) rather than this protein.
PDB ID
1lyG
Method AlphaFoldv2
Resolution 88.88
Chain position -