Protein

Protein accession
A0A125V4H3 [UniProt]
Representative
4ukUL
Source
UniProt (cluster: phalp2_30305)
Protein name
Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MKKKAALATLAMLPLGVVNAHADGDIGIVTINYLNVRNEPTAESSIAFVAKKDDKVLIKDSSNGWYKIKAESGQEGWASSKYIAKSNSDSLRTSTNKEKQVISNSLNMRNGAGTSYRVITVLKKGQKVEVISESNGWSKIKYDGRLGYVSSSYLGDVSNSTNKSKTKQVNTTSLNVRSGPNTSYGLLGKLPKGSKVEVISESNGWSKIKYNGKDAYVSSMYLSDVSQSNSDNSSQSNDKKNTDKVVNTASLNVRSGPGSTYSKLGKVYKGSKVTVLSESSGWAKINFNNKEAFVVGNYLSTSADTSNNNSNSNSDNSSNSNGNNSSSSGQVNGMSGISGAKIDYKSLSYTLESHISKQVEKAASGGNVIAPSNRKSTPSPEFSTFSAQRTSSFVNASSSDIEYYLNPKNFTNTTKGMMQFLKINSYRDGISESSLNSYLNGLSSSVFKNQGAAFINAAKKYNIDVVYLVSHAMWETAYGKSTLAQGQTLTSYKGQPLSKPVKVYNFFGIGAIDKSANVSGAEAAYSNGWTSVEATIDGSAKWISQNYVNSSKYNQNTIYKMKWNYDYTWHQYATDVNWANGISGIMENLIGLYGGGSSLVFEVPQYK
Physico‐chemical
properties
protein length:607 AA
molecular weight:65619,0 Da
isoelectric point:9,50
hydropathy:-0,55
Representative Protein Details
Accession
4ukUL
Protein name
4ukUL
Sequence length
416 AA
Molecular weight
44570,80470 Da
Isoelectric point
9,15350
Sequence
MSYRIFLDPGHGGSDRANRGPTGYVEADGVLDIARRLRSELQALGFEVSMSRDKDATVNLSQRGKMAGQFKADLFLSIHSNAGSAVATGTEVYYSVNLPQTKAIAAKMSKAVANILGIPDRGAKVRESQNYPGEDYYTVIDTAQDTGVPRVFLIEVAFHSNPKEEALLKQPAVREKIARALADVIADTFGVQGTTPQITPMADVRGVVKVNSSLNVRNGPGEQYKIIGKLQNGDVVTINGKSGNWYRIKYNNGVAYVSGQYLVVSGTSSTPAPAPTPSAQTGTVKVNTTLNVRSGAGTQYKVVGSLKNGTKVEVLDKSGSWYKIKYGSITGYVSGQYLVVDNSNSDVGDDDVLEQIVLYLGDIDALSAIVVAQKLHAPAMRKSDFDTSGIKAKKIIQIGGGEGDRFDTFKKAAQYL
Other Proteins in cluster: phalp2_30305
Total (incl. this protein): 3 Avg length: 456,0 Avg pI: 8,34

Protein ID Length (AA) pI
4ukUL 416 9,15350
4ugc8 345 6,35837
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22864
2G8aw
1 42,7% 274 1.553E-50
2 phalp2_24300
3nP7b
4 32,9% 386 2.620E-42
3 phalp2_39497
5jS1N
14 33,6% 351 2.936E-36
4 phalp2_19894
5T9s7
90 38,0% 260 5.959E-35
5 phalp2_9571
1kahy
4 28,4% 369 8.012E-28
6 phalp2_5970
6Cyvx
55 30,4% 282 1.538E-26
7 phalp2_22085
5PV79
1 29,7% 299 6.711E-26
8 phalp2_29154
7mKv1
17 28,1% 284 4.479E-21
9 phalp2_5199
40Vx7
1 27,9% 347 2.572E-19
10 phalp2_12323
7wjSB
3 29,4% 275 1.231E-11

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4ukUL (416 AA)
Member sequence: A0A125V4H3 (607 AA)
1 416 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01520, PF08239

Taxonomy

  Name Taxonomy ID Lineage
Phage Peptoclostridium phage phiCDIF1296T
[NCBI]
1677909 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
CP011968 [NCBI]
CDS location
range 1407487 -> 1409310
strand +
CDS
GTGAAGAAAAAAGCAGCTTTAGCAACATTGGCTATGTTACCCTTAGGTGTGGTAAATGCACATGCTGATGGAGATATAGGTATAGTGACTATAAATTATTTAAATGTAAGAAATGAACCAACTGCCGAAAGTAGCATAGCCTTTGTTGCTAAAAAAGATGATAAAGTTTTGATTAAAGACTCTTCTAATGGATGGTATAAAATAAAAGCTGAATCTGGACAAGAAGGTTGGGCTTCATCAAAATATATAGCAAAATCAAATAGTGACTCTTTAAGAACGTCTACAAATAAAGAAAAACAAGTTATTTCAAATAGTTTAAATATGAGAAATGGAGCAGGTACAAGTTACAGAGTAATAACTGTACTAAAAAAAGGTCAAAAAGTAGAAGTAATATCAGAGAGTAATGGATGGTCTAAAATCAAATATGATGGAAGACTAGGATATGTATCTAGCTCTTATTTAGGAGACGTTTCAAACTCAACTAATAAATCAAAAACTAAACAAGTGAATACAACTTCACTAAATGTAAGGAGTGGACCAAATACAAGTTATGGTTTGTTAGGCAAGTTGCCAAAAGGAAGTAAAGTAGAAGTAATATCAGAAAGTAATGGATGGTCAAAAATAAAGTATAATGGAAAAGATGCCTATGTATCTAGTATGTACTTATCGGATGTAAGTCAAAGTAATTCAGACAATTCTAGTCAAAGCAATGATAAAAAGAATACTGATAAGGTTGTAAATACAGCTTCTTTAAATGTAAGAAGTGGACCAGGTTCTACATATAGTAAGTTGGGAAAAGTTTATAAAGGAAGTAAAGTAACTGTACTATCAGAAAGTAGTGGATGGGCTAAGATTAATTTTAACAATAAAGAAGCATTTGTAGTAGGCAATTATTTATCTACTTCAGCAGATACTTCAAATAATAACTCAAATAGTAACTCTGATAACAGTTCTAATAGTAATGGCAATAATTCTTCATCATCAGGTCAAGTAAATGGTATGTCAGGTATAAGTGGGGCTAAAATTGATTATAAATCTTTGAGTTATACTTTAGAATCTCATATAAGTAAACAAGTTGAGAAAGCAGCATCAGGAGGAAATGTGATAGCTCCAAGTAATAGGAAAAGCACTCCAAGTCCTGAATTTAGTACTTTTTCAGCTCAAAGAACAAGCTCATTTGTAAATGCAAGTTCAAGTGATATAGAATATTATTTAAATCCTAAGAATTTTACAAATACTACTAAAGGTATGATGCAGTTTTTAAAGATTAACAGTTATAGAGATGGTATTTCAGAATCAAGTCTTAACTCGTATCTTAATGGTTTATCTTCTAGTGTATTTAAAAACCAGGGAGCAGCCTTTATAAATGCTGCCAAGAAGTATAATATAGATGTTGTATATCTAGTATCTCATGCTATGTGGGAAACTGCTTATGGTAAGTCTACACTTGCCCAAGGTCAAACATTAACATCTTATAAGGGACAACCTCTTAGTAAGCCAGTTAAAGTATACAATTTTTTTGGTATAGGAGCAATAGATAAAAGTGCTAATGTTTCAGGTGCAGAAGCAGCGTATTCCAATGGCTGGACTAGTGTAGAAGCTACAATAGATGGCTCTGCAAAGTGGATATCACAAAATTACGTAAATAGTTCTAAGTACAATCAGAATACTATTTACAAGATGAAGTGGAATTATGATTATACATGGCATCAGTATGCAACTGATGTAAACTGGGCTAATGGAATTTCTGGAATTATGGAAAATTTAATTGGTCTTTATGGAGGAGGAAGTAGTTTGGTTTTTGAAGTTCCTCAATATAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
A0A125V4H3
Method AlphaFoldDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi00016c6087_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4ukUL) rather than this protein.
PDB ID
4ukUL
Method AlphaFoldv2
Resolution 88.80
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50