Protein

Protein accession
A0A125V3Z1 [UniProt]
Representative
7gIqH
Source
UniProt (cluster: phalp2_3949)
Protein name
Transglycosylase
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MNSKKVLILSIFIILFGALLMESKVIHKFLYPKKYSEYVEKYSKEFNLDENIVYSVIKAESKFNSSAVSKKEAKGLMQILDITRDWGAEELNLKNVDIFDPETNIRLGCWYLSKLYKEFGKLDLVIAAYNGGSGNVKKWLENNEYSKDGENLHDIPFKQTSKYVEKVKNNYEHYNKIYGKKGKN
Physico-chemical properties
protein length: 184 AA
molecular weight: 21403,4 Da
isoelectric point: 9,09
hydropathy: -0,52
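The length and hydropathy listed above can be re-derived from the protein sequence. The following is a minimal sketch, assuming the hydropathy value is a GRAVY score (mean Kyte-Doolittle hydropathy) and that molecular weight is the sum of average residue masses plus one water molecule. The record does not state which method the database uses, so small differences from the listed values are possible. The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus one water).
MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # mass of the water added back for the free termini


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed method)."""
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water (assumed method)."""
    return sum(MASS[aa] for aa in seq) + WATER


# Protein sequence of A0A125V3Z1 as given in this record.
SEQ = (
    "MNSKKVLILSIFIILFGALLMESKVIHKFLYPKKYSEYVEKYSKEFNLDENIVYSVIKAE"
    "SKFNSSAVSKKEAKGLMQILDITRDWGAEELNLKNVDIFDPETNIRLGCWYLSKLYKEFG"
    "KLDLVIAAYNGGSGNVKKWLENNEYSKDGENLHDIPFKQTSKYVEKVKNNYEHYNKIYGK"
    "KGKN"
)

print(f"length: {len(SEQ)} AA")
print(f"molecular weight: {molecular_weight(SEQ):.1f} Da")
print(f"hydropathy (GRAVY): {gravy(SEQ):.2f}")
```

The isoelectric point is left out because it depends on the pKa set chosen, which the record does not specify.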
Representative Protein Details
Accession
7gIqH
Protein name
7gIqH
Sequence length
150 AA
Molecular weight
17544,73890 Da
Isoelectric point
5,65988
Sequence
MKYRDYIDMYANEHKLDPYFVAAVIKTESNFKEDAASKKNAQGLMQITPETGEWVAEKMGMKDFNIDDLKDPETNIKMGCWYLNNLKEEFDGNMDLVLAAYNGGRGNVQKWLKDSEHSKDGESLHYIPFKETDKYVKKVKAIYNIYRFFI
Other Proteins in cluster: phalp2_3949
Total (incl. this protein): 9; Avg length: 175,3 AA; Avg pI: 7,08

Protein ID Length (AA) pI
7gIqH 150 5,65988
2DcGA 159 4,97844
3KTrN 201 9,32859
3uEkt 195 5,72308
7iYtf 171 9,08516
7lCZq 181 9,30158
zil4 182 5,40433
zlI0 155 5,11741
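The cluster averages can be checked against the member table. The sketch below assumes the averages cover the eight listed members plus this protein (184 AA, pI 9,09), which matches the stated total of 9. The `parse_decimal` helper is a hypothetical name for converting the decimal-comma values used in this record.

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '5,65988' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) for the other cluster members, as listed.
OTHER_MEMBERS = [
    ("7gIqH", 150, "5,65988"),
    ("2DcGA", 159, "4,97844"),
    ("3KTrN", 201, "9,32859"),
    ("3uEkt", 195, "5,72308"),
    ("7iYtf", 171, "9,08516"),
    ("7lCZq", 181, "9,30158"),
    ("zil4", 182, "5,40433"),
    ("zlI0", 155, "5,11741"),
]

# This record's protein, taken from its physico-chemical properties.
THIS_PROTEIN = ("A0A125V3Z1", 184, "9,09")

members = OTHER_MEMBERS + [THIS_PROTEIN]
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

print(len(members), round(avg_length, 1), round(avg_pi, 2))
# prints: 9 175.3 7.08
```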
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9988 (3eSmS) 4 38,5% 153 9.543E-31
2 phalp2_21911 (4JxwN) 4 34,2% 140 1.095E-25
3 phalp2_4421 (2GvBa) 2 34,4% 145 2.813E-25
4 phalp2_11133 (5t07n) 26 33,7% 145 4.312E-23
5 phalp2_37089 (16u4b) 9 36,0% 150 3.893E-22
6 phalp2_32676 (3NC2g) 2 39,6% 111 9.992E-22
7 phalp2_8756 (8sxGC) 25 32,8% 137 1.368E-21
8 phalp2_31027 (1wb8Z) 51 31,9% 141 1.233E-20
9 phalp2_19158 (1QRTe) 2 31,7% 145 5.927E-20
10 phalp2_3377 (3nm4J) 1 35,8% 145 8.112E-20

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7gIqH (150 AA)
Member sequence: A0A125V3Z1 (184 AA)
Axis: 1 to 150 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Peptoclostridium phage phiCDIF1296T [NCBI] 1677909 No lineage information
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
CP011968 [NCBI]
CDS location
range 1243006 -> 1243560
strand +
CDS
GTGAATAGTAAGAAAGTATTGATTCTATCTATTTTTATAATCTTATTTGGGGCACTATTAATGGAAAGCAAAGTAATACATAAATTTTTATATCCTAAAAAATATTCAGAGTATGTAGAAAAGTATTCGAAAGAATTTAATTTAGATGAAAATATAGTTTACAGTGTTATTAAAGCCGAAAGTAAGTTTAATAGTTCTGCTGTTTCAAAAAAGGAAGCAAAAGGATTAATGCAAATATTAGACATAACTAGAGATTGGGGAGCAGAGGAACTAAATTTAAAAAATGTGGATATTTTCGACCCAGAGACTAATATAAGACTTGGCTGTTGGTATTTAAGTAAGTTATACAAAGAATTTGGTAAATTAGATTTAGTGATAGCTGCATATAATGGTGGTTCAGGTAATGTGAAAAAATGGTTAGAAAATAATGAATATAGTAAAGATGGCGAAAATCTACATGATATACCTTTTAAGCAAACTTCAAAATATGTAGAAAAAGTAAAAAATAATTACGAACATTATAATAAGATATATGGCAAGAAAGGAAAAAACTAA
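The CDS coordinates and the protein length can be cross-checked: an inclusive range of 1243006 to 1243560 spans 555 nt, that is 185 codons, or 184 amino acids plus a stop codon. The sketch below also translates the first and last codons of the CDS. It assumes the bacterial genetic code (NCBI translation table 11), in which an initiating GTG is read as methionine. The record does not name the table, so this is an assumption, and the function names are illustrative.

```python
from itertools import product

# Standard codon table in TCAG order; codon assignments are the same in table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Common alternative start codons (assumed subset of table 11 starts).
START_CODONS = {"ATG", "GTG", "TTG"}


def cds_length(start: int, end: int) -> int:
    """Length in nt of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str, is_start: bool = True) -> str:
    """Translate an in-frame fragment; an initiating start codon becomes M."""
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


n_nt = cds_length(1243006, 1243560)
print(n_nt, n_nt // 3 - 1)  # nucleotides, residues excluding the stop codon

# First 30 nt and last 21 nt of the CDS in this record (fragments only).
CDS_HEAD = "GTGAATAGTAAGAAAGTATTGATTCTATCT"
CDS_TAIL = "GGCAAGAAAGGAAAAAACTAA"
print(translate(CDS_HEAD))                  # start of the protein
print(translate(CDS_TAIL, is_start=False))  # end of the protein, '*' marks the stop
```

Both fragments agree with the ends of the protein sequence given above (MNSKKVLILS and GKKGKN).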

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
A0A125V3Z1
Method AlphaFoldDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
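The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities, so scores of exactly 90, 70, or 50 are not covered by it. Placing them in the lower band is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the legend band for a pLDDT score between 0 and 100."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:  # boundary value 90 falls here (assumption)
        return "High"
    if score > 50:  # boundary value 70 falls here (assumption)
        return "Low"
    return "Very low"  # includes the boundary value 50 (assumption)


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```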
PDB ID
upi0000da5101_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -

The structures below correspond to the cluster representative (7gIqH) rather than this protein.
PDB ID
7gIqH
Method AlphaFoldv2
Resolution 97.07
Chain position -