Protein

Protein accession
A0A4Y5FEU0 [UniProt]
Representative
4c4WR
Source
UniProt (cluster: phalp2_31782)
Protein name
Peptidoglycan-binding protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MKNTVKTILVSASVVTGMFILGTQANADTVTVASGDTVSKIAQEHGDTINHIVSVNQLANPDFIQVGEKLQVGTSLQKNSTSTASSGYSGVTNTETAASTSEVQNKPSTGYTSATTSGSVAEQAAQLMAQKTGVSASEWSHIIQRESNGNPSAQNSSSSAHGLFQRLGETSNDWRTQVNNAAELYSKQGLNAWSETK
Physico-chemical properties
protein length: 197 AA
molecular weight: 20614,2 Da
isoelectric point: 5,76
hydropathy: -0,43
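As a rough cross-check of the properties above, the following sketch recomputes the sequence length and a hydropathy score from the protein sequence. It assumes the reported hydropathy is a GRAVY value (the mean Kyte-Doolittle index per residue), which this record does not state. The names `gravy`, `parse_decimal_comma` and `MEMBER_SEQ` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of A0A4Y5FEU0 as listed in this record.
MEMBER_SEQ = (
    "MKNTVKTILVSASVVTGMFILGTQANADTVTVASGDTVSKIAQEHGDTINHIVSVNQLAN"
    "PDFIQVGEKLQVGTSLQKNSTSTASSGYSGVTNTETAASTSEVQNKPSTGYTSATTSGSV"
    "AEQAAQLMAQKTGVSASEWSHIIQRESNGNPSAQNSSSSAHGLFQRLGETSNDWRTQVNN"
    "AAELYSKQGLNAWSETK"
)


def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma (e.g. '-0,43') to float."""
    return float(text.replace(",", "."))


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


reported = parse_decimal_comma("-0,43")  # hydropathy as printed in the record
print(len(MEMBER_SEQ), round(gravy(MEMBER_SEQ), 2), reported)
```

If the assumption about the scale holds, the computed score can be compared directly with the reported hydropathy. A clear mismatch would suggest the database uses a different scale or averaging.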
Representative Protein Details
Accession
4c4WR
Protein name
4c4WR
Sequence length
186 AA
Molecular weight
20418,03170 Da
Isoelectric point
9,86020
Sequence
GVGHLVLRNDKVLKSVVGKDYNDVVRGKRALTDRQMEQLFNIDVKSKIKAAQRKIPKFNSYPQYIRNAIVDGFFRGDLSGSKDTLALINQGEFKAAAKEYLNHAGYRKSKAEGTGVAGRMERNAAAFATFGGDVPTQPVTTDFYTVKPGDTLSKIAKQSGKSINDLIKVNNLSDPDKLQVGQRLSL
Other Proteins in cluster: phalp2_31782
Total (incl. this protein): 2; Avg length: 191,5; Avg pI: 7,81

Protein ID Length (AA) pI
4c4WR 186 9,86020
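The cluster averages can be reproduced from the two members listed in this record: A0A4Y5FEU0 (197 AA, pI 5,76) and the representative 4c4WR (186 AA, pI 9,86020). The sketch below is a minimal illustration; the tuple layout and helper name are assumptions, not part of the database.

```python
from statistics import mean


def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma to float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed in the record)
members = [
    ("A0A4Y5FEU0", 197, "5,76"),
    ("4c4WR", 186, "9,86020"),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(parse_decimal_comma(pi) for _, _, pi in members)

print(avg_length, round(avg_pi, 2))
```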
Similar Clusters (pHMM search)
# Cluster (representative) # Members Identity (%) Alignment Length E-value
1 phalp2_2048 (2AGmm) 1 89,6% 126 2.902E-68
2 phalp2_17880 (v3wn) 54 37,6% 170 4.688E-26
3 phalp2_30621 (6CSYi) 90 50,0% 130 6.409E-26
4 phalp2_9501 (ZcC9) 1 38,6% 137 4.874E-21
5 phalp2_31984 (55Yxp) 114 36,4% 137 3.509E-16
6 phalp2_20206 (81gIU) 77 30,4% 125 1.242E-13
7 phalp2_1915 (4hcut) 54 29,0% 196 1.071E-12
8 phalp2_32470 (17zvi) 3 34,6% 130 1.658E-09
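To work with the pHMM hits programmatically, the numeric fields of each row can be parsed and filtered. The sketch below is one possible approach: the `Hit` type, the `parse_hit` helper and the 50% identity cutoff are illustrative assumptions rather than anything defined by the database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


def parse_hit(cluster: str, stats: str) -> Hit:
    """Parse '<members> <identity>% <alignment length> <E-value>' for one cluster."""
    members, identity, aln_length, evalue = stats.split()
    identity_pct = float(identity.rstrip("%").replace(",", "."))  # decimal comma
    return Hit(cluster, int(members), identity_pct, int(aln_length), float(evalue))


# Values copied from the similar-clusters table of this record.
RAW_HITS = [
    ("phalp2_2048", "1 89,6% 126 2.902E-68"),
    ("phalp2_17880", "54 37,6% 170 4.688E-26"),
    ("phalp2_30621", "90 50,0% 130 6.409E-26"),
    ("phalp2_9501", "1 38,6% 137 4.874E-21"),
    ("phalp2_31984", "114 36,4% 137 3.509E-16"),
    ("phalp2_20206", "77 30,4% 125 1.242E-13"),
    ("phalp2_1915", "54 29,0% 196 1.071E-12"),
    ("phalp2_32470", "3 34,6% 130 1.658E-09"),
]

hits = [parse_hit(cluster, stats) for cluster, stats in RAW_HITS]

# Hypothetical cutoff: keep clusters with at least 50% identity.
closest = [h.cluster for h in hits if h.identity_pct >= 50.0]
print(closest)
```

Sorting `hits` by `evalue` reproduces the order of the table, which lists the most significant hit first.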

Domains

Domains [InterPro]
Domain tracks: Unannotated, LysM
Representative sequence (used for alignment): 4c4WR (186 AA)
Member sequence: A0A4Y5FEU0 (197 AA)
Domain positions follow the representative sequence above (axis: 1 to 186 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01476

Taxonomy

Phage: Lactobacillus phage 3-521 [NCBI]
Taxonomy ID: 2510943
Lineage: Herelleviridae > Watanabevirus > Watanabevirus wv3521
Host: No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MK504444 [NCBI]
CDS location
range 28165 -> 28758
strand -
CDS
ATGAAAAATACAGTAAAGACTATTTTGGTAAGTGCTTCAGTTGTTACAGGGATGTTTATTTTAGGTACCCAAGCTAATGCAGATACTGTTACGGTAGCTTCAGGAGATACAGTAAGCAAGATTGCACAAGAACATGGAGATACTATCAATCACATCGTCTCAGTTAATCAATTAGCTAACCCTGATTTCATTCAGGTAGGAGAAAAGCTACAGGTTGGTACTTCATTGCAAAAAAACAGTACCTCAACGGCTTCCAGTGGGTACTCAGGTGTAACTAACACGGAAACGGCTGCAAGTACCTCAGAAGTCCAAAATAAGCCAAGCACAGGGTACACCTCTGCAACTACTTCTGGTAGTGTTGCAGAACAGGCAGCACAATTGATGGCACAAAAAACGGGTGTTAGTGCTTCAGAATGGTCACATATTATTCAACGGGAATCAAATGGTAACCCTTCTGCTCAAAATTCGTCCAGCTCAGCTCATGGGTTATTTCAACGGTTAGGTGAAACAAGTAATGACTGGCGTACTCAAGTTAACAATGCTGCTGAATTGTACAGCAAGCAAGGACTAAATGCCTGGTCTGAAACTAAGTAA
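The CDS coordinates and sequence can be checked against the protein record. The range 28165 -> 28758 spans 594 nt, which is 198 codons, or 197 amino acids plus a stop codon. The CDS as listed starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the feature lies on the minus strand. The sketch below assumes the standard genetic code; the function names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse complement, as needed when cutting a minus-strand CDS from a genome."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(cds: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(28165, 28758)
print(n_nt, n_nt // 3 - 1)  # nucleotides, residues excluding the stop codon

# First ten codons of the listed CDS, followed by its stop codon.
print(translate("ATGAAAAATACAGTAAAGACTATTTTGGTA" + "TAA"))
```

The first ten codons of the listed CDS translate to MKNTVKTILV, which matches the start of the protein sequence in this record.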

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4c4WR) rather than this protein.
PDB ID
4c4WR
Method AlphaFoldv2
Resolution 90.67
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
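The confidence bands above can be expressed as a small lookup. This is only a sketch: the legend uses strict inequalities and does not say which band the exact values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"  # assumption: exactly 90 falls in this band
    if plddt > 50:
        return "Low"  # assumption: exactly 70 falls in this band
    return "Very low"  # assumption: exactly 50 falls in this band


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```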