Protein

Protein accession
4g4R2 [EnVhog]
Representative
4g4R2 (this protein)
Source
EnVhog (cluster: phalp2_14358)
Protein name
4g4R2
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKTVFNSKVINDYRSHVPKGFYNAIKGHTGVDLEYSYENLFSPVTGEVVGLTTQTEMGRCIYLRDVKGTVHVFAHMSAISVSLHAKVKRDDMLGITGNTGSRTTKPHLHYEVICTSTSVQKSSPYDFIMTRKELPFKGFNRNPLKYLAQLYEEYGVIIPDAKD
Physico‐chemical
properties
protein length:163 AA
molecular weight:18399,9 Da
isoelectric point:8,99
hydropathy:-0,33
Other Proteins in cluster: phalp2_14358
Total (incl. this protein): 10 Avg length: 180,7 Avg pI: 8,35

Protein ID Length (AA) pI
1KD8Y 198 5,71700
1pqbI 176 9,21139
2wKY9 196 8,31096
3Qzd8 162 9,51987
802Qx 159 9,79257
8a63L 201 8,50623
8oqmx 178 9,80630
8redu 190 7,08955
luWG 184 6,60000
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10380
1KHNN
2 37,9% 145 4.268E-53
2 phalp2_14848
7w0BH
74 27,4% 164 3.309E-19
3 phalp2_9303
7pYyV
24 32,4% 157 1.712E-16
4 phalp2_21601
2tLnI
8 25,0% 124 9.842E-15
5 phalp2_22730
8dH10
29 28,9% 114 1.344E-14
6 phalp2_27905
l4Bb
39 26,1% 134 4.664E-14
7 phalp2_17326
4kAwN
8 24,4% 131 1.185E-13
8 phalp2_21800
4e6HN
71 28,9% 107 3.149E-11
9 phalp2_5933
67ENK
29 26,5% 128 1.085E-10
10 phalp2_14951
kt60
3 27,0% 137 2.012E-10

Domains

Domains
Protein sequence: 4g4R2
1 163
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4g4R2
Method AlphaFoldv2
Resolution 84.02
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50