Protein

Protein accession
3Qzd8 [EnVhog]
Representative
4g4R2
Source
EnVhog (cluster: phalp2_14358)
Protein name
3Qzd8
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKHILEAVIWNHYRSKKPRGFYNIIKGHTGVDLNYIFEPLPSPVTGQIVKILEQKEMGKVIYLCDIASGSIHVFAHMKQVDVTEGQKVKRNQVLGITGNSGAKTTSAHLHYEIIVFKKPEKLLDRIMARSLATYKGWNTDPIMYLRELYGKYGFGPSGDLIT
Physico‐chemical
properties
protein length:162 AA
molecular weight:18362,2 Da
isoelectric point:9,52
hydropathy:-0,23
Representative Protein Details
Accession
4g4R2
Protein name
4g4R2
Sequence length
163 AA
Molecular weight
18399,85020 Da
Isoelectric point
8,98536
Sequence
MKTVFNSKVINDYRSHVPKGFYNAIKGHTGVDLEYSYENLFSPVTGEVVGLTTQTEMGRCIYLRDVKGTVHVFAHMSAISVSLHAKVKRDDMLGITGNTGSRTTKPHLHYEVICTSTSVQKSSPYDFIMTRKELPFKGFNRNPLKYLAQLYEEYGVIIPDAKD
Other Proteins in cluster: phalp2_14358
Total (incl. this protein): 10 Avg length: 180,7 Avg pI: 8,35

Protein ID Length (AA) pI
4g4R2 163 8,98536
1KD8Y 198 5,71700
1pqbI 176 9,21139
2wKY9 196 8,31096
802Qx 159 9,79257
8a63L 201 8,50623
8oqmx 178 9,80630
8redu 190 7,08955
luWG 184 6,60000
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10380
1KHNN
2 37,9% 145 4.268E-53
2 phalp2_14848
7w0BH
74 27,4% 164 3.309E-19
3 phalp2_9303
7pYyV
24 32,4% 157 1.712E-16
4 phalp2_21601
2tLnI
8 25,0% 124 9.842E-15
5 phalp2_22730
8dH10
29 28,9% 114 1.344E-14
6 phalp2_27905
l4Bb
39 26,1% 134 4.664E-14
7 phalp2_17326
4kAwN
8 24,4% 131 1.185E-13
8 phalp2_21800
4e6HN
71 28,9% 107 3.149E-11
9 phalp2_5933
67ENK
29 26,5% 128 1.085E-10
10 phalp2_14951
kt60
3 27,0% 137 2.012E-10

Domains

Domains
Representative sequence (used for alignment): 4g4R2 (163 AA)
Member sequence: 3Qzd8 (162 AA)
1 163 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3Qzd8
Method AlphaFoldv2
Resolution 87.56
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4g4R2) rather than this protein.
PDB ID
4g4R2
Method AlphaFoldv2
Resolution 84.02
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50