Protein

Protein accession
4GoZK [EnVhog]
Representative
4D2kq
Source
EnVhog (cluster: phalp2_27366)
Protein name
4GoZK
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGTFDLAWGQKVSPTFRAKVLEICRNFGWTNDHASWLMSCMAFESGETFSPRVRNAAGSGAVGLIQFMPNTAHDMGTTTDALAEMSSVQQLDYVQRYFKPYAARIESLSDMYMAILLPKYVGQPDDAVLFSGGIAYRQNAVLDADSDGQVTKAEAADKVTEKYIKGLAFSTEEPSV
Physico‐chemical
properties
protein length:176 AA
molecular weight:19302,6 Da
isoelectric point:4,86
hydropathy:-0,16
Representative Protein Details
Accession
4D2kq
Protein name
4D2kq
Sequence length
206 AA
Molecular weight
22540,09670 Da
Isoelectric point
4,65264
Sequence
MKLSKERLRQLIKEELNYLIKEYDVSSLHSRDAGPDGEEIKLSGALLAADRDFVSKVNNVALQISANPDHLMNIMKFESGLDPAIVNSASGATGLIQFMPATARSLGTTTAQLGDMSGLDQMDFVSDYFSGSGPYDSATDLYLKVFYPYAINQEGDYIIGSEVSLARAQQIAEQNPYFDKNEDGLVSKQSIVDKMEAVINRAAARV
Other Proteins in cluster: phalp2_27366
Total (incl. this protein): 9 Avg length: 188,0 Avg pI: 6,47

Protein ID Length (AA) pI
4D2kq 206 4,65264
10rWT 176 8,86616
3h1n 176 9,02108
4FU9y 178 9,25761
4Gorc 176 4,67634
4NbR2 217 5,84301
4w4wn 176 4,60591
7Jmeq 211 6,41010
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_38137
6S6gW
387 38,3% 159 6.061E-57
2 phalp2_19338
7nG3N
9 33,3% 168 6.285E-42
3 phalp2_26552
7YsQE
123 46,0% 141 8.552E-39
4 phalp2_36417
T712
4 38,5% 161 4.102E-38
5 phalp2_27181
3ezVK
6 34,6% 156 1.437E-37
6 phalp2_34411
4p3Yl
41 40,9% 144 7.566E-35
7 phalp2_31418
3cbTJ
6 45,0% 131 3.621E-34
8 phalp2_1276
Fzg9
27 30,8% 159 5.408E-32
9 phalp2_38395
XYcA
8 31,5% 200 7.393E-32
10 phalp2_7456
4NsYt
13 40,1% 162 2.302E-30

Domains

Domains
Representative sequence (used for alignment): 4D2kq (206 AA)
Member sequence: 4GoZK (176 AA)
1 206 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4D2kq) rather than this protein.
PDB ID
4D2kq
Method AlphaFoldv2
Resolution 88.09
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50