Protein

Protein accession
4gVhN [EnVhog]
Representative
8aIyh
Source
EnVhog (cluster: phalp2_7026)
Protein name
4gVhN
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MLSIEAFQDSSFKFTPQEFVHPAIIQSVGLTRCLYYISDFQINYAALLREVLNTPVILNTYSYKKPGPGYYVGRGTRPSFYRPEGGGTLSQHYFANALDASTKLYSPREIFEAIMANEARFKAIGLTTIEDLDSTPGWIHGDGRRIIPGLYPEKGFLIVRPL
Physico‐chemical
properties
protein length:162 AA
molecular weight:18233,6 Da
isoelectric point:7,83
hydropathy:-0,14
Representative Protein Details
Accession
8aIyh
Protein name
8aIyh
Sequence length
162 AA
Molecular weight
18193,63030 Da
Isoelectric point
7,78677
Sequence
MLSIEAFQDPSFKFTPQEFVHPAIIQSVGLTRCLYYISDFQINYAALLREVLNTPVILNTLSYKKPGPGYYVGRGTRPSFYRPGGGGTLSQHYFANALDASTKLYSPREIFEAIMANEARFKAIGLTTIEDIDSTPGWIHGDCRRFIEGIHPEKGFLIVRPV
Other Proteins in cluster: phalp2_7026
Total (incl. this protein): 8 Avg length: 155,8 Avg pI: 8,16

Protein ID Length (AA) pI
8aIyh 162 7,78677
1IQdC 156 9,51503
1Mci3 164 8,81465
2f5gd 159 7,84775
3YLOo 159 7,84775
4of9c 159 7,84801
5fsmH 125 7,82274
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6542
UVYl
146 34,6% 98 6.983E-15
2 phalp2_10855
4iQ7K
18 28,5% 133 1.568E-13
3 phalp2_17599
6zH5y
2 25,8% 151 2.566E-12
4 phalp2_34764
3iKHD
290 31,8% 135 8.868E-12
5 phalp2_1968
4BR1x
969 27,8% 133 1.957E-10
6 phalp2_1804
38zLG
170 28,4% 130 1.957E-10
7 phalp2_31809
4kTVa
1 26,7% 131 3.630E-10
8 phalp2_7525
4Yxu6
7 27,6% 159 4.275E-09
9 phalp2_32678
80b3k
2 26,6% 150 7.909E-09
10 phalp2_37920
4UNwK
25 24,3% 160 1.444E-06

Domains

Domains
Unannotated
Representative sequence (used for alignment): 8aIyh (162 AA)
Member sequence: 4gVhN (162 AA)
1 162 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (8aIyh) rather than this protein.
PDB ID
8aIyh
Method AlphaFoldv2
Resolution 94.98
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50