Protein

Protein accession
8lHWH [EnVhog]
Representative
62kfV
Source
EnVhog (cluster: phalp2_6205)
Protein name
8lHWH
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSRLDNFLWWCDFYCNGITVYYSQENRQDGFDNPGSPTYMDCSSMTIIAARQAGYPTGGAWYTGDMVPAFISAGWECLDYSWDTMQPGDVVIRPANAWRGGHVVVVGYQGTCYEAYTDEIAPEDQVRQTAIYEFGADYILRPPADDYSAPTPEPEPKPEPTITPTEGTPMFVRVNFGESYGYALIPYGLGAQGLDQDQADRYYRAGLRPVEVSAEDFTALVADSWAHYSACFGSLASKSDVQAQTDAVVNAVKAAATKAD
Physico‐chemical
properties
protein length:260 AA
molecular weight:28580,2 Da
isoelectric point:4,20
hydropathy:-0,34
Representative Protein Details
Accession
62kfV
Protein name
62kfV
Sequence length
260 AA
Molecular weight
28604,22480 Da
Isoelectric point
4,20702
Sequence
VSRLDNLLWWCDFYCNGITVYYSQENRQDGFDNPGSPTYMDCSSMTIVAARQAGYPTGGAWYTGDMVPAFISAGWECLDYSWDAMQPGDVVIRPANAWRGGHVVVIGYQGTCYEAYTDETAPEDQVRQTSIYEFGADYILRPPADNYSAPAPEPEPEPEPTINTTEGLLMFVRVNFGESYGYALIPYGLGAQGLNQDQADRYYRAGLRPVEVTAEDFTALVAESWSHYSACFGSLASKSDVQAQTDAVVNAVKAASIKTD
Other Proteins in cluster: phalp2_6205
Total (incl. this protein): 15 Avg length: 259,7 Avg pI: 4,19

Protein ID Length (AA) pI
62kfV 260 4,20702
1kqIj 261 4,32769
3c8DB 261 4,10982
5Mj1b 263 4,17701
5Mj2m 260 4,19525
5PrJr 259 4,21685
5Y4oz 263 4,13739
64vCy 261 4,21731
6wzcB 235 4,20344
7MxSV 265 4,08362
7ZzAp 263 4,20929
8r4Qs 262 4,20696
Hyv4 263 4,20929
OgqM 260 4,17706
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40345
3lM8u
52 30,5% 272 2.422E-19
2 phalp2_2710
7eYI9
24 25,7% 202 1.974E-05

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 62kfV (260 AA)
Member sequence: 8lHWH (260 AA)
1 260 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (62kfV) rather than this protein.
PDB ID
62kfV
Method AlphaFoldv2
Resolution 81.17
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50