Protein

Protein accession
8wI0E [EnVhog]
Representative
33NbB
Source
EnVhog (cluster: phalp2_9963)
Protein name
8wI0E
Lysin probability
50%
PhaLP type
endolysin
Probability: 83% (predicted by ML model)
Protein sequence
mpqaarttdpisphspcppekcgpgsnnviiqglpayrvgdstvphgvpkprvgcvphvtplvkgshnvfvngqpagrvgdshscgvvviagankvninggtgsnhhpsfttgghsvsgkpiesnspsatqtkttsggvpsslsnfiqqkegfvscafldgsqytngfgteanssteciseteaktrmdsdlatrrtfvtnygnnngynwsstqidaltsfaynlgtgaiaqvtanstrtdavivdkillynkasgvvssgltirrqeesdwfksgmn
Physico‐chemical
properties
protein length:278 AA
molecular weight:29045,8 Da
isoelectric point:8,21
hydropathy:-0,39
Representative Protein Details
Accession
33NbB
Protein name
33NbB
Sequence length
273 AA
Molecular weight
28558,27050 Da
Isoelectric point
7,66158
Sequence
MPHAARTSDPISPHSPCGPEKCGAGSKNVIIQGLPAYRVGDSTVPHGVPHYVPAFTCVPHVTPLVSGSHNVFVNGQPAGRVGDSHSCGVKVLAGANKVHINDAGTGGGYSSVSARVTESKSQASATQTKTISGGVPSSLSSFIQQKEGFVSCAFLDGSQYTNGFGTEANSPTECISESEAKTRMDSDLATRRTFVTNYSTNNGYNWSSTQIDALTSFAYNLGTGAIAQVTANSTRTDAVIVDKILLYNKASGVVSSGLTQRRQEESDWFKSGM
Other Proteins in cluster: phalp2_9963
Total (incl. this protein): 12 Avg length: 267,3 Avg pI: 6,74

Protein ID Length (AA) pI
33NbB 273 7,66158
10hjW 265 5,65221
1Zomz 265 5,48464
22X8o 265 7,71665
2uDnS 268 5,03403
34N9K 265 5,48464
3SHQv 258 8,84347
6CXl3 265 7,71296
8AxjC 262 5,71348
8wIzH 279 7,65368
8wQsX 265 5,65880
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24929
8Gjzo
1 76,5% 188 3.140E-88
2 phalp2_9007
4WhBz
18 44,3% 176 5.136E-46
3 phalp2_31389
2UiYx
4 22,7% 189 9.322E-06

Domains

Domains
Representative sequence (used for alignment): 33NbB (273 AA)
Member sequence: 8wI0E (278 AA)
1 273 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959, PF05488

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
8wI0E
Method AlphaFoldv2
Resolution 84.56
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (33NbB) rather than this protein.
PDB ID
33NbB
Method AlphaFoldv2
Resolution 85.17
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50