Protein

Protein accession
5sT8i [EnVhog]
Representative
34A4t
Source
EnVhog (cluster: phalp2_12942)
Protein name
5sT8i
Lysin probability
77%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MHFGWNEPYPANLAHLTSDGKHKGADYLVCIGTPVIASVDGIISEISETRDFGLHIIIKFTTGMLCKTYYHLILAHLSQIITKKTVGNKINKTEIIGLSGDSGSAKGHPHLHVQCNKLVKSVWVPVNPAFAIGES
Physico-chemical properties
protein length: 135 AA
molecular weight: 14673.8 Da
isoelectric point: 8.45
hydropathy: 0.06
Representative Protein Details
Accession
34A4t
Protein name
34A4t
Sequence length
131 AA
Molecular weight
14042.64260 Da
Isoelectric point
5.55638
Sequence
VNNDNLFGTPDPIYTELGYLGHPGCDFLAPQGTPVRASADGICTTARAVGTAGLMIRLEHEAYGYATRYLHLSGVLVGEGMHVKRGQTIGLTGNTGLSTGPHLHFDVYDYRESVLNGYGQRVDPLPLLEAI
Other Proteins in cluster: phalp2_12942
Total (incl. this protein): 4; Avg length: 139.3 AA; Avg pI: 8.28

Protein ID  Length (AA)  pI
34A4t       131          5.55638
1KPx2       146          9.38120
2A1P6       145          9.73448
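
The cluster summary can be checked against the per-protein values. The sketch below is illustrative only: it combines the three proteins listed in the table with this protein (5sT8i, 135 AA, pI 8.45, taken from its physico-chemical properties), and the `members` dictionary is a hypothetical name, not part of the database.

```python
from statistics import fmean

# (length in AA, isoelectric point) for every member of cluster phalp2_12942.
# 5sT8i uses the values listed under this protein's physico-chemical properties;
# the other three entries come from the cluster table.
members = {
    "5sT8i": (135, 8.45),
    "34A4t": (131, 5.55638),
    "1KPx2": (146, 9.38120),
    "2A1P6": (145, 9.73448),
}

avg_length = fmean(length for length, _ in members.values())
avg_pi = fmean(pi for _, pi in members.values())

print(f"members: {len(members)}")
print(f"avg length: {avg_length:.2f} AA")
print(f"avg pI: {avg_pi:.2f}")
```

The mean length works out to 139.25 AA, which the summary line shows rounded to one decimal place, and the mean pI is about 8.28.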
Similar Clusters (pHMM search)
#   Cluster                Members  Identity (%)  Alignment length  E-value
1   phalp2_18623 (3ebV)    83       47.5%         122               1.079E-23
2   phalp2_27905 (l4Bb)    39       43.0%         130               2.528E-22
3   phalp2_22730 (8dH10)   29       43.3%         127               3.149E-21
4   phalp2_40658 (5cY9d)   663      45.5%         112               1.719E-18
5   phalp2_32733 (8iF6d)   6        38.5%         127               5.487E-17
6   phalp2_26412 (1I66q)   14       47.1%         87                1.411E-16
7   phalp2_17326 (4kAwN)   8        42.8%         105               1.932E-16
8   phalp2_15826 (4gsPm)   3        41.3%         121               2.647E-16
9   phalp2_5756 (7Czmx)    45       42.2%         109               4.966E-16
10  phalp2_34915 (4pbkX)   159      39.2%         102               1.276E-15

Domains

Representative sequence (used for alignment): 34A4t (131 AA)
Member sequence: 5sT8i (135 AA)
[Domain diagram: axis spans residues 1 to 131 of the representative. Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.]
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG [NCBI]  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (34A4t) rather than this protein.
PDB ID: 34A4t
Method: AlphaFold v2
Resolution: 94.20
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
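
The confidence legend amounts to a simple threshold lookup, sketched below. This is a minimal illustration, not the site's own code: `plddt_band` is a hypothetical helper, the legend does not say where scores of exactly 90, 70, or 50 fall (the sketch puts them in the lower band), and reading the 94.20 shown for the AlphaFold model as a mean pLDDT is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    # The legend uses strict inequalities, so scores of exactly 90, 70, or 50
    # are not covered there; assigning them to the lower band is an assumption.
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# 94.20 is the value listed for the representative's AlphaFold model;
# interpreting it as a mean pLDDT is an assumption.
print(plddt_band(94.20))
```

Under that reading, the representative model would fall in the "Very high" band.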