Protein

Protein accession
3Smx9 [EnVhog]
Representative
5k0pl
Source
EnVhog (cluster: phalp2_20737)
Protein name
3Smx9
Lysin probability
75%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSNRTRIIFKRDSESSVGRGAVIYHAHFRPEIATIIYWAAFYAPSELGEEMWITEGWRDIRDKRDLHKECRAFDIDCMRINAESYSLKFLIAQGWASCLADELGSDYQVILHGGEKALHLHVELDP
Physico-chemical properties
protein length: 126 AA
molecular weight: 14557,4 Da
isoelectric point: 5,64
hydropathy: -0,30
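The length and hydropathy values can be re-derived from the protein sequence listed in this record. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the record does not state which scale it uses, and the names `KD_SCALE`, `SEQ`, and `gravy` are illustrative.

```python
# Kyte-Doolittle hydropathy value per residue.
# Assumption: the record does not say which hydropathy scale was used.
KD_SCALE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 3Smx9, copied from the record.
SEQ = (
    "MSNRTRIIFKRDSESSVGRGAVIYHAHFRPEIATIIYWAAFYAPSELGEEMWITEGWRDIRDK"
    "RDLHKECRAFDIDCMRINAESYSLKFLIAQGWASCLADELGSDYQVILHGGEKALHLHVELDP"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD_SCALE[aa] for aa in seq) / len(seq)


print(len(SEQ))              # sequence length in residues
print(f"{gravy(SEQ):.2f}")   # GRAVY rounded to two decimals
```

Under this assumption the sketch reports 126 residues and a GRAVY that rounds to -0.30, in line with the listed values. Molecular weight and isoelectric point depend on the mass table and pKa set chosen, so they are left out here.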
Representative Protein Details
Accession
5k0pl
Protein name
5k0pl
Sequence length
124 AA
Molecular weight
14112,78770 Da
Isoelectric point
6,70322
Sequence
MKGPRIVFKDDRDSRVGRGCHLHAGHFRPEMAKVIYEAALTAPREADVMVVSEGWRHIRDSRDLHEEGRAFDLSLNIVTGLSFDQRKTMGTEWSNRLRAKLGRDYDVIVHGDGGNLHIHVELDP
Other Proteins in cluster: phalp2_20737
Total (incl. this protein): 21; Avg length: 126,7 AA; Avg pI: 6,37

Protein ID Length (AA) pI
5k0pl 124 6,70322
1CWTI 157 5,95061
1E0fJ 157 7,02845
1WfRp 131 5,47549
22I7R 121 5,24035
2rDSD 132 6,51167
3Hf0u 125 6,81252
4JQPk 124 6,74755
4vMSw 121 4,97355
5kdlB 127 4,67708
6AKUY 92 5,40899
7CDHn 127 6,70214
7ZUfJ 133 7,86342
80Ba3 133 7,87206
8dpWj 133 7,01532
8drtH 125 6,28551
8izxL 127 7,97250
8jkyX 117 6,21031
8v8KX 117 6,09902
NYF9 111 6,51366
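The cluster summary (21 proteins, average length 126,7 AA, average pI 6,37) can be checked against the table. The sketch below assumes the averages are plain arithmetic means over the 20 listed members plus this protein (3Smx9, 126 AA, pI 5,64); `ROWS` and `parse_row` are illustrative names, and the decimal commas are converted before parsing.

```python
# Table rows copied from the record, plus this protein (3Smx9) as the last row.
ROWS = """\
5k0pl 124 6,70322
1CWTI 157 5,95061
1E0fJ 157 7,02845
1WfRp 131 5,47549
22I7R 121 5,24035
2rDSD 132 6,51167
3Hf0u 125 6,81252
4JQPk 124 6,74755
4vMSw 121 4,97355
5kdlB 127 4,67708
6AKUY 92 5,40899
7CDHn 127 6,70214
7ZUfJ 133 7,86342
80Ba3 133 7,87206
8dpWj 133 7,01532
8drtH 125 6,28551
8izxL 127 7,97250
8jkyX 117 6,21031
8v8KX 117 6,09902
NYF9 111 6,51366
3Smx9 126 5,64
"""


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row; pI uses a decimal comma in the record."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


members = [parse_row(line) for line in ROWS.splitlines()]
avg_length = sum(m[1] for m in members) / len(members)
avg_pi = sum(m[2] for m in members) / len(members)

print(len(members), round(avg_length, 1), round(avg_pi, 2))
```

With these rows the means round to 126.7 and 6.37, which agrees with the summary line and suggests the totals do include this protein.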
Similar Clusters (pHMM search)
# Cluster Protein Members Identity (%) Alignment Length E-value
1 phalp2_10850 4hLdm 1 27,7% 108 1.688E-11
2 phalp2_10609 2cCGa 16 30,9% 84 1.677E-08
3 phalp2_20342 2Cqqu 18 30,3% 99 4.289E-08
4 phalp2_18707 p3aX 2 24,0% 100 7.612E-05
5 phalp2_29888 8eLad 5 27,0% 111 2.631E-04
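The pHMM hits can be loaded into records and filtered by E-value. This is a minimal sketch with the five hits copied from the table; the `Hit` type, the helper names, and the 1e-6 cutoff are illustrative assumptions rather than anything the database prescribes.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


# (cluster, members, identity, alignment length, E-value) as shown in the record.
RAW_HITS = [
    ("phalp2_10850", "1", "27,7%", "108", "1.688E-11"),
    ("phalp2_10609", "16", "30,9%", "84", "1.677E-08"),
    ("phalp2_20342", "18", "30,3%", "99", "4.289E-08"),
    ("phalp2_18707", "2", "24,0%", "100", "7.612E-05"),
    ("phalp2_29888", "5", "27,0%", "111", "2.631E-04"),
]


def parse_hit(row: tuple[str, str, str, str, str]) -> Hit:
    """Convert one raw row; identity uses a decimal comma and a % sign."""
    cluster, members, identity, aln_length, evalue = row
    identity_pct = float(identity.rstrip("%").replace(",", "."))
    return Hit(cluster, int(members), identity_pct, int(aln_length), float(evalue))


def significant(hits: list[Hit], max_evalue: float = 1e-6) -> list[Hit]:
    """Keep hits at or below the cutoff (1e-6 is an arbitrary example value)."""
    return [h for h in hits if h.evalue <= max_evalue]


hits = [parse_hit(row) for row in RAW_HITS]
for h in significant(hits):
    print(h.cluster, h.identity_pct, h.evalue)
```

With the example cutoff, only the first three clusters remain; all five stay if the cutoff is relaxed to 1e-3.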

Domains

Unannotated
Representative sequence (used for alignment): 5k0pl (124 AA)
Member sequence: 3Smx9 (126 AA)
Domain diagram axis: 1 to 124 AA (representative).
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage name: Unknown from Metagenome [NCBI]
Phage taxonomy ID: UNKNOWN_ENVHOG
Phage lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5k0pl) rather than this protein.
PDB ID: 5k0pl
Method: AlphaFold v2
Resolution: 94.97
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
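The confidence bands can be expressed as a small lookup. This is a sketch only: the function name `plddt_band` is illustrative, and because the legend uses strict inequalities, assigning the exact boundary values 90, 70, and 50 to the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band from the legend.

    Assumption: exact boundary values (90, 70, 50) fall into the lower band,
    since the legend does not say where they belong.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


print(plddt_band(94.97))
```

If the 94.97 shown for the AlphaFold model is a mean pLDDT (the record labels it as a resolution and does not say), it falls in the "Very high" band.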