Protein

Protein accession
2LSJY [EnVhog]
Representative
3gq1r
Source
EnVhog (cluster: phalp2_10728)
Protein name
2LSJY
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSNGEQPTNGQDKSYIADAINPVEADAELLAKVFFAEDAMGGTEAWMGIGNVAINRLRDGRYGKDLKTVLNKMSSAISTNSPQWQKVNSGEMNPFENMALKRMKEAAAQVLSEDNQDNTNGATLFENIDKFGFPKTWDKSKVEAVKKIGRHTYFKEK
Physico‐chemical
properties
protein length:157 AA
molecular weight:17440,4 Da
isoelectric point:5,91
hydropathy:-0,70
Representative Protein Details
Accession
3gq1r
Protein name
3gq1r
Sequence length
157 AA
Molecular weight
17421,54390 Da
Isoelectric point
9,09683
Sequence
MNGEGQVYTADAINPLDVDSTLMAKMIFAEDAGGGPQAWIAVGNAALNRLKSGRYGKSLSQVIKGMSSAIQTKSPQWQKADNLEFNDFEQRVFNKIKDVTDGLVSGKIPDTIKGATHFENLNRFPLPYWAKDMDAVARVGRHTYFREKPRLDTKKGE
Other Proteins in cluster: phalp2_10728
Total (incl. this protein): 5 Avg length: 161,6 Avg pI: 7,96

Protein ID Length (AA) pI
3gq1r 157 9,09683
4hQW6 165 9,17574
fdCR 166 8,77713
iyKg 163 6,85259
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21204
16ZPT
8 39,0% 123 5.013E-19
2 phalp2_11411
d7uy
20 29,4% 136 1.514E-14
3 phalp2_34191
2Ld5M
16 23,5% 140 5.606E-12
4 phalp2_19965
6LklU
2 29,1% 137 7.648E-12
5 phalp2_15439
40vWj
11 32,8% 140 1.423E-11
6 phalp2_4552
3YPbL
2 33,1% 148 3.610E-11
7 phalp2_38160
6XLXK
8 30,8% 146 9.440E-09
8 phalp2_5684
4E1ry
21 28,4% 137 2.379E-08
9 phalp2_18656
8z3RS
5 28,2% 117 3.236E-08
10 phalp2_12064
5eok8
21 26,4% 151 2.780E-07

Domains

Domains
Representative sequence (used for alignment): 3gq1r (157 AA)
Member sequence: 2LSJY (157 AA)
1 157 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
2LSJY
Method AlphaFoldv2
Resolution 90.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (3gq1r) rather than this protein.
PDB ID
3gq1r
Method AlphaFoldv2
Resolution 89.90
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50