Protein

Protein accession
4Sq2A [EnVhog]
Representative
89E7V
Source
EnVhog (cluster: phalp2_28607)
Protein name
4Sq2A
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MFMRTNSSKGSLPVVVLGSAAIGVLLVGLFRNRGTRGREADDFPSAPVFTQNRVKVDDYGSLSPSDSRLVEVPGINSRTIVVHSEMADRLNALIRDAGSDGFPDIRIASGHRPHRWESWDQYAEAMRREYGSAEEGRQWRAYKSPHETGLAVDFGSHGLYPSRSGQGPFGLSNEQQKQTPFYKWLKANAHKYGITPYKLEAWHWEARIPYKDWV
Physico-chemical properties
protein length: 214 AA
molecular weight: 24137,7 Da
isoelectric point: 9,30
hydropathy: -0,63
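The length and hydropathy values above can be cross-checked directly from the protein sequence. The sketch below is illustrative only: it assumes that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue), which the page does not state, and the names `KYTE_DOOLITTLE`, `SEQUENCE` and `gravy` are introduced here for the example.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4Sq2A as listed in the record, split into 50-residue chunks.
SEQUENCE = (
    "MFMRTNSSKGSLPVVVLGSAAIGVLLVGLFRNRGTRGREADDFPSAPVFT"
    "QNRVKVDDYGSLSPSDSRLVEVPGINSRTIVVHSEMADRLNALIRDAGSD"
    "GFPDIRIASGHRPHRWESWDQYAEAMRREYGSAEEGRQWRAYKSPHETGL"
    "AVDFGSHGLYPSRSGQGPFGLSNEQQKQTPFYKWLKANAHKYGITPYKLE"
    "AWHWEARIPYKDWV"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


if __name__ == "__main__":
    print("length:", len(SEQUENCE), "AA")
    print("GRAVY:", round(gravy(SEQUENCE), 2))
```

Under that assumption the sequence is 214 residues long and the score should come out close to the reported -0,63. Molecular weight and isoelectric point depend on the mass table and pKa set used, so they are left out of this sketch.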
Representative Protein Details
Accession
89E7V
Protein name
89E7V
Sequence length
164 AA
Molecular weight
18555,03990 Da
Isoelectric point
10,36879
Sequence
MRVKVADYGRLPRARTVPCESRRLHPAAAPLLSRMLDAVALALGERPRLASAWRAHRWRDRAHYETFLVQRYGSVAKGRRWLAFDSPHETGLAVDFGSLGLRPSSATADAQRRSPLYLWLLANAAGFGWHPYLAEPWHWECPLPRDVWAGVRDLSPGDGPPWKL
Other Proteins in cluster: phalp2_28607
Total (incl. this protein): 12; Avg length: 183,3; Avg pI: 7,78

Protein ID Length (AA) pI
89E7V 164 10,36879
1iqSu 174 9,81707
1l8Bu 179 8,81942
2Tw76 208 7,00816
4I0k9 162 5,68307
5FdaI 171 8,92850
5Idfp 204 5,65084
5yRSW 186 11,09006
6PCZP 200 5,49340
6PD28 190 6,07196
6Woz8 147 5,14418
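The cluster averages can be reproduced from the member table plus this protein (4Sq2A, 214 AA, pI 9,30), which is counted in the total but not listed as a row. The following is a minimal sketch, assuming a plain arithmetic mean and decimal-comma number formatting; `parse_decimal`, `CLUSTER_ROWS`, `THIS_PROTEIN` and `cluster_averages` are hypothetical names used only for this example.

```python
def parse_decimal(text: str) -> float:
    """Parse a number written with a decimal comma, e.g. '10,36879'."""
    return float(text.replace(",", "."))


# Rows copied from the table above: protein ID, length (AA), pI.
CLUSTER_ROWS = """\
89E7V 164 10,36879
1iqSu 174 9,81707
1l8Bu 179 8,81942
2Tw76 208 7,00816
4I0k9 162 5,68307
5FdaI 171 8,92850
5Idfp 204 5,65084
5yRSW 186 11,09006
6PCZP 200 5,49340
6PD28 190 6,07196
6Woz8 147 5,14418
"""

# The protein described on this page, included in the cluster total.
THIS_PROTEIN = ("4Sq2A", 214, "9,30")


def cluster_averages(rows_text: str, extra: tuple) -> tuple:
    """Return (member count, mean length, mean pI) for the table plus one extra member."""
    lengths = [extra[1]]
    pis = [parse_decimal(extra[2])]
    for line in rows_text.strip().splitlines():
        _protein_id, length, pi = line.split()
        lengths.append(int(length))
        pis.append(parse_decimal(pi))
    count = len(lengths)
    return count, sum(lengths) / count, sum(pis) / count


if __name__ == "__main__":
    count, avg_length, avg_pi = cluster_averages(CLUSTER_ROWS, THIS_PROTEIN)
    print(count, avg_length, avg_pi)
```

With these inputs the mean length is 183.25 AA and the mean pI is roughly 7.78 over 12 members, consistent with the summary line.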
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_31915 (4NgvU) 6 29,5% 132 8.325E-09
2 phalp2_29849 (3NxxR) 10 27,8% 133 8.325E-09
3 phalp2_29962 (8rfJR) 9 27,5% 174 1.314E-07
4 phalp2_36221 (7r53d) 5 29,1% 151 3.287E-07
5 phalp2_9358 (bQoR) 17 30,2% 142 4.461E-07
6 phalp2_2044 (2zxzD) 1 26,8% 123 6.053E-07
7 phalp2_3004 (1EbtH) 1 30,0% 123 9.357E-06
8 phalp2_2020 (2jgvd) 6 19,5% 184 4.740E-04

Domains

Unannotated
Representative sequence (used for alignment): 89E7V (164 AA)
Member sequence: 4Sq2A (214 AA)
Axis: positions 1 to 164 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (89E7V) rather than this protein.
PDB ID: 89E7V
Method: AlphaFold v2
Resolution: 94.53
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
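The confidence bands above amount to a simple threshold lookup. The sketch below shows one way to express it; `plddt_band` is a hypothetical helper, and because the legend uses strict inequalities on both sides, the handling of the exact boundary values (90, 70, 50) is an assumption: here they fall into the lower band.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) go to the lower band,
    since the legend does not say which side they belong to.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (94.53, 82.0, 61.5, 35.0):
        print(score, plddt_band(score))
```

If the 94.53 shown for the 89E7V model is read as a mean pLDDT, which the page does not state explicitly, it would fall in the "Very high" band.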