Protein

Protein accession
2SUi2 [EnVhog]
Representative
1Is7j
Source
EnVhog (cluster: phalp2_29998)
Protein name
2SUi2
Lysin probability
53%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MHRTLVGLFALLLIGSTSRGTFEERLRDVALSQPSPWYEPGSTALGALETPDQYAERVTMATRELALATDRDKPPEWRWGRKALAVAVLAHWYEESRFALEVHEGIEHPVWTQDKGRARCLGQLHVGLVPEYEWQRLAGLDEAATRRCALWTARALTRMAGYCRGDKTRLQDVLVPMFSGLGGGGCASTASGRAKAARFAKMWREVER
Physico‐chemical
properties
protein length:208 AA
molecular weight:23267,3 Da
isoelectric point:9,00
hydropathy:-0,34
Representative Protein Details
Accession
1Is7j
Protein name
1Is7j
Sequence length
160 AA
Molecular weight
17248,32010 Da
Isoelectric point
7,30190
Sequence
LLTVASILVAALSLSSPHGTGESAEDYVVRLDTIARAIVAESDSTDEALAVLVLWHSESKFDPLIHAGEPHPLWHQDHGRARCGLQVHRSRLIPDWDAITGTHLAATRRCVAAGLRVLRHGLTVCGRGYVGRERMARGFQAYASGHCGAPSAESLRRSAE
Other Proteins in cluster: phalp2_29998
Total (incl. this protein): 5 Avg length: 197,2 Avg pI: 8,15

Protein ID Length (AA) pI
1Is7j 160 7,30190
3Q82W 238 9,12906
4Cq1d 184 9,33465
6TQVi 196 5,98449
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26946
4DlSe
19 38,8% 170 1.045E-30
2 phalp2_13141
2Rtef
18 39,0% 146 2.210E-25
3 phalp2_5327
2aY9Y
3 27,4% 164 2.120E-11
4 phalp2_17689
6XyCu
13 28,5% 168 2.202E-07
5 phalp2_27047
7ZSka
2 22,3% 152 2.540E-06
6 phalp2_917
8hAuw
31 29,6% 179 1.164E-05
7 phalp2_7095
2bdJA
18 22,0% 181 1.164E-05
8 phalp2_40451
4kzcC
17 28,8% 125 2.138E-05
9 phalp2_16466
6IGgW
27 25,8% 147 2.138E-05
10 phalp2_10980
4SQC9
24 31,9% 166 3.922E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 1Is7j (160 AA)
Member sequence: 2SUi2 (208 AA)
1 160 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
2SUi2
Method AlphaFoldv2
Resolution 89.04
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1Is7j) rather than this protein.
PDB ID
1Is7j
Method AlphaFoldv2
Resolution 93.99
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50