Protein

Protein accession: 81rS1 [EnVhog]
Representative: 89iBG
Source: EnVhog (cluster: phalp2_31187)
Protein name: 81rS1
Lysin probability: 94%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MKKQNFILISAIITLIIITKPVIRKINKVILTTSKKDFINSIKKEVEQIGLQIGVPYKFMIAQIILETGWGKSTLFSKYYNVGGIKAVKGQKFISLPTIEYIKGVKTNLNQNFAVYPDLKSGFIAYSKILQNRYFKKYLNKTTNPIEYAKLLQSGEPKYATDINYISKIENLINQINTII
Physico-chemical properties
protein length: 180 AA
molecular weight: 20575,2 Da
isoelectric point: 9,90
hydropathy: -0,01
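The properties above can be approximated directly from the sequence. The following is a minimal sketch, assuming average residue masses plus one water molecule for the molecular weight and the Kyte-Doolittle scale for hydropathy (GRAVY); the record does not state which method the database uses, so small differences from the listed values are possible. The function names are illustrative only.

```python
# Sketch: sequence-derived properties for 81rS1.
# Assumptions (not stated in the record): average residue masses + one water
# for molecular weight, Kyte-Doolittle scale for hydropathy (GRAVY).

SEQ = (
    "MKKQNFILISAIITLIIITKPVIRKINKVILTTSKKDFINSIKKEVEQIGLQIGVPYKFM"
    "IAQIILETGWGKSTLFSKYYNVGGIKAVKGQKFISLPTIEYIKGVKTNLNQNFAVYPDLK"
    "SGFIAYSKILQNRYFKKYLNKTTNPIEYAKLLQSGEPKYATDINYISKIENLINQINTII"
)

# Average residue masses in Da (amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini

# Kyte-Doolittle hydropathy values.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water, in Da."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"protein length: {len(SEQ)} AA")
    print(f"molecular weight: {molecular_weight(SEQ):.1f} Da")
    print(f"hydropathy: {gravy(SEQ):.2f}")
```

The isoelectric point is omitted here because it depends on the chosen pKa set, which the record does not specify.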
Representative Protein Details
Accession: 89iBG
Protein name: 89iBG
Sequence length: 189 AA
Molecular weight: 21426,99790 Da
Isoelectric point: 9,77510
Sequence
MINDKKKYIYLGGLIFLYFTLNKKSDKQSEVKTKNKTKLTEKQINFINSILPASKTIEKQIGVPYQFIIAQICLESGFGKSSLTSKYFNFGGIKALKNLPSVRLLTTECKAGVCKKVYQNFAVFKTALEGLMAQAKIYSNKYFKKYLNKTTNPYEYVKLLQSGDVKYATSPTYVKDISKLIDLVVKAGF
Other Proteins in cluster: phalp2_31187
Total (incl. this protein): 9; Avg length: 194,4 AA; Avg pI: 8,65

Protein ID Length (AA) pI
89iBG 189 9,77510
1Cw55 217 9,54998
5t24j 206 9,33807
6Dg7f 195 5,19084
6SR1G 195 5,94987
7y7NT 202 8,50056
89iz7 177 9,77471
8njXv 189 9,88837
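The cluster averages can be reproduced from the rows above together with this protein (81rS1, 180 AA, pI 9,90), which the table omits but the total of 9 includes. A minimal sketch, assuming plain arithmetic means and converting the decimal commas used in this record; the helper name `parse_decimal` is illustrative only.

```python
# Sketch: recompute the cluster averages for phalp2_31187.
# Assumption: the listed averages are plain arithmetic means over all 9 members.

def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '9,77510' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as written in the record)
other_members = [
    ("89iBG", 189, "9,77510"),
    ("1Cw55", 217, "9,54998"),
    ("5t24j", 206, "9,33807"),
    ("6Dg7f", 195, "5,19084"),
    ("6SR1G", 195, "5,94987"),
    ("7y7NT", 202, "8,50056"),
    ("89iz7", 177, "9,77471"),
    ("8njXv", 189, "9,88837"),
]
this_protein = ("81rS1", 180, "9,90")

members = other_members + [this_protein]
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

if __name__ == "__main__":
    print(f"members: {len(members)}")
    print(f"avg length: {avg_length:.1f} AA")
    print(f"avg pI: {avg_pi:.2f}")
```

Note that two members (6Dg7f and 6SR1G) have acidic pI values, which pulls the cluster mean well below the pI of this protein.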
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_5784 7N5g9 18 29,8% 161 9.186E-34
2 phalp2_37638 4fbgK 5 34,5% 142 1.245E-30
3 phalp2_5346 2cQBB 648 29,4% 139 3.187E-30
4 phalp2_28108 7xE0g 17 32,6% 138 1.116E-29
5 phalp2_374 7enmQ 47 30,8% 149 6.535E-28
6 phalp2_34505 4LEw1 3 27,0% 148 2.214E-24
7 phalp2_2210 4f9X6 94 32,3% 139 5.648E-24
8 phalp2_23271 4Y2T6 95 27,8% 151 1.441E-23
9 phalp2_8561 1Ibzf 44 28,8% 149 1.968E-23
10 phalp2_16906 24Pp0 1 29,2% 147 6.076E-22

Domains

Annotated features: Disordered region, GLUCO
Representative sequence (used for alignment): 89iBG (189 AA), axis 1 to 189 AA
Member sequence: 81rS1 (180 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01832

Taxonomy

Phage: Unknown from Metagenome (Taxonomy ID: UNKNOWN_ENVHOG; no lineage information)
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID 81rS1
Method AlphaFoldv2
Resolution 83.91
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
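
The confidence bands in the legend can be expressed as a small lookup. This is a minimal sketch, not the site's own logic: the legend uses strict inequalities and leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band is an assumption made here. The function name `plddt_band` is hypothetical. The record also does not say that the "Resolution" value is a pLDDT score; it is used in the usage line only because it lies on the same 0 to 100 scale.

```python
# Sketch: map a pLDDT score (0-100) to the confidence bands in the legend.
# Assumption: boundary values 90, 70 and 50 fall into the lower band.

def plddt_band(score: float) -> str:
    """Return the legend label for a pLDDT score."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is defined on the range 0 to 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Assumption: treating the listed value 83.91 as a mean pLDDT.
    print(plddt_band(83.91))
```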

The structures below correspond to the cluster representative (89iBG) rather than this protein.
PDB ID 89iBG
Method AlphaFoldv2
Resolution 83.17
Chain position -