Protein

Protein accession
4AAWy [EnVhog]
Representative
100RI
Source
EnVhog (cluster: phalp2_11551)
Protein name
4AAWy
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VIEYTKSSPNWSPRPQGLNDVWGVLLHHTATGGDTGQAVANFFSKRASEVSAHDVIDEHGVVWHCVSIEKAAWHAGQCRRYDWDRDGIHEDWEQYVNSHTIGIEFCNTGSTSDNYPPAQIKSAAQLIRRWDAKCPNLKLRNVTDHQAVNLNGKVDVKSNFPAALLFWYILHPTMAPPDNNIYHALPGWARLQVDEIKK
Physico-chemical properties
Protein length: 198 AA
Molecular weight: 22377,7 Da
Isoelectric point: 6,42
Hydropathy: -0,52
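The length and hydropathy values above can be rechecked directly from the sequence. The following is a minimal Python sketch, assuming that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue). The record does not state which scale was used, so a small mismatch with -0,52 would not by itself indicate an error. The function name `gravy` is illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the record does not say which scale produced its hydropathy value.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4AAWy as listed in this record, split into chunks for readability.
SEQ = (
    "VIEYTKSSPNWSPRPQGLNDVWGVLLHHTATGGDTGQAVANFFSKRASEV"
    "SAHDVIDEHGVVWHCVSIEKAAWHAGQCRRYDWDRDGIHEDWEQYVNSHT"
    "IGIEFCNTGSTSDNYPPAQIKSAAQLIRRWDAKCPNLKLRNVTDHQAVNL"
    "NGKVDVKSNFPAALLFWYILHPTMAPPDNNIYHALPGWARLQVDEIKK"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


print("length:", len(SEQ))
print("GRAVY:", round(gravy(SEQ), 2))
```

A residue outside the 20 standard amino acids raises `KeyError`, which is deliberate here: silently skipping it would change the average.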
Representative Protein Details
Accession
100RI
Protein name
100RI
Sequence length
200 AA
Molecular weight
22524,32750 Da
Isoelectric point
9,48770
Sequence
MTLINKKIWSPNWSNRPNGIRDVYGVILHHTASPGDSAIGVAQYLARPSTEASAHLIVGDNGYTIQGVSLERAAWHAGTARYDFNHDGRISPSERYVNTHTIGIEQCNTGKLSDNYPNLQIKRVAQLILWLDRKCPNLSLRNITDHEAVNLRGKVDVGANYPAAKLFWYILHPKSTMQPPKDVYAQLPKWAQRQCAEIKR
Other Proteins in cluster: phalp2_11551
Total (incl. this protein): 7
Avg length: 198,9
Avg pI: 8,10

Protein ID Length (AA) pI
100RI 200 9,48770
4DloT 198 7,73518
4G6FW 198 7,20397
4NoCm 196 8,33997
6QVKa 198 9,03971
otTg 204 8,44653
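The cluster averages can be checked against the member table. The sketch below assumes the seventh member is this protein (4AAWy, 198 AA, pI 6,42), which the table itself omits, and converts the decimal commas before averaging. The helper `to_float` is a hypothetical name.

```python
def to_float(text: str) -> float:
    """Convert a decimal-comma number such as '9,48770' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed in the record)
members = [
    ("100RI", 200, "9,48770"),
    ("4DloT", 198, "7,73518"),
    ("4G6FW", 198, "7,20397"),
    ("4NoCm", 196, "8,33997"),
    ("6QVKa", 198, "9,03971"),
    ("otTg", 204, "8,44653"),
    ("4AAWy", 198, "6,42"),  # assumption: this protein is the seventh member
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(to_float(pi) for _, _, pi in members) / len(members)

print(f"Avg length: {avg_length:.1f}")  # 198.9
print(f"Avg pI: {avg_pi:.2f}")          # 8.10
```

Both results agree with the reported averages, which supports the reading that the totals include 4AAWy.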
Similar Clusters (pHMM search)
#   Cluster        Protein ID  # Members  Identity (%)  Alignment Length  E-value
1   phalp2_11760   45aqa       242        32,4%         157               3.413E-22
2   phalp2_18899   1Z9i0       2          27,3%         168               3.865E-14
3   phalp2_31152   3LHkO       133        23,8%         197               4.483E-13
4   phalp2_38722   YdMl        4          26,9%         193               1.122E-12
5   phalp2_17296   4dV4i       8          28,1%         153               2.804E-12
6   phalp2_3939    7bfSL       153        23,8%         180               3.805E-12
7   phalp2_39806   zxOV        15         28,0%         150               7.003E-12
8   phalp2_25288   86peP       187        27,0%         137               3.210E-11
9   phalp2_35209   6wYEn       2          26,5%         143               4.351E-11
10  phalp2_38674   87xKu       58         26,0%         146               1.987E-10
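The hits above can be loaded into typed records by splitting the data rows (header excluded) on whitespace and reading seven fields per hit, which works whether a hit occupies one line or several. This is a sketch only, assuming the field order rank, cluster, protein ID, member count, identity, alignment length, E-value. `Hit` and `parse_hits` are hypothetical names, and the sample holds just the first two hits.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    rank: int
    cluster: str
    protein_id: str
    members: int
    identity_pct: float
    alignment_length: int
    evalue: float


def parse_hits(text: str) -> List[Hit]:
    """Parse whitespace-separated hit fields, seven tokens per hit."""
    tokens = text.split()
    if len(tokens) % 7 != 0:
        raise ValueError("expected seven fields per hit")
    hits = []
    for i in range(0, len(tokens), 7):
        rank, cluster, pid, members, identity, aln, evalue = tokens[i:i + 7]
        hits.append(Hit(
            rank=int(rank),
            cluster=cluster,
            protein_id=pid,
            members=int(members),
            # Identity is printed with a decimal comma and a percent sign.
            identity_pct=float(identity.rstrip("%").replace(",", ".")),
            alignment_length=int(aln),
            evalue=float(evalue),
        ))
    return hits


SAMPLE = """
1 phalp2_11760
45aqa
242 32,4% 157 3.413E-22
2 phalp2_18899
1Z9i0
2 27,3% 168 3.865E-14
"""

hits = parse_hits(SAMPLE)
for hit in hits:
    print(hit.cluster, hit.identity_pct, hit.evalue)
```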

Domains

Representative sequence (used for alignment): 100RI (200 AA)
Member sequence: 4AAWy (198 AA)
Axis: positions 1 to 200 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (100RI) rather than this protein.
PDB ID: 100RI
Method: AlphaFold v2
Resolution: 95.05
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
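The confidence bands above can be expressed as a small lookup. This is a sketch under two assumptions: the legend does not say where scores of exactly 90, 70, or 50 belong, so they are placed in the lower band here, and treating the 95.05 listed under "Resolution" as a mean pLDDT is a guess rather than something the record states. The function name `confidence_band` is hypothetical.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the band names used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


# Assumption: 95.05 is read here as a mean pLDDT for the 100RI model.
print(confidence_band(95.05))
```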