Protein

Protein accession
16GME [EnVhog]
Representative
4FhUy
Source
EnVhog (cluster: phalp2_21892)
Protein name
16GME
Lysin probability
67%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MASSGYNDWVRAGRPYSLIRPARDIQRNTRAHGFTVYDYPNDEHLKAANPQDHTPFSVTGWPYTSQERVRWRGRALDVMPRGDSAAARKEIANLARQIIRDKDAQAGGSQDPGVQGTKAIKYMNWTDENGTCRQENWKTGRRVTTSSTDKGHIHLSFRDDMDESNTAASYDPWARMNGQPAPQQEDDMPFVAKDGNTGQYYVCDMITSRAVPKEGVDDVLYLARQLNYGHGTPGAEWTDVVDGVGWTRLGWSEKVFGTLQGTVKVTVEADAEMVKEGTLQALQSAEGQTALTHAAETAEDS
Physico‐chemical
properties
protein length:301 AA
molecular weight:33403,4 Da
isoelectric point:5,43
hydropathy:-0,81
Representative Protein Details
Accession
4FhUy
Protein name
4FhUy
Sequence length
288 AA
Molecular weight
31444,71150 Da
Isoelectric point
5,24518
Sequence
MASQAYYDWKAAGEPYTRARPTLEFLQMLRGHGYTVYDYPDIDHLTAEPPEDHTPFSATGWPIASKRWVGHAVDVMPPAVSSALPTLPNLARQIIKDKDAKVHGTEWIKYMNWTDEDGICRKERWWLDGSRTTTSSTDKGHVHISGRSDMDTSDVVSASGWDPVARLRGTDMSLTPAEHNTQIAIDARVRTLMFDADFADFQLLNSDGTPGETRHEANKSKAARAAIAADVAELKARPPVTFTAEQIAALANAIAAELIASDANALTMADHDGIVADVKSALREGSAT
Other Proteins in cluster: phalp2_21892
Total (incl. this protein): 9 Avg length: 295,2 Avg pI: 5,43

Protein ID Length (AA) pI
4FhUy 288 5,24518
5nGM6 279 5,79811
6AXKt 321 5,46827
6FAWR 316 5,63010
7Fum4 292 5,49868
hKsa 292 5,60082
lEFW 285 4,94342
trdU 283 5,21483
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_17915
QMjJ
3 45,4% 218 8.015E-70
2 phalp2_9856
YoF6
7 44,3% 185 5.456E-58
3 phalp2_21548
4Maun
13 36,8% 179 2.418E-24
4 phalp2_36414
SkOD
1 29,8% 181 2.668E-18

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 4FhUy (288 AA)
Member sequence: 16GME (301 AA)
1 288 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
16GME
Method AlphaFoldv2
Resolution 84.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4FhUy) rather than this protein.
PDB ID
4FhUy
Method AlphaFoldv2
Resolution 78.18
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50