Protein

Protein accession
6STZU [EnVhog]
Representative
4OaGo
Source
EnVhog (cluster: phalp2_11011)
Protein name
6STZU
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VPAELRAAAEGGRVRPRILRRDSSGADVAAVQDLLRARGHDPGPSDGIFGRKTEAAARAFQAARGLHADGIVGPQTRAELADEATPPQAERRRVEKVHTPLSEVGLANALARGHEAVFGAQPTRSRLAVAWAHVALENGRGQHVYCNNFGNITGFRWQGSVYVIRVQERIEQNPDRWEWRDMLFRAHGTPQAGSADYWRVMSKTYEDALPLFDAGRPYDAALKLGELVWFTETPDQYAHRMTGVYRSFPG
Physico‐chemical
properties
protein length:250 AA
molecular weight:27676,6 Da
isoelectric point:9,16
hydropathy:-0,54
Representative Protein Details
Accession
4OaGo
Protein name
4OaGo
Sequence length
233 AA
Molecular weight
25599,57430 Da
Isoelectric point
9,12919
Sequence
VRPRVLRSGLSGVDVVTLQTTLRARGHDPGPADGKFGPRTNAAVRAFQASRGLHADGIVGPQTRAELAVESTPERRRVPKTHTPLSAAGLMNALAWGHEAYFGVQPTKARLSVAWAHVAIENGRGAELYCNNFGNITGFRWPGAVYAITVEERDIETNEWAPKEMLFRAHGTPQSGSASYWDVLHKDYGEAIPLFDLGRPFEAALKLGELVWFTETPEQYASRMTRVHRAFPG
Other Proteins in cluster: phalp2_11011
Total (incl. this protein): 3 Avg length: 241,3 Avg pI: 9,12

Protein ID Length (AA) pI
4OaGo 233 9,12919
2YGQl 241 9,07588
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_38073
6x3xX
3 28,5% 235 3.707E-29
2 phalp2_38125
6QfkX
1 33,5% 170 2.097E-27
3 phalp2_23191
4Qw94
23 32,2% 152 4.060E-25
4 phalp2_6302
6URI3
3 32,4% 151 6.758E-20
5 phalp2_24286
3gmlB
71 28,4% 158 1.674E-17
6 phalp2_721
1J3zP
1 27,2% 154 2.481E-14
7 phalp2_30410
4RtBC
33 26,8% 164 4.614E-11

Domains

Domains
PG_1
Unannotated
Representative sequence (used for alignment): 4OaGo (233 AA)
Member sequence: 6STZU (250 AA)
1 233 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
6STZU
Method AlphaFoldv2
Resolution 89.77
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4OaGo) rather than this protein.
PDB ID
4OaGo
Method AlphaFoldv2
Resolution 92.60
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50