Protein

Protein accession
5B2QT [EnVhog]
Representative
4osFp
Source
EnVhog (cluster: phalp2_31597)
Protein name
5B2QT
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MALYSGAAQMLLPESATQPRIRPVLVVLHTNGGNNTLESSYRYWQGADTEAHFQLECGPPNGRGRLGQYIDTGTRADSQSAANSFYRAGVLCGAISIETSDLGSPWEKSWTDLGQRQTLEDWLVWACATHGIPPVLAYADPAGGWTGIGYHHQVPSWSNAAHICPGPGKIREVPNLIAAVAARLKEEPMTPEDRAYLDAKFAAVAPYLSDPSLPDWPTFGTGQAIGDTRAFAQMNNQALARIEALLAGVDLDAIRQAVIDTIEGMTVGATLDAVTIAAIAAATVTKFAETVNLHPVVLA
Physico‐chemical
properties
protein length:299 AA
molecular weight:31862,5 Da
isoelectric point:5,01
hydropathy:-0,05
Representative Protein Details
Accession
4osFp
Protein name
4osFp
Sequence length
267 AA
Molecular weight
28800,85360 Da
Isoelectric point
5,49630
Sequence
VARYPKARWLPVTGLSNDPVIIPVGVILHIDGGNASSLYEYFNGPSGGIESTLFVNKQGNWEQFRDTTNEADANADGNSWIGSDGKRYGFNSVETQGTCNDSGWNDIQLKELAEFLAWHHEVHGTKLVLATGSHGHGVGYHKQFPGWNPNGHECPCDSRVAQIPELLRRANDIVNSAGLEIIPEDVNMRIVRHVGDNKVFLVIPGHSVTWVTNPATAQGLCVITGQDYNNLPVVNAATLTSVPVFDQNALVNAIAAKLAPPAPPPSV
Other Proteins in cluster: phalp2_31597
Total (incl. this protein): 9 Avg length: 276,2 Avg pI: 5,86

Protein ID Length (AA) pI
4osFp 267 5,49630
11z2u 286 8,42358
2ZAQt 285 5,33044
4EFKo 253 4,73465
4EUUc 229 4,54493
4Ne0L 289 5,00288
4lCvN 293 9,40918
umF9 285 4,74983
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23309
5kneL
13 47,3% 169 1.496E-65
2 phalp2_18282
4XixX
11 41,8% 172 1.803E-58
3 phalp2_28756
4LkIh
22 30,5% 226 7.933E-41
4 phalp2_20891
6SNpy
16 32,6% 205 2.756E-40
5 phalp2_14594
5ioYm
94 34,3% 221 6.189E-39
6 phalp2_12974
3gmT9
125 32,9% 194 3.999E-38
7 phalp2_23139
4Fkvn
15 32,3% 195 2.086E-33
8 phalp2_15963
4U1uS
1 30,2% 185 5.472E-31
9 phalp2_16991
80xHs
2 29,7% 259 1.883E-30
10 phalp2_2602
6IbUl
53 35,2% 176 4.756E-30

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4osFp (267 AA)
Member sequence: 5B2QT (299 AA)
1 267 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5B2QT
Method AlphaFoldv2
Resolution 73.80
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4osFp) rather than this protein.
PDB ID
4osFp
Method AlphaFoldv2
Resolution 77.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50