Protein

Protein accession
808rH [EnVhog]
Representative
4PpCO
Source
EnVhog (cluster: phalp2_13494)
Protein name
808rH
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIDPDTGRFDWMEWIPGPPDRVYPGRNTLEFYIPHDMVGYYNGSGPPGVMLDPNRSASWPYTIGETKAFQHYPIWSPCWTSGSEYWNRRGIAPETVRTVRPLQDPAEPLTPIQVEMHLRAIHDIEEYRAGIGLPPIVFRRSLGTIKEHREVSPVPTSCPSGRMQPLYDALEVDDVTPEQVQQMIDAALERRDAEYAAAVGAKPIRHAQAVNQRLWAIHRDTDPKQVPGDNP
Physico‐chemical
properties
protein length:231 AA
molecular weight:26204,2 Da
isoelectric point:5,36
hydropathy:-0,62
Representative Protein Details
Accession
4PpCO
Protein name
4PpCO
Sequence length
219 AA
Molecular weight
24391,95440 Da
Isoelectric point
5,10951
Sequence
MSLEWAPFAVRRPGPAWKVGYSFVGEGGPKRGDVKHSAEGYWSGIYAVLQGNRRASWHFTVGFDRVEQHYPISAYCWHAGDVDDDGGVAANLDLVGIEHLGVAGQPLTPYQIEMTTEITRWCAEQNGYKRFARYPIQDGVWTIAEHNEVSDVPTSCPSGRIPHGLILGDLYNVEDDMADEETRKQWAEAQAIFERAGAYAAQGLPLPPDLLAQLRYLTR
Other Proteins in cluster: phalp2_13494
Total (incl. this protein): 19 Avg length: 228,1 Avg pI: 5,69

Protein ID Length (AA) pI
4PpCO 219 5,10951
16tQn 238 6,70054
1Eby6 227 5,89099
2ATpT 203 6,37093
2CTce 231 5,81567
2Saii 227 5,40672
44Rbw 218 5,44554
4JRNJ 211 6,09055
4JTKc 190 4,76938
4JUtR 222 5,34630
4OkgY 264 5,22466
4fS1I 230 6,24788
5k70h 210 6,00341
5k7bu 256 5,85518
5ka0e 198 5,41155
5klOm 247 5,23853
fg6Z 222 6,79791
iZfW 289 5,05187
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_34349
3YRKh
5 36,9% 157 1.593E-45
2 phalp2_3554
4DvvK
9 45,0% 171 4.083E-45
3 phalp2_38789
1lmMl
2 29,9% 214 1.557E-39
4 phalp2_12419
gjha
11 32,9% 188 2.454E-32
5 phalp2_4479
3gnNS
19 27,0% 174 1.419E-25
6 phalp2_33973
40pAF
5 31,3% 185 1.342E-20
7 phalp2_22310
9onk
1 27,0% 207 1.174E-17
8 phalp2_14594
5ioYm
94 27,2% 220 1.595E-17
9 phalp2_20257
8iV4A
2 28,1% 220 7.394E-17
10 phalp2_9423
jh6D
5 27,3% 212 2.458E-14

Domains

Domains
Representative sequence (used for alignment): 4PpCO (219 AA)
Member sequence: 808rH (231 AA)
1 219 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
808rH
Method AlphaFoldv2
Resolution 83.74
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4PpCO) rather than this protein.
PDB ID
4PpCO
Method AlphaFoldv2
Resolution 92.80
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50