Protein

Protein accession: 23ePP [EnVhog]
Representative: 23ePP (this protein)
Source: EnVhog (cluster: phalp2_21372)
Protein name: 23ePP
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MKPRKNKPQAGNKYYISTKDGGYNGAEGNPLRRNKDLTALPNCVAIYGWFNEVGENGMQYLKAPWYPWAVIDAAKREGLTVTVEPTVGGIMVWTGGKTGDGHVEGCVSFYEDGSILTASSEYYGRDWVNFHRCKGDGNWRDGCYWMDKSYTYRGCIKNPFVEEKPMTYEQFKDYMNRYLREIANAPADAWAVPAIEYCKEHGLMVGDESGNFKPQSPVRREELAAVIKGLTE
Physico-chemical properties
protein length: 232 AA
molecular weight: 26218.2 Da
isoelectric point: 6.35
hydropathy: -0.66
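
The page does not say how the hydropathy value is computed. A common choice is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues; the sketch below assumes that definition, so its result may differ slightly from the figure reported here. The names `KYTE_DOOLITTLE`, `SEQUENCE`, and `gravy` are illustrative and not part of PhaLP.

```python
# Kyte-Doolittle hydropathy scale (per-residue values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 23ePP as listed in this record.
SEQUENCE = (
    "MKPRKNKPQAGNKYYISTKDGGYNGAEGNPLRRNKDLTALPNCVAIYGWFNEVGENGMQYLKAPWYPWAVIDAAKREGLTVTVEPTVGGIMVWTGGKTGDGHVEGCVSFYEDGSILTASSEYYGRDWVNFHRCKGDGNWRDGCYWMDKSYTYRGCIKNPFVEEKPMTYEQFKDYMNRYLREIANAPADAWAVPAIEYCKEHGLMVGDESGNFKPQSPVRREELAAVIKGLTE"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: only the 20 standard amino acids occur; any other
    character raises KeyError.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print("length:", len(SEQUENCE), "AA")
    print("hydropathy (GRAVY):", round(gravy(SEQUENCE), 2))
```

Molecular weight and isoelectric point depend on the residue masses and pKa set chosen, so they are left out of this sketch.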
Other Proteins in cluster: phalp2_21372
Total (incl. this protein): 32   Avg length: 246.7   Avg pI: 6.93

Protein ID   Length (AA)   pI
21JG4        252           5.34516
21JpN        215           5.11314
21kwh        227           7.48691
21y8M        241           7.46491
23BdB        252           5.13184
23D76        248           5.30299
23K7x        252           8.72884
23hXN        246           7.50840
23weA        247           5.64755
38Lhm        261           7.53903
38OKK        245           5.65482
3VJuw        247           5.65925
3ZJrs        254           8.01589
3ZLhA        253           7.50720
3ZO0w        256           8.06198
3fO6b        244           6.60864
3gKLQ        242           5.54040
3iXto        252           7.50760
3vekq        250           8.52893
40aaD        249           8.17654
40dzd        253           8.56716
40kXg        244           6.74533
40lBH        256           8.55549
4Mfhj        235           8.77810
4OfOQ        249           8.17080
71jff        247           5.92799
7DOeb        247           5.95743
813kv        246           8.34796
81MzN        252           5.45975
87zOZ        256           5.48243
8mj1H        243           6.91716
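
The cluster summary (average length and average pI) can be rechecked from the member rows. The sketch below assumes the averages are plain arithmetic means over all members, including this protein; the page does not state this. `parse_row` and `cluster_averages` are hypothetical helper names, and the parser accepts either a decimal comma or a decimal point in the pI column.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one 'Protein ID  Length  pI' data row into typed fields."""
    protein_id, length, pi = line.split()
    # Accept both '5,34516' and '5.34516'.
    return protein_id, int(length), float(pi.replace(",", "."))


def cluster_averages(rows: list[str]) -> tuple[float, float]:
    """Return (mean length, mean pI) over the given data rows.

    Assumption: the header row has already been removed.
    """
    parsed = [parse_row(row) for row in rows if row.strip()]
    if not parsed:
        raise ValueError("no data rows")
    return mean(p[1] for p in parsed), mean(p[2] for p in parsed)


if __name__ == "__main__":
    # Two rows from the table plus this protein; extend with the rest
    # of the table to recheck the reported cluster averages.
    rows = ["21JG4 252 5,34516", "21JpN 215 5,11314", "23ePP 232 6,35"]
    avg_len, avg_pi = cluster_averages(rows)
    print(f"Avg length: {avg_len:.1f}  Avg pI: {avg_pi:.2f}")
```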
Similar Clusters (pHMM search)
#    Cluster                # Members   Identity (%)   Alignment Length   E-value
1    phalp2_17558 (5WoZV)   75          36.1           177                6.015E-44
2    phalp2_2544 (6h4Z7)    13          37.3           166                2.303E-41
3    phalp2_38902 (21KkT)   20          38.0           171                1.504E-40
4    phalp2_12682 (21Kjh)   3           33.7           178                1.198E-38
5    phalp2_15440 (410GZ)   43          28.6           227                9.100E-34
6    phalp2_18807 (1gBg7)   77          28.8           180                1.850E-24
7    phalp2_40337 (3iVXO)   43          25.8           228                1.634E-17
8    phalp2_4546 (3WKbr)    13          30.3           165                4.681E-16
9    phalp2_10722 (3dYCr)   65          23.4           162                4.449E-14
10   phalp2_11680 (1Nyjf)   74          25.8           170                3.670E-10

Domains

[Domain architecture diagram for protein sequence 23ePP, residues 1 to 232. Regions shown: Unannotated, SLH.]
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF00395

Taxonomy

Phage
  Name: Unknown from Metagenome [NCBI]
  Taxonomy ID: UNKNOWN_ENVHOG
  Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 23ePP
Method: AlphaFoldv2
Resolution: 75.11
Chain position: -
Model Confidence
  Very high: pLDDT > 90
  High: 90 > pLDDT > 70
  Low: 70 > pLDDT > 50
  Very low: pLDDT < 50
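
The confidence bands above can be applied to per-residue pLDDT values with a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities on both sides, placing the exact boundary values (90, 70, 50) in the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: a score exactly on a boundary (90, 70, 50) falls into
    the lower of the two adjacent bands.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Illustrative scores only, not taken from the 23ePP model.
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```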