Protein

Protein accession
4k7zF [EnVhog]
Representative
4k7zF (this protein)
Source
EnVhog (cluster: phalp2_34905)
Protein name
4k7zF
Lysin probability
88%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MVKKSVGFIVGVIFSSLVMSLNVSAMNVEYATPDEIVTISYVGDIQEEKPEPEFDEDEVELIAKVVLGEAEGESELGKRLVASTILNRVDSDIWPDTVSGVCYQSGQFACLHNGRCNRVKITDSIRELVREEMAARSNYSVMYFSCGGYHNGTPMFKEGAHYFSGR
Physico‐chemical
properties
protein length:166 AA
molecular weight:18390,6 Da
isoelectric point:4,74
hydropathy:-0,12
Other Proteins in cluster: phalp2_34905
Total (incl. this protein): 37 Avg length: 188,4 Avg pI: 4,89

Protein ID Length (AA) pI
1GtuF 178 4,73426
1HtrF 210 4,09061
21n0g 153 4,44716
23RNO 195 4,32706
23T7L 164 5,06711
24E3i 171 4,31592
24rFM 169 4,81599
3TQPX 163 4,96974
3TRJG 186 4,96377
3VEmm 210 4,94661
3WHh1 201 4,20128
3WJaR 223 4,99537
3dZBD 217 4,29756
3gSuS 214 4,32308
3h08T 169 4,91159
3rOnU 190 4,50332
3rT8X 188 4,39862
41ekK 224 4,95201
41la3 206 4,70055
4NMHQ 190 5,04426
4kpEy 141 4,51378
5Wibc 186 6,76409
61lK5 190 4,32632
65VUF 229 4,83583
6u8hh 179 4,44455
7Uj3V 175 4,50889
7Z7FF 186 6,76199
7xWK7 174 4,41829
7xWuN 188 4,39862
812S4 211 8,91522
82or2 174 7,85259
8bstV 193 4,11477
8klzb 176 4,54333
8mo5W 233 4,33303
8n7dB 159 4,20742
8ryWK 190 4,27869
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1335
14NLd
89 44,3% 133 2.593E-53
2 phalp2_19010
13q6h
28 57,1% 105 2.357E-52
3 phalp2_2534
6887x
25 48,8% 129 4.429E-52
4 phalp2_40085
7WX1M
32 39,3% 150 8.795E-46
5 phalp2_6197
5Whe9
20 38,8% 144 2.264E-45
6 phalp2_12629
1Io7o
128 37,7% 143 3.860E-44
7 phalp2_4192
21QOx
13 42,2% 123 6.733E-40
8 phalp2_3197
7nFFO
87 35,5% 121 1.080E-28
9 phalp2_14799
71jqQ
2 29,9% 117 1.333E-27
10 phalp2_29561
40ZMT
29 36,9% 146 1.825E-27

Domains

Domains
Disordered region
Hydro_2
Protein sequence: 4k7zF
1 166
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4k7zF
Method AlphaFoldv2
Resolution 81.10
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50