Protein

Protein accession
24m07 [EnVhog]
Representative
4klcC
Source
EnVhog (cluster: phalp2_2231)
Protein name
24m07
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MALYGIDISNYQRNVDYNAYAFYIMKASEGRTYKDPMLDRHYNAVKAAGKLYGFYHYARPENNSMRAEVDHFLSLVGAHVGKAIFALDWEGNALRYGADKALEWLDYFYARTGVRPLFYCSDSQTGRYAKVAQKNYGLWDAKYSSHGPSHAGWSNIAIWQYQGSPIDKNVFYGDASTWMKYAAKDGQKIIVPQTAQTATANVKRDQWVKDIQTQLNRQYHAGL
Physico‐chemical
properties
protein length:223 AA
molecular weight:25506,3 Da
isoelectric point:9,15
hydropathy:-0,57
Representative Protein Details
Accession
4klcC
Protein name
4klcC
Sequence length
184 AA
Molecular weight
21614,33710 Da
Isoelectric point
8,77229
Sequence
MIKGCDVSHWNSIVNYANYKFCIIKATEGVNFKDVKKDTHATNAAAFGCDLGFYHYARPDKGNNPKLEAEYFLQSIKKYLGHCVLALDWEQKSLNCNIEWARQWLDYVYKQTGIRPLFYCQSSYTNRIDLIREGDYGLWIAQYNNKIQKPKVKDGKGYAIWQFTSRPIDEDLFNGNIEQFRKYM
Other Proteins in cluster: phalp2_2231
Total (incl. this protein): 109 Avg length: 235,4 Avg pI: 6,71

Protein ID Length (AA) pI
4klcC 184 8,77229
136vP 196 5,54370
1gE2Y 195 8,60435
1gGDX 191 8,77745
1k2Yq 197 9,17774
1nXOd 283 5,36739
1p0wA 159 5,25428
21qQ8 206 8,41198
232q3 213 5,34516
233Ml 202 8,46181
23bCy 192 6,94910
23qKf 206 8,45595
23smr 195 6,10760
23uub 300 9,68813
2llkZ 197 7,09501
2uSI 281 5,67068
38GoB 266 9,52026
3NZwF 269 5,40308
3TDQY 191 8,92624
3TQWm 198 6,69588
3VJEO 202 8,78570
3WMCE 189 6,51388
3a4J9 196 9,18573
3dQNN 186 8,83612
3gaoR 192 8,87654
3lyfd 331 4,41670
3nLjv 194 8,79750
3nyof 189 7,00208
3v5YP 208 5,49345
41cWB 192 8,87435
41iSs 154 4,86879
41mHa 191 8,92624
4CytV 191 9,01437
4GWvq 165 4,83657
4k3Mg 250 6,43948
4k59O 250 6,16779
4k8Uc 210 8,84456
5C2ao 341 4,41835
5KU4O 281 5,31856
5MRp7 281 5,21375
5N4YO 283 5,37426
5OieG 197 7,05584
5Ps18 281 5,22125
5QNOl 281 5,48800
5ZXxj 199 8,81710
5j0Ix 343 4,37100
60c7r 281 5,10382
61CKp 244 7,72348
64Lss 281 5,20608
66ePy 199 7,77310
66jw8 281 4,93149
66osI 283 5,34812
67SRK 283 5,52199
6YLBj 281 5,09541
6aJ3E 243 6,52292
6acqj 199 7,76804
6b2pN 197 7,77806
6iuy6 281 5,33510
6jlEQ 255 9,25497
6kdKC 283 5,52818
6leST 255 9,27237
6nXY8 199 7,12695
6nbRU 199 7,12633
6nfBf 286 5,28383
6p9jL 259 5,27929
6qHaO 281 5,00771
6sENx 281 5,10382
6t4wt 262 7,03112
6uNMR 197 7,77806
6v0cc 281 5,20608
6voAb 281 5,11206
6vrqw 281 5,65914
6vym8 281 5,65425
751ZK 305 9,68143
7KyR2 283 5,73945
7WDTU 190 8,98375
7Wjla 261 5,19022
7XgOc 250 5,63243
7Yg59 262 8,10679
82GQL 198 6,52423
82H3q 198 6,95422
82qeD 256 8,28072
82sKn 199 6,70378
834Iy 196 9,02230
84NGz 262 5,64465
84qAm 194 5,09871
85FuT 213 6,04792
87ED2 188 6,59227
8bsgq 190 8,78538
8dhu4 261 4,94172
8fBqv 203 6,64490
8fJon 281 6,12653
8fkNv 201 5,75793
8fkP0 197 8,55736
8fw2r 197 6,90420
8fw5s 199 7,64896
8kIiq 204 5,85876
8lNyp 263 5,50846
8lfsl 197 6,37446
8npNm 204 5,80902
8qW5Q 281 4,93109
8rL5a 266 7,05346
8resQ 281 5,10303
8rlh9 197 6,70452
8v6vp 281 5,19482
EZ8K 281 5,47134
JT1g 281 5,20386
NEAj 281 4,87482
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37341
8v6XY
562 33,9% 203 2.877E-59
2 phalp2_27394
4HDMm
9 36,7% 204 8.557E-54
3 phalp2_16579
7zIOC
8 32,0% 206 1.064E-52
4 phalp2_20117
23DDy
70 37,5% 176 7.038E-52
5 phalp2_39912
1jXPT
300 32,1% 199 4.657E-51
6 phalp2_5615
4g2xP
40 32,6% 202 3.826E-49
7 phalp2_38994
84ePK
348 34,6% 176 1.242E-44
8 phalp2_36157
6TiaL
25 31,6% 177 1.539E-43
9 phalp2_20360
2RVWF
69 31,0% 190 5.525E-37
10 phalp2_1614
888Gb
6 30,2% 195 6.822E-36

Domains

Domains
Representative sequence (used for alignment): 4klcC (184 AA)
Member sequence: 24m07 (223 AA)
1 184 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4klcC) rather than this protein.
PDB ID
4klcC
Method AlphaFoldv2
Resolution 95.76
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50