Protein

Protein accession
40KDn [EnVhog]
Representative
3e43j
Source
EnVhog (cluster: phalp2_18011)
Protein name
40KDn
Lysin probability
99%
PhaLP type
endolysin
Probability: 85% (predicted by ML model)
Protein sequence
MIKESNIIICGHGSGRPSTKNMYAYLSSRYGQKASNGVRKGLVCVRRLKDITEAGRKVYHDTYRSILGRNYYNQNLREYCFVPYKGNYYSDCSSSQMLSLNKVGYATGGTLNTAGIYYSRLFETVPVKIVNGHVQNPELLKVGDQLLFAGNDPSRPLQIGHVEAVYEIPDAYEAEEKWVYVSGKWYYRLRDGQNSYGWKDINKHWYYFDEKGMMLTGWQFIEGDWWYFLDTKGAQYEGALWHQKPGREGAWERWDL
Physico‐chemical
properties
protein length:256 AA
molecular weight:29757,2 Da
isoelectric point:8,80
hydropathy:-0,65
Representative Protein Details
Accession
3e43j
Protein name
3e43j
Sequence length
276 AA
Molecular weight
31402,04670 Da
Isoelectric point
9,31911
Sequence
MTEKDIVICGHGSGTPSYKNMYSYLTLRYNAKADNGLRKQIVAVRRLKGMTDDKRQLFQQYYSSIIGRNYYNQDRREYCYTPYKDGKYYSDCSSSGIKTYVKCGFSFSSTLNTAGIYNSSLFENVPVKIQNGHITNPEILKVGDALLFVGNDPKRPKQIGHVEFVFNLTNNANAKPTTTTTTTVKQTVKYPDWVHVGTGINSKWYYREKEGVNAHGWKNINGHRYYFDEKTGLMAKDWLKINGKWYYFQPESGEGAVLAGALYVSDADGAQHILKV
Other Proteins in cluster: phalp2_18011
Total (incl. this protein): 101 Avg length: 283,8 Avg pI: 6,95

Protein ID Length (AA) pI
3e43j 276 9,31911
13CJ9 284 5,48703
13yvE 280 7,48555
1bUFd 278 6,92659
1bZDq 279 6,17507
1bs4y 294 6,49229
1c4dL 279 6,32575
1c9F4 279 5,53228
1cLVp 309 8,72130
1cfyM 295 8,47787
1d9co 260 8,31637
1dMPk 279 6,51934
1kB1P 297 5,79305
1kBf3 297 5,31316
1kK 279 5,19806
1kkPz 280 6,80212
21ypq 316 9,24523
2358I 315 9,39035
23DJT 304 6,99128
23P1U 274 6,14904
23SJg 289 9,02830
23SKS 255 6,30824
23diM 317 9,67898
256 303 8,35241
28D 303 8,34268
2mu65 279 6,73004
38Hdk 295 5,48032
38Nkk 256 8,66463
3B6P8 282 5,33885
3TJxV 304 6,60392
3WJ8f 255 6,54981
3WOtm 297 5,62515
3ZB9s 309 8,95932
3ZYUv 303 5,64345
3bVxL 279 5,14168
3icbu 284 8,56548
3iiGN 297 5,53677
3pFRu 284 6,24157
3pFjZ 282 5,16606
3pJPp 280 6,33018
3qq1O 314 8,76082
3sAp2 278 5,01209
3sM19 276 5,32964
3sRht 303 7,57927
3t4Ol 258 8,51906
3uB34 280 5,97102
3wAQf 280 8,00744
3ycci 260 7,57342
3yqkv 279 5,99011
41clo 297 5,34454
41ivM 309 8,91967
4JZfD 266 8,84134
4kxVw 279 7,46850
5PJL5 303 6,94813
5RfGn 280 6,81451
5Whts 280 6,17217
60ABa 279 8,00834
64W64 260 8,33127
65geQ 280 6,72606
66Ak 251 7,05897
66kBF 273 5,59650
67QGh 303 5,81442
68iiM 279 8,26203
6SgFE 280 7,98526
6SyMt 280 7,97849
6rSt5 279 7,44764
708Nx 297 5,99904
70fYR 279 8,00828
70imp 279 8,00093
72AH0 279 5,85546
737Qt 284 5,80692
73Oiu 284 5,80692
7BPUj 279 7,46929
7ByY7 284 5,48703
7XkCZ 252 7,50857
7b6tk 278 6,92870
7bNuG 280 6,72538
7cxKS 282 6,00171
7ejjC 280 6,72635
7r33c 279 8,29503
7sEzT 285 6,24157
7skPK 279 6,31523
7tAmv 280 5,50215
7tNUk 284 5,90980
7tUEj 282 5,70882
7tUGF 280 8,56458
8aTBW 303 8,67256
8eyNZ 280 6,72504
8f7So 280 5,96175
8fCj0 298 6,44625
8fiBW 297 5,60412
8ftDG 279 5,64618
8kR59 280 7,47878
8lVBU 279 8,00099
8mqLh 289 8,79660
8pb7H 282 5,54899
8qzxL 280 6,72504
8sgpQ 280 7,48117
8tH0i 280 5,44656
8uIYC 280 6,72754
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6725
21J6O
3 54,0% 285 7.139E-98
2 phalp2_22618
1NA5i
16 68,6% 169 2.371E-84
3 phalp2_39689
6a3H
7 66,6% 168 4.499E-75
4 phalp2_20441
3nWor
16 33,0% 224 1.405E-50
5 phalp2_19270
84Tjp
100 34,0% 194 4.122E-14
6 phalp2_5203
41aUo
11 19,8% 222 8.467E-12

Domains

Domains
Unannotated
Choline_bind_3
Representative sequence (used for alignment): 3e43j (276 AA)
Member sequence: 40KDn (256 AA)
1 276 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF19127

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3e43j) rather than this protein.
PDB ID
3e43j
Method AlphaFoldv2
Resolution 82.82
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50