Protein

Protein accession
1NATe [EnVhog]
Representative
5QSR9
Source
EnVhog (cluster: phalp2_215)
Protein name
1NATe
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNSPYMGKFKVTQEFKGTWHDGLDLVGIDSKEIHATASGFIRYAGWENAENHQQGFGQYVCIEAGDGNFYYFGHLSEIKVKMGDSVKITDVIGIEGDTGYSFGSHCHYCIRPEFRAGCFLNVSEISGIPNELGTYDDGYRPQKTKQKEKISVVVDIDERKYSGLLEEI
Physico‐chemical
properties
protein length:168 AA
molecular weight:18867,9 Da
isoelectric point:5,20
hydropathy:-0,46
Representative Protein Details
Accession
5QSR9
Protein name
5QSR9
Sequence length
241 AA
Molecular weight
25945,22760 Da
Isoelectric point
9,39467
Sequence
VLREAVLMVSAAFLFGIEVDFLISPYKGTFRVSQAYRNLRANGTYHQGYDLVGIGDKSIYCPVYGTVIRAGWECATLPKKGFGQRVVVRIGSTAYYMYFGHLSKINVAVGQKLKPGDLIGAEGSTGHSTGSHLHWEIRINDISTGYVSVHQYAGIPNVAGSTAYTSNWVAELFGPSNLKKSTSGFPQRLYNSVLQGALGIDKDGIFGANTEKTVKEFQSAHGLTSDGIAGAKTKVALAKQL
Other Proteins in cluster: phalp2_215
Total (incl. this protein): 117 Avg length: 206,1 Avg pI: 8,36

Protein ID Length (AA) pI
5QSR9 241 9,39467
13Bqq 170 6,74090
13yF1 241 9,53644
1GbtJ 173 6,10931
1NzaU 274 9,12577
1gFI9 195 6,69424
1gyD9 170 6,36571
23Lll 160 6,30654
24qtF 168 6,27823
2LlV4 133 6,00222
2m1IB 222 9,39506
2oEIq 190 9,83286
2ujS 169 7,05215
39Xb9 166 8,43345
39XbA 169 8,74419
3Askc 222 9,38636
3KNX5 225 9,98630
3VFa4 184 5,82704
3WMss 178 5,79584
3dTmH 171 6,20616
3de6K 188 6,69685
3e3Yi 191 4,86936
3fTlV 170 6,11425
3gac2 169 5,85904
3ik8H 173 5,51704
3iozJ 193 4,60194
3iwmZ 189 6,69190
3jL2X 220 9,47938
3kfiN 222 9,30139
3kxVi 220 9,51806
3lnz5 222 9,39506
3lo7h 225 9,94330
3m5I8 222 9,41304
3prpU 220 9,58872
3qQG8 222 9,55642
3qm0E 222 9,75157
3sQFh 234 9,39422
3sQJs 220 9,58872
3sTsx 220 9,51897
3tDjz 190 9,39499
3w5NN 222 9,39506
40but 175 5,79709
4FXWF 188 6,74323
4L9TX 165 6,03314
4LaZT 165 6,03786
4Miwc 169 7,04374
4Mj7i 183 6,99315
4eZTL 221 5,73314
4f8U4 217 9,36534
4fuqv 217 9,56964
4yfIl 225 9,23318
5KuOI 240 9,28495
5L9Rh 163 7,77877
5MYDH 241 9,39403
5McRK 241 9,39467
5PF3H 225 9,94330
5QZsj 240 9,28495
5TakE 227 9,13893
5UaT8 190 9,56712
5Ujh9 241 9,39467
5XA1d 241 9,39467
5XPYr 241 9,39467
5Xw5I 238 9,57905
5Y7D1 241 9,39467
5Z2h7 165 7,04198
5Z4le 227 9,67691
5ZPnj 225 9,98630
61LiS 240 9,28495
63xcQ 186 9,76285
645YF 225 9,91126
64p5E 227 9,23492
67IQY 225 9,94401
687oY 241 9,39467
68EDe 160 8,44499
6Z7lX 225 10,03130
6bBjH 240 9,39467
6cuaT 238 9,63894
6ebnY 225 10,03130
6fCUC 190 9,39499
6ft8o 215 9,04216
6hRrH 161 8,44499
6kf71 241 9,47900
6mBUt 241 9,39467
6mfwU 227 9,13893
6mhK9 220 9,59775
6mpC8 178 9,86245
6rGtc 241 9,39467
6s36O 227 9,25007
6tCtg 225 10,12020
6uHZo 225 9,98630
6vXAr 225 9,98630
6vZQx 241 9,47874
712U1 240 9,37733
75er3 191 6,22855
7BiqR 227 9,31763
7DI1I 175 7,63913
7WOfa 171 6,63887
7WXLk 190 9,39428
7XkOu 236 9,46030
7Y4Qb 164 7,00691
7nFCL 191 6,22855
7q3t1 200 7,09523
81Li3 171 5,68074
81lwC 163 8,42236
86tY8 171 6,63751
87yE3 171 6,21054
8bsny 195 6,14199
8btfy 195 5,85785
8cDaU 225 10,07971
8ghes 190 9,39428
8lcmr 190 9,39428
8m4HK 251 7,14059
8mQYs 168 6,14494
8qVtP 240 9,28385
NAhV 240 9,53663
b2SE 220 9,58872
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4480
3h5QZ
10 43,7% 160 1.063E-72
2 phalp2_216
5St5t
5 80,6% 145 5.177E-69
3 phalp2_31629
4DqpK
8 31,6% 212 4.981E-27
4 phalp2_28634
8qsKr
11 33,3% 201 9.702E-23
5 phalp2_25593
3WFbK
1 33,7% 154 9.734E-21
6 phalp2_27718
6SAtZ
6 31,1% 196 3.839E-19
7 phalp2_33390
6Q97d
9 27,2% 191 7.074E-19
8 phalp2_9448
ugTX
6 29,8% 208 5.053E-17
9 phalp2_2348
4TuWR
59 26,4% 185 1.426E-15

Domains

Domains
Disordered region
PG_1
Representative sequence (used for alignment): 5QSR9 (241 AA)
Member sequence: 1NATe (168 AA)
1 241 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551|PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5QSR9) rather than this protein.
PDB ID
5QSR9
Method AlphaFoldv2
Resolution 91.03
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50