Protein

Protein accession
4NOD1 [EnVhog]
Representative
4GkQr
Source
EnVhog (cluster: phalp2_4675)
Protein name
4NOD1
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MITLEQLRVSTGATEANAAKYLDAINNALGLYQIDTPRKIAGFLSQVGHESGGLAIVVENLNYRVEALLSMFGRHRISEEDARKYGRTPDRPANQEAIANCLYGGSWGAKSLGNTEIGDGWKYRGRGLKQLTGRLNYRMCGDALGLNLIDDPDALAEPTAAALSAGWFWSSRRPMGIEEAAENQDVAKMTKLINGGDIGLTQRTALFRRALEVL
Physico‐chemical
properties
protein length:214 AA
molecular weight:23314,1 Da
isoelectric point:6,75
hydropathy:-0,30
Representative Protein Details
Accession
4GkQr
Protein name
4GkQr
Sequence length
141 AA
Molecular weight
15440,26500 Da
Isoelectric point
6,27846
Sequence
MTPEIIKSAFPKASDAIIDAILEYAPRYGIDAKQMPMFLAQAGHESGEFTVFCESLNYSADALVKIFSRHRISEADAEKYGRTSGHAANQEMIANLIYGGAWGAKNLGNTQPGDGWMFRGRGIFQLTGRANYVAFVKDSPN
Other Proteins in cluster: phalp2_4675
Total (incl. this protein): 23 Avg length: 211,4 Avg pI: 8,39

Protein ID Length (AA) pI
4GkQr 141 6,27846
16aWN 140 9,64210
1KxgC 217 9,38481
1LBMi 221 9,28617
2Zueb 108 6,03098
3fZ6a 286 9,25465
4NP8L 216 5,79419
4emA6 222 8,92599
4fS2W 181 9,56100
4g7Jf 239 9,06504
4o01f 173 10,30780
56FWf 214 6,75267
6zjn8 214 6,75267
89KzH 238 9,12623
8iQjP 241 9,06408
A0A023NGE0 229 6,52821
A0A1P8VVH0 291 6,00023
A0A2R3UA80 215 9,94014
A0A6J5M118 221 9,68884
A0A9E7MZJ3 213 9,49279
A0AAF0I9X9 215 9,49279
A0AAV2PF34 213 9,80604
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26839
3X8d3
3 52,2% 111 2.876E-39
2 phalp2_5234
8cPFI
3482 43,2% 141 2.712E-35
3 phalp2_9408
8GIg1
520 39,4% 137 6.138E-27
4 phalp2_39815
Hj9s
55 37,8% 119 2.693E-25
5 phalp2_36471
1bFy4
16 40,2% 134 3.690E-25
6 phalp2_12287
7e8ZJ
95 36,8% 122 2.215E-23
7 phalp2_29588
83X5u
30 40,3% 109 2.007E-22
8 phalp2_14082
8g4v0
2 34,7% 92 4.225E-20
9 phalp2_24890
f0po
5 36,3% 143 5.241E-16
10 phalp2_31323
2aR1M
3 32,7% 116 4.225E-14

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4GkQr (141 AA)
Member sequence: 4NOD1 (214 AA)
1 141 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4GkQr) rather than this protein.
PDB ID
4GkQr
Method AlphaFoldv2
Resolution 95.21
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50