Protein

Protein accession
1NUQl [EnVhog]
Representative
4lGFh
Source
EnVhog (cluster: phalp2_37758)
Protein name
1NUQl
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTRAPASLKKAQAYLHDVTDLPWVSLGIVGDDDHDGGYHCGEDRVDADDYSVDESSRDKNGLSDYASALDVGNFNRLREFSKWLVAECVKGAEDTKDIREVIYSPDGKTVKRWDRLGKRSSGDKSHLKHTHISYFRDSSGRDKNGLFRRFFENVPSKPSTPKPSKPTSKPAPTPHYSFPLPSGYYFGPKGGPKESVSGYYGRSFKGVKDRDWLKRWGVQMGKRGWNLKANLPSGNDGFFGPEYARFVKKFQADQGLRQDGKLGPATWRAAFENPVR
Physico-chemical properties
Protein length: 276 AA
Molecular weight: 31018,2 Da
Isoelectric point: 9,46
Hydropathy: -0,93
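The hydropathy value can be cross-checked against the protein sequence. The sketch below assumes that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue). The record does not say which scale was used, and the function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
# Assumption: the record's "hydropathy" is the mean of these values (GRAVY).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KD[aa] for aa in seq) / len(seq)
```

Passing the protein sequence listed above to `gravy` and comparing the result with the reported hydropathy, and `len()` of the sequence with the reported length, gives a quick consistency check on the record.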
Representative Protein Details
Accession
4lGFh
Protein name
4lGFh
Sequence length
345 AA
Molecular weight
37676,12140 Da
Isoelectric point
9,56145
Sequence
MTYAPQTYKDARAFLLHELDTHPGVTVNDDLDPLEVGIVGDEAHIRAGTSYHLGLVHLKPNAYSLRLPRDRAGATNGAAALDIGWFSKTLPNGRTINLRTLSTWLVQQCQAGASDTLWIREIGYSPDGNVVLHWDRERGRTSAPVAGVFDLSHRWHTHISGYRDCETVDKTSLFRRALKELSEGLWTPTPPKPSAWVPTLHQGLPTLKRRDSGTFVRSAQAALYAHGFPPFPNDPRRSIDGDFGPKTEAAVRNFQAARGLKVDGIIGTGETWPALFSAAGTVARGNRGTAVSIAQALLCARGQWILIDGIAGRQTDEAIRSFQRARGLKVDGIAGPATWPRLIRG
Other Proteins in cluster: phalp2_37758
Total (incl. this protein): 33; Avg length: 294,2; Avg pI: 7,86

Protein ID Length (AA) pI
4lGFh 345 9,56145
11ytb 282 6,66599
1fQFJ 303 8,36066
1p6yX 256 9,34013
35F9C 288 5,49596
38Y4e 263 9,17374
3NIuW 253 7,92673
3OLCA 256 9,26045
4Ka5D 279 5,35295
4Qyjd 278 6,13522
4Xtts 274 7,81146
5qRlX 312 5,32766
5zECA 320 5,28594
6T7rj 364 4,92887
6TI62 349 8,94043
6Tgy2 254 9,07845
6Th1Q 260 7,89636
7cvCu 314 6,53185
7lpg7 275 9,68439
7osEU 292 6,32103
7p08r 317 10,17738
7qcpx 316 9,88676
7qcsG 316 9,90642
7tJ9D 290 8,92876
7tMLY 307 6,17620
7ujJw 299 9,14615
7wbKU 278 9,29778
7zhDs 285 6,14972
7ziJw 314 10,33198
7zjxk 313 7,21488
X3Ki 301 7,29679
gSIG 278 6,49138
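The cluster summary (total, average length, average pI) can be recomputed from rows like the ones above. This is a minimal sketch, assuming whitespace-separated `ID length pI` rows with decimal commas; the helper names `parse_decimal` and `summarize` are hypothetical. The table appears to omit this protein (1NUQl), so its row would need to be added to reproduce the reported total.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '9,56145' to a float."""
    return float(text.replace(",", "."))


def summarize(rows: str) -> tuple[int, float, float]:
    """Return (count, mean length, mean pI) for 'ID length pI' rows."""
    lengths, pis = [], []
    for line in rows.strip().splitlines():
        _protein_id, length, pi = line.split()
        lengths.append(int(length))
        pis.append(parse_decimal(pi))
    return len(lengths), mean(lengths), mean(pis)


# First three rows of the table, used only as sample input.
sample = """
4lGFh 345 9,56145
11ytb 282 6,66599
1fQFJ 303 8,36066
"""
count, avg_len, avg_pi = summarize(sample)
print(count, round(avg_len, 1), round(avg_pi, 2))
```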
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36190 (725jQ) 32 29,6% 250 3.872E-33
2 phalp2_21756 (3Ra3w) 9 26,9% 267 1.323E-14

Domains

Domain architecture (diagram)
Domain tracks: Unannotated, PG_1, PG_1
Representative sequence (used for alignment): 4lGFh (345 AA)
Member sequence: 1NUQl (276 AA)
Axis: positions 1 to 345 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01471

Taxonomy

Phage name: Unknown from Metagenome
Phage taxonomy ID: UNKNOWN_ENVHOG
Phage lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1NUQl
Method AlphaFoldv2
Resolution 91.76
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
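The confidence bands in this legend map directly onto a small lookup. The sketch below is illustrative only: the function name `classify_plddt` is hypothetical, and the legend does not say where the exact boundary values 90, 70 and 50 belong, so placing 90 in High and both 70 and 50 in Low is an assumption. Reading the score listed under "Resolution" as a mean pLDDT is also an assumption, since the record does not confirm it.

```python
def classify_plddt(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:   # assumption: exactly 90 counts as High
        return "High"
    if score >= 50:  # assumption: exactly 70 and exactly 50 count as Low
        return "Low"
    return "Very low"


# Scores taken from the two structure entries in this record.
for score in (91.76, 86.82):
    print(score, classify_plddt(score))
```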

The structures below correspond to the cluster representative (4lGFh) rather than this protein.
PDB ID
4lGFh
Method AlphaFoldv2
Resolution 86.82
Chain position -