Protein

Protein accession: 4060G [EnVhog]
Representative: 1NzsX
Source: EnVhog (cluster: phalp2_36572)
Protein name: 4060G
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MDWREIGIIPLTDAAGITKQFNAESHRGLDIGWYKNKYCPVLAWQDGKLIAKGYSGEVGYWCVLEHEYADGKRWVGYIHLYQAISAAIGTTYKMGQQVSNAKRGNTGYSNGVHLHIYMTRIIPKDTVFVWTGSDDPWTKYAIDPLSHLYYDKKYNTDYIAPSWARPWPEEEDWEKLYKEEKGKNEILNAKIEQIREIVK
Physico-chemical properties
Protein length: 199 AA
Molecular weight: 23101,9 Da
Isoelectric point: 6,60
Hydropathy: -0,61
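The hydropathy value listed above is commonly reported as a GRAVY score, that is, the mean Kyte-Doolittle hydropathy index over all residues. Whether this page uses exactly that definition is an assumption; the sketch below only illustrates the usual calculation. The function name `gravy` and the 20-residue fragment are chosen for illustration.

```python
# Kyte-Doolittle hydropathy index per amino acid (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


# First 20 residues of the 4060G sequence, used purely as an illustration.
fragment = "MDWREIGIIPLTDAAGITKQ"
print(len(fragment), round(gravy(fragment), 2))
```

Passing the full 199-residue sequence instead of the fragment gives a value that can be compared with the hydropathy reported on this page, assuming the page uses the same scale.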
Representative Protein Details
Accession: 1NzsX
Protein name: 1NzsX
Sequence length: 172 AA
Molecular weight: 19513,34750 Da
Isoelectric point: 9,75569
Sequence
MIYPLLPTYDGYGAQGYSSSHRALDIGWLTKYSSNGKTKLYACLDGVVVQAGTITEKVNGKVVKPIVCVLRCEYNGYRYYIRYWHLDKVKVTKGSKVVRGQVIGIRGNTGYSFGTHLHIEVLKTKKGTAYSKCCGANWNKYNINPIRFFYRDSKYNHWSDGGVFKIKKMPTA
Other Proteins in cluster: phalp2_36572
Total (incl. this protein): 4; Avg length: 192,3 AA; Avg pI: 8,10

Protein ID Length (AA) pI
1NzsX 172 9,75569
1NyTS 188 9,89520
8fAtW 210 6,17183
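The cluster averages can be recomputed from the member list together with this protein's own values (199 AA, pI 6,60). The following is a minimal sketch, assuming plain arithmetic means and decimal commas as shown on the page; the helper `parse_decimal` is a hypothetical name introduced here.

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '9,75569' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed on the page)
members = [
    ("4060G", 199, "6,60"),      # this protein
    ("1NzsX", 172, "9,75569"),   # cluster representative
    ("1NyTS", 188, "9,89520"),
    ("8fAtW", 210, "6,17183"),
]

lengths = [length for _, length, _ in members]
pis = [parse_decimal(pi) for _, _, pi in members]

avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)
print(avg_length, avg_pi)
```

The recomputed pI average may differ from the displayed one in the last digit, since this protein's pI is shown rounded to two decimals.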
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_31273 (8pWKI) 3 33,5% 155 6.481E-25
2 phalp2_13280 (3inuw) 7 34,8% 149 3.124E-16
3 phalp2_40531 (4Ht1U) 57 34,1% 158 2.418E-14
4 phalp2_22730 (8dH10) 29 29,4% 139 2.116E-13
5 phalp2_34915 (4pbkX) 159 31,8% 132 3.930E-13
6 phalp2_14848 (7w0BH) 74 29,1% 127 1.845E-12
7 phalp2_25537 (3nGXJ) 39 30,1% 176 4.660E-12
8 phalp2_27905 (l4Bb) 39 29,9% 157 1.176E-11
9 phalp2_40658 (5cY9d) 663 34,5% 113 2.967E-11
10 phalp2_39288 (4dsh8) 2 28,3% 155 1.881E-10
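To work with the similar-cluster hits programmatically, the rows can be parsed into typed records and then filtered or ranked. This is a hedged sketch: the `Hit` type, the `parse_hit` helper, and the 1e-15 cutoff are assumptions for illustration and are not part of the database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity: float    # percent identity
    aln_length: int    # alignment length
    evalue: float


def parse_hit(cluster, members, identity, aln_length, evalue) -> Hit:
    """Parse one table row; identity uses a decimal comma and a % sign."""
    pct = float(identity.rstrip("%").replace(",", "."))
    return Hit(cluster, int(members), pct, int(aln_length), float(evalue))


RAW = [
    ("phalp2_31273", "3", "33,5%", "155", "6.481E-25"),
    ("phalp2_13280", "7", "34,8%", "149", "3.124E-16"),
    ("phalp2_40531", "57", "34,1%", "158", "2.418E-14"),
    ("phalp2_22730", "29", "29,4%", "139", "2.116E-13"),
    ("phalp2_34915", "159", "31,8%", "132", "3.930E-13"),
    ("phalp2_14848", "74", "29,1%", "127", "1.845E-12"),
    ("phalp2_25537", "39", "30,1%", "176", "4.660E-12"),
    ("phalp2_27905", "39", "29,9%", "157", "1.176E-11"),
    ("phalp2_40658", "663", "34,5%", "113", "2.967E-11"),
    ("phalp2_39288", "2", "28,3%", "155", "1.881E-10"),
]

hits = [parse_hit(*row) for row in RAW]

# Hit with the highest percent identity.
best_identity = max(hits, key=lambda h: h.identity)
# Clusters below an arbitrary, illustrative E-value cutoff.
strong = [h.cluster for h in hits if h.evalue < 1e-15]
print(best_identity.cluster, strong)
```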

Domains

Annotated features: PET_M23, Disordered region
Representative sequence (used for alignment): 1NzsX (172 AA)
Member sequence: 4060G (199 AA)
Domain positions follow the representative sequence above (axis: 1 to 172 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage: Unknown from Metagenome; UNKNOWN_ENVHOG; No lineage information
Host: No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1NzsX) rather than this protein.
PDB ID: 1NzsX
Method: AlphaFold v2
Resolution: 79.35
Chain position: -
Model confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
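The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and assigning the exact boundary values (90, 70, 50) to the lower band is an assumption, since the legend uses strict inequalities only.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values fall into the lower band.
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```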