Protein

Protein accession
4nWR9 [EnVhog]
Representative
7G46l
Source
EnVhog (cluster: phalp2_40601)
Protein name
4nWR9
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
SEKKRINWLIANVEKFGWSWEVVPSEPWHIRYVCGDAVPQAVKDYVARNPKPGSVFGSVAEQKAAAEQKAATPAANVVAAANKPNIVKDNKGKAVKEAQSLLDKHGFACKPDGDFGPKTQEIVKQFQKAKGIPVTGNVDQPTWAALLA
Physico-chemical properties
Protein length: 148 AA
Molecular weight: 16004,1 Da
Isoelectric point: 9,34
Hydropathy: -0,43
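The record does not say how the hydropathy value was derived. A common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues; the sketch below assumes that convention. The names `KYTE_DOOLITTLE`, `gravy` and `SEQ_4NWR9` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (assumed here; the record does not name a scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Sequence of 4nWR9, copied from the "Protein sequence" field of this record.
SEQ_4NWR9 = (
    "SEKKRINWLIANVEKFGWSWEVVPSEPWHIRYVCGDAVPQAVKDYVARNP"
    "KPGSVFGSVAEQKAAAEQKAATPAANVVAAANKPNIVKDNKGKAVKEAQS"
    "LLDKHGFACKPDGDFGPKTQEIVKQFQKAKGIPVTGNVDQPTWAALLA"
)

print("length:", len(SEQ_4NWR9))                   # the record lists 148 AA
print("GRAVY:", round(gravy(SEQ_4NWR9), 2))        # compare with the listed -0,43
```

If the database used a different scale or a windowed average, the printed score may not match the listed value.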
Representative Protein Details
Accession
7G46l
Protein name
7G46l
Sequence length
90 AA
Molecular weight
9284,65340 Da
Isoelectric point
9,47732
Sequence
MRYVSGDNVPAAVAAFTGGVAPIVNLNTTPPIHDHKALQEALKAKGFYKGEINGAKDAATDAAVKAFKVANKLAADSIVGPKVKELLGLK
Other Proteins in cluster: phalp2_40601
Total (incl. this protein): 15; Avg length: 102,1 AA; Avg pI: 8,70

Protein ID Length (AA) pI
7G46l 90 9,47732
1hlMT 102 5,75639
2Y0Fo 142 8,86081
2YhDp 114 8,17899
37qLu 90 9,52419
3pLs 90 9,47861
45TFj 90 9,52419
4YnPK 111 8,17886
59GhP 111 8,16146
5Fdhx 70 9,35154
5v5Sv 110 7,03078
TTaT 90 9,30480
c8nN 106 8,68088
hXnx 67 9,57557
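The cluster averages can be re-derived from the listing. The table shows 14 members, so the sketch below adds this protein (4nWR9, 148 AA, pI 9,34) to reach the stated total of 15. It is a minimal check, assuming the averages are plain arithmetic means; `MEMBERS` and `parse_decimal_comma` are illustrative names. The pI of 4nWR9 is given to only two decimals, so the last digit of the average pI may differ from the listed value.

```python
# Member table copied from the record: ID, length (AA), pI with decimal comma.
MEMBERS = """\
7G46l 90 9,47732
1hlMT 102 5,75639
2Y0Fo 142 8,86081
2YhDp 114 8,17899
37qLu 90 9,52419
3pLs 90 9,47861
45TFj 90 9,52419
4YnPK 111 8,17886
59GhP 111 8,16146
5Fdhx 70 9,35154
5v5Sv 110 7,03078
TTaT 90 9,30480
c8nN 106 8,68088
hXnx 67 9,57557
"""


def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma, e.g. '9,47732'."""
    return float(text.replace(",", "."))


rows = []
for line in MEMBERS.splitlines():
    protein_id, length, pi = line.split()
    rows.append((protein_id, int(length), parse_decimal_comma(pi)))

# The listing omits the protein this page describes; add it from its own fields.
rows.append(("4nWR9", 148, parse_decimal_comma("9,34")))

avg_length = sum(length for _, length, _ in rows) / len(rows)
avg_pi = sum(pi for _, _, pi in rows) / len(rows)

print("members:", len(rows))
print("average length:", round(avg_length, 1))
print("average pI:", round(avg_pi, 3))
```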
Similar Clusters (pHMM search)
#  Cluster (protein ID)    Members  Identity (%)  Alignment length  E-value
1  phalp2_16366 (5Ce2m)    1        87,0          54                4.255E-23
2  phalp2_27137 (2X8cf)    2        34,4          87                9.766E-10
3  phalp2_15416 (23BCo)    2        38,3          60                1.440E-06
4  phalp2_38906 (22Pcz)    5        31,6          79                1.818E-05

Domains

Domain architecture diagram (axis: 1 to 90 AA, representative sequence)
Representative sequence (used for alignment): 7G46l (90 AA)
Member sequence: 4nWR9 (148 AA)
Regions shown: Disordered region, Unannotated
Domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                     Taxonomy ID             Lineage
Phage  Unknown from Metagenome  [NCBI] UNKNOWN_ENVHOG   No lineage information
Host   No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7G46l) rather than this protein.
PDB ID: 7G46l
Method: AlphaFold v2
Resolution: 63.68
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
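The confidence bands above can be expressed as a small lookup. This is a sketch only: the legend uses strict inequalities and does not say where the exact boundary values 90, 70 and 50 belong, so the code assumes each boundary falls into the lower band. The function name `plddt_band` is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend.

    Assumption: the boundary values 90, 70 and 50 are assigned to the lower
    band, because the legend leaves them unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```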