Protein

Protein accession
3NXJD [EnVhog]
Representative
4Wll6
Source
EnVhog (cluster: phalp2_14562)
Protein name
3NXJD
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MDISRVDIHPDNYGGINRGRRYIILHANGASTESSINWFKDPDARVSYHYLVTQDGVIFQFVDEGKRAWHAGVSRWQDDSDLNDLSVGIAVESVDGTESDLTALQINTLRTLIIDIAERHGIPTAHVLAHKEVSPGRKVDPIHINMPEMRKSLDGIIDAPHRVGTLVLHGFGDLTRRGQSVVLRGDMVYRVRGGKLDVRLEGGGDGWQGD
Physico‐chemical
properties
protein length:210 AA
molecular weight:23266,8 Da
isoelectric point:5,90
hydropathy:-0,38
Representative Protein Details
Accession
4Wll6
Protein name
4Wll6
Sequence length
202 AA
Molecular weight
23189,32460 Da
Isoelectric point
5,27246
Sequence
MNIQRVDIPDDNYGGVNHGRLFIIVHANGATDESTMHWLKNSDSDVSYHYLINLEGGVMQFVDEGKRAWHAGRSTWDGHHDLNDVSVGVAIESEEGTHSQVTETQYETLLELIRDIQERHSIRTDYVRGHKEVSPNRKTDPIHIDMDKLRRDLDNQDDTDEIHTLVLHGFDNLTTRAEVVVRGKMQLRTRGDKVDIRLEQVS
Other Proteins in cluster: phalp2_14562
Total (incl. this protein): 3 Avg length: 198,7 Avg pI: 6,00

Protein ID Length (AA) pI
4Wll6 202 5,27246
2Un9w 184 6,81161
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21916
4KsAl
20 35,0% 140 2.835E-25
2 phalp2_11251
6GtZw
14 31,6% 155 3.425E-24
3 phalp2_20791
5IOGV
1 26,8% 227 8.713E-24
4 phalp2_24979
ze0C
8 29,7% 178 1.260E-21
5 phalp2_10505
89jAF
60 34,9% 126 9.681E-20
6 phalp2_31152
3LHkO
133 27,9% 168 2.451E-19
7 phalp2_28569
7YbnR
6 31,6% 136 3.966E-18
8 phalp2_22919
33qi6
180 27,5% 203 1.002E-17
9 phalp2_18584
7jUZU
22 31,2% 144 1.364E-17
10 phalp2_17296
4dV4i
8 27,5% 145 1.858E-17

Domains

Domains
Representative sequence (used for alignment): 4Wll6 (202 AA)
Member sequence: 3NXJD (210 AA)
1 202 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Wll6) rather than this protein.
PDB ID
4Wll6
Method AlphaFoldv2
Resolution 83.94
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50