Protein

Protein accession
14upf [EnVhog]
Representative
H4lq
Source
EnVhog (cluster: phalp2_556)
Protein name
14upf
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MRISPLELVIFGRVYRFVGGYRALSWPISIGTRIPAPVMGAITVFARTGGGHVGFAVGRNEHGALMILGGNQGDRVSIAAFDPARVLGYVWPPGAPYSSQPLARYMGNYPLSQNEA
Physico‐chemical properties
protein length: 116 AA
molecular weight: 12433,2 Da
isoelectric point: 10,25
hydropathy: 0,16
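The length and hydropathy values above can be checked directly from the sequence. The sketch below assumes that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy index per residue); the page does not name the scale, so treat that as an assumption. The helper name `gravy` is illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue.
# ASSUMPTION: the page's "hydropathy" is the GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 14upf, copied from the record above.
SEQUENCE = "MRISPLELVIFGRVYRFVGGYRALSWPISIGTRIPAPVMGAITVFARTGGGHVGFAVGRNEHGALMILGGNQGDRVSIAAFDPARVLGYVWPPGAPYSSQPLARYMGNYPLSQNEA"


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


if __name__ == "__main__":
    print("length:", len(SEQUENCE), "AA")
    print("GRAVY:", round(gravy(SEQUENCE), 2))
```

Under that assumption the computed score should land close to the 0,16 reported above. Molecular weight and isoelectric point depend on the mass table and pKa set used, so they are not reproduced here.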
Representative Protein Details
Accession
H4lq
Protein name
H4lq
Sequence length
117 AA
Molecular weight
12642,45290 Da
Isoelectric point
9,91480
Sequence
VVWGLCRPCVEAAGFKKPKNWFRARAWLEFGQRIVTPAVGCIVVFERSGGGHVGFLMGQDEHGRLMVLGGNQANAVTIAPFDRSRVLGYRWPGTNIAPIGFLPFVASNGAKASNNEA
Other Proteins in cluster: phalp2_556
Total (incl. this protein): 5; Avg length: 148,6 AA; Avg pI: 9,65

Protein ID    Length (AA)    pI
H4lq          117            9,91480
56NhC         162            9,50053
6Pymi         162            9,33704
A0A8S5ML70    186            9,24162
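The cluster summary (5 proteins, average length 148,6, average pI 9,65) includes 14upf itself, which is not listed among the "other" proteins. A minimal sketch of that arithmetic, using only values from this record; the helper `parse_decimal_comma` is a hypothetical name introduced here to handle the page's decimal commas.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma string such as '9,91480' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed on the page).
# The first entry is this protein; the rest come from the table above.
MEMBERS = [
    ("14upf", 116, "10,25"),
    ("H4lq", 117, "9,91480"),
    ("56NhC", 162, "9,50053"),
    ("6Pymi", 162, "9,33704"),
    ("A0A8S5ML70", 186, "9,24162"),
]

lengths = [length for _, length, _ in MEMBERS]
pis = [parse_decimal_comma(pi) for _, _, pi in MEMBERS]

total = len(MEMBERS)
avg_length = sum(lengths) / total
avg_pi = sum(pis) / total

if __name__ == "__main__":
    print(total, round(avg_length, 1), round(avg_pi, 2))
```

Rounded to the precision shown on the page, these averages agree with the reported 148,6 and 9,65.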
Similar Clusters (pHMM search)
#    Cluster                 # Members    Identity (%)    Alignment Length    E-value
1    phalp2_32166 (6PqA1)    486          50,4%           123                 2.118E-30
2    phalp2_26508 (40TJu)    12           71,7%           78                  9.405E-29
3    phalp2_35973 (5kZXR)    1764         48,2%           112                 2.782E-26
4    phalp2_18626 (5Zp9)     1            44,4%           90                  7.434E-16
5    phalp2_39164 (35aH9)    5            45,8%           96                  3.601E-15
6    phalp2_11009 (4NiJd)    22           38,2%           89                  1.051E-12
7    phalp2_768 (1YX1Z)      1            38,5%           96                  1.441E-12
8    phalp2_23434 (6ADve)    14           42,8%           77                  8.650E-11
9    phalp2_34973 (4FMbb)    48           36,5%           93                  2.224E-10
10   phalp2_36357 (qs4Q)     1            35,8%           78                  6.364E-08

Domains

Unannotated
Representative sequence (used for alignment): H4lq (117 AA)
Member sequence: 14upf (116 AA)
Axis: 1 to 117 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

         Name                       Taxonomy ID       Lineage
Phage    Unknown from Metagenome    UNKNOWN_ENVHOG    No lineage information
Host     No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
14upf
Method AlphaFoldv2
Resolution 70.36
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
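The confidence bands in the legend can be written as a small lookup. This is a sketch under two assumptions that the page does not confirm: that the number labelled "Resolution" for these AlphaFoldv2 models is a mean pLDDT score, and that a score exactly on a boundary (90, 70 or 50) falls into the lower band, since the legend uses strict inequalities on both sides. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    ASSUMPTION: boundary values (exactly 90, 70, 50) go to the lower band;
    the legend itself leaves them unspecified.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # ASSUMPTION: the "Resolution" values on this page are mean pLDDT scores.
    for name, value in [("14upf", 70.36), ("H4lq", 85.37)]:
        print(name, value, plddt_band(value))
```

If the first assumption holds, both models in this record would sit in the "High" band, with 14upf only just above the 70 cutoff.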

The structures below correspond to the cluster representative (H4lq) rather than this protein.
PDB ID
H4lq
Method AlphaFoldv2
Resolution 85.37
Chain position -