Protein

Protein accession
10zvw [EnVhog]
Representative
14F4j
Source
EnVhog (cluster: phalp2_8450)
Protein name
10zvw
Lysin probability
85%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VARWSRNCGLPWQVVVALIYHESGFNPLARSATNDHGLCQLHGRPIYDIGENVRAGCAHLAGCVAGSGSMRSALATYNGGPGGRNAGRCLAYADAIIAVAYGD
Physico‐chemical
properties
protein length:103 AA
molecular weight:10729,0 Da
isoelectric point:8,39
hydropathy:0,03
Representative Protein Details
Accession
14F4j
Protein name
14F4j
Sequence length
138 AA
Molecular weight
15021,81460 Da
Isoelectric point
5,59883
Sequence
MILQEDGGVAASITPRIMEWGSHLTDKQVYDIAWNIARYSRMNGLRPEVVVALIKVESGYNINAASPTNDYGLTQVHGKCIYGIEQNIATGLDELGWRLKGKGGDYRAALAGYNGGTYPPPVSWGYADRVLGLADTVF
Other Proteins in cluster: phalp2_8450
Total (incl. this protein): 15 Avg length: 160,5 Avg pI: 7,63

Protein ID Length (AA) pI
14F4j 138 5,59883
14MMr 148 9,03339
14VbA 147 9,61141
24P8d 142 9,82977
2TO1P 224 6,05139
4hPMZ 161 9,64758
4iYrh 150 9,99216
4mumz 185 6,33518
4ugem 172 5,67596
6UPm6 172 5,60071
8vruQ 174 4,37066
Ikgr 148 10,08410
Zsf6 148 9,01411
aPry 195 5,22415
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37065
ZDwm
3 41,4% 135 1.675E-38
2 phalp2_29014
6Dcie
9 31,7% 107 1.650E-15
3 phalp2_24291
3j68U
7 28,8% 149 5.792E-15
4 phalp2_10704
36twR
16 25,8% 151 3.804E-14
5 phalp2_600
139Z6
763 29,4% 136 3.804E-14
6 phalp2_4421
2GvBa
2 32,7% 119 7.122E-14
7 phalp2_27585
5Ftcj
5 32,0% 100 9.744E-14
8 phalp2_24649
5BnRV
3 27,4% 102 9.744E-14
9 phalp2_27552
5rxEe
14 34,2% 108 2.495E-13
10 phalp2_19999
6ULRQ
3 31,7% 126 2.495E-13

Domains

Domains
Representative sequence (used for alignment): 14F4j (138 AA)
Member sequence: 10zvw (103 AA)
1 138 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
10zvw
Method AlphaFoldv2
Resolution 95.11
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (14F4j) rather than this protein.
PDB ID
14F4j
Method AlphaFoldv2
Resolution 98.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50