Protein

Protein accession
ZJAZ [EnVhog]
Representative
4GTXt
Source
EnVhog (cluster: phalp2_5698)
Protein name
ZJAZ
Lysin probability
99%
PhaLP type
endolysin
Probability: 76% (predicted by ML model)
Protein sequence
MPRITPEFWENIRNKMRELDISGNAKVLGDYLGQNIKRVAEDVIPKPTKLSDDAPMNTFGNTPVAEPERKITWDYQDKESFPIGYRDYLTESADQYGIDPNILASLIASESGGAGYQPDLYGTSGEVGIAQIIPDLYYQGAGFQSPEEYALALQDPAFAIEQAASILKDLLTEYDNNYFDALASYNGGPTGYQTTGAGYDYARDTLSRIGMVDHYTPTYLNQ
Physico‐chemical
properties
protein length:222 AA
molecular weight:24614,9 Da
isoelectric point:4,25
hydropathy:-0,53
Representative Protein Details
Accession
4GTXt
Protein name
4GTXt
Sequence length
196 AA
Molecular weight
N/A Da
Isoelectric point
4,68418
Sequence
MNLSEIWDNLMGYFQNTDVQRQGNGVVASSQKPMRVMKGWVDDYTPENSTPKVLGASNSYNPITFRDXXXXYPEDLQVPTQKAADQYKLDPLLLASLVAQETGGFGYKPIRGSSGERGITQIIPDIWAANAGYADKAGEYGAMLETNPEFALLEAARILSMLEGQYPGYGLAAYNAGPNYNPGIPYQQEILSRIGR
Other Proteins in cluster: phalp2_5698
Total (incl. this protein): 6 Avg length: 215,3 Avg pI: 4,49

Protein ID Length (AA) pI
4GTXt 196 4,68418
3ekvS 223 4,29006
4HkGi 222 4,39049
4UWex 227 4,61529
4lFq9 202 4,69362
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26115
5FRQ
3 49,2% 142 1.207E-36
2 phalp2_23145
4GXmt
1 31,7% 126 2.807E-24
3 phalp2_20903
6Xcbb
9 26,6% 139 8.480E-07
4 phalp2_2925
RuYi
185 24,6% 138 9.214E-06
5 phalp2_34906
4kB5k
11 27,5% 138 2.246E-05
6 phalp2_16618
8yg21
13 26,7% 127 3.021E-05
7 phalp2_18775
16gAa
3 28,1% 128 5.462E-05
8 phalp2_29014
6Dcie
9 27,6% 134 9.866E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4GTXt (196 AA)
Member sequence: ZJAZ (222 AA)
1 196 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
ZJAZ
Method AlphaFoldv2
Resolution 75.84
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4GTXt) rather than this protein.
PDB ID
4GTXt
Method AlphaFoldv2
Resolution 64.35
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50