Protein

Protein accession
4TGbA [EnVhog]
Representative
4H3SU
Source
EnVhog (cluster: phalp2_34976)
Protein name
4TGbA
Lysin probability
99%
PhaLP type
endolysin
Probability: 93% (predicted by ML model)
Protein sequence
MTKLYEVQVARLNVRSGPGTDHDVTEVLKRGDLLYPVIDWLPVWLEPGTEPAAVGWVSARYVETVEEAIEPEPQPIPSAGEPPWVTVAKSQIGVSEVPGAGANPQILNYFMATSFRPSRGDEDPWCSGLACWCMEQGGYRSPRDARAISWRSWGRETEPRLGAVVVFPHHVGFFMGRRDDGRIDLLGGNQSNRVSIAPYGIEEILAYRWPL
Physico‐chemical
properties
protein length:211 AA
molecular weight:23459,3 Da
isoelectric point:5,05
hydropathy:-0,30
Representative Protein Details
Accession
4H3SU
Protein name
4H3SU
Sequence length
215 AA
Molecular weight
23785,35030 Da
Isoelectric point
5,10911
Sequence
MPRYFRATTTFNIRSGPGANFEDLGDLPQGQAIEEIGDGWCPVLLDDDITLGWRSRKYLVEISQEDLDPEETPTATPEGGADPVWIKWAKSKLGQKEVPGAADNPEIASWYHLTTLPKSYWHDSTAWCSVFCCAAMELNNIKSPRSALAFAWRTWGKKAATPQKGDIVIFSFSHVAFYLSGHGTGIIKALGGNQADSVSIADFQESSVEQYRRQP
Other Proteins in cluster: phalp2_34976
Total (incl. this protein): 3 Avg length: 214,3 Avg pI: 6,73

Protein ID Length (AA) pI
4H3SU 215 5,10911
nLvy 217 10,03394
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8921
3fgJH
1 39,5% 177 1.800E-36
2 phalp2_11009
4NiJd
22 35,2% 139 1.752E-33
3 phalp2_32166
6PqA1
486 37,5% 149 5.445E-32
4 phalp2_31967
4Vkzp
33 36,1% 144 1.235E-30
5 phalp2_28323
16UXy
9 38,8% 134 2.305E-30
6 phalp2_33920
1ZuoD
42 38,0% 142 1.327E-28
7 phalp2_22951
3ea6w
3 28,6% 220 6.302E-28
8 phalp2_12776
7ZYHD
4 28,9% 138 3.174E-25
9 phalp2_35973
5kZXR
1764 33,5% 137 2.791E-24
10 phalp2_17468
4WhMf
1 27,8% 190 3.982E-22

Domains

Domains
SH3_3
Unannotated
Representative sequence (used for alignment): 4H3SU (215 AA)
Member sequence: 4TGbA (211 AA)
1 215 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08239

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4H3SU) rather than this protein.
PDB ID
4H3SU
Method AlphaFoldv2
Resolution 80.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50