Protein

Protein accession
4T7DJ [EnVhog]
Representative
6Dn98
Source
EnVhog (cluster: phalp2_2584)
Protein name
4T7DJ
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRLRSVLLLLLLPLTALEQVNDERFLDCVVQKEGYTQDHIGPNGERSIYAITYAVWTQHMGKRPFALCTLRPDLARECALAHVRWLKQALAQAGMPQSPFTLGWAWHRGIRGYLGDIKRGLKSDYALEMANLYGVRAKNAN
Physico‐chemical
properties
protein length:141 AA
molecular weight:16009,4 Da
isoelectric point:9,47
hydropathy:-0,18
Representative Protein Details
Accession
6Dn98
Protein name
6Dn98
Sequence length
135 AA
Molecular weight
14949,12030 Da
Isoelectric point
9,73203
Sequence
MIRLLFILCVLCASGVRTSAADLDVGRMADALGMKETGLQWDGQPGPAGELSAYQITAGVWSQHMRPLHFSQARNPELARLCAVRHLRWLIQQIGARGLSITPQRVATAWHYGLSRARGRTQWGLEVANLYNDLP
Other Proteins in cluster: phalp2_2584
Total (incl. this protein): 10 Avg length: 157,3 Avg pI: 8,72

Protein ID Length (AA) pI
6Dn98 135 9,73203
3V4Bz 183 9,16484
3WUpI 161 9,13009
4I3k6 146 6,63711
4JysO 177 7,94768
5y7NN 135 9,17284
6UdoY 134 6,97138
6yx1Z 185 9,13899
8mXe2 176 9,81069
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24226
2SbQh
14 34,2% 140 8.188E-26
2 phalp2_16115
4fBEl
12 34,9% 106 6.287E-20
3 phalp2_18954
3NSCQ
6 34,8% 109 3.035E-19
4 phalp2_39860
14Mmd
7 30,0% 123 7.802E-19
5 phalp2_8263
3OU5
4 28,5% 140 7.116E-15
6 phalp2_26418
1Jbra
6 27,6% 112 9.741E-15
7 phalp2_20608
4HuY8
4 28,0% 121 2.498E-14
8 phalp2_21788
49rGR
10 28,3% 113 2.498E-14
9 phalp2_37693
46RgI
40 29,3% 116 4.680E-14
10 phalp2_14452
4GjIY
569 24,3% 144 3.075E-13

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 6Dn98 (135 AA)
Member sequence: 4T7DJ (141 AA)
1 135 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4T7DJ
Method AlphaFoldv2
Resolution 91.94
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6Dn98) rather than this protein.
PDB ID
6Dn98
Method AlphaFoldv2
Resolution 88.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50