Protein

Protein accession
7seUM [EnVhog]
Representative
3X7Ao
Source
EnVhog (cluster: phalp2_24357)
Protein name
7seUM
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAKLPIPFNNPATYPGHSGVDYGQPRGKPFAASGPGVVRTRSQNDAGGFYIWVQYDAGPLVGYHHMDSHNGCPAVGARVQEGTRLGYVGSLGRRSTGPHLHSECACHRTTAGYWTHFDSTRVIGTGSTAGGSTPLPKPPTPAPAPIPEEETDMAKLIRNTDGTIFQLDEFGVVDPRSLLDPGAGLTETVNAMDSAYGAEQLSDREFDLVGMAARNRWDQVRAQIVADVVAALKEVLPPKA
Physico-chemical properties
protein length: 240 AA
molecular weight: 25563,3 Da
isoelectric point: 6,04
hydropathy: -0,39
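The length and hydropathy values above can be re-derived from the sequence. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the record does not state which scale was used, so the result may differ slightly from the listed value. The names `KD`, `SEQ_7SEUM`, `gravy`, and `parse_decimal_comma` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 7seUM as listed in this record, split into 60-residue lines.
SEQ_7SEUM = (
    "MAKLPIPFNNPATYPGHSGVDYGQPRGKPFAASGPGVVRTRSQNDAGGFYIWVQYDAGPL"
    "VGYHHMDSHNGCPAVGARVQEGTRLGYVGSLGRRSTGPHLHSECACHRTTAGYWTHFDST"
    "RVIGTGSTAGGSTPLPKPPTPAPAPIPEEETDMAKLIRNTDGTIFQLDEFGVVDPRSLLD"
    "PGAGLTETVNAMDSAYGAEQLSDREFDLVGMAARNRWDQVRAQIVADVVAALKEVLPPKA"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KD[aa] for aa in seq) / len(seq)


def parse_decimal_comma(text: str) -> float:
    """Convert a value written with a decimal comma, e.g. '6,04', to float."""
    return float(text.replace(",", "."))


if __name__ == "__main__":
    print("length:", len(SEQ_7SEUM), "AA")
    print("GRAVY (assumed scale):", round(gravy(SEQ_7SEUM), 2))
    print("listed pI:", parse_decimal_comma("6,04"))
```

Note that the record writes numbers with a decimal comma, so values such as `25563,3` need converting before any numeric comparison.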
Representative Protein Details
Accession
3X7Ao
Protein name
3X7Ao
Sequence length
214 AA
Molecular weight
23517,04380 Da
Isoelectric point
5,54296
Sequence
MTKLAMPFDDPTTYKGHSGVDFGEATNTPILASGPGKINWSGYVNERAGYGVIVEYDQYKGIEFLYCHQPKNGPRPKKDSRFVLGGFLGGVGSTGSRSTGPHLHLEVLNGKGAHTYDGVWLYFDKSRVVGDGSTAGGNDKPIEPTNEEEIMKPLLFQLDPAIDGRWVLVDYQNGAYWPIRNGWQLDRVREDKNVREVYGPQPIESVDGLAAVGL
Other Proteins in cluster: phalp2_24357
Total (incl. this protein): 9 Avg length: 213,9 Avg pI: 6,25

Protein ID Length (AA) pI
3X7Ao 214 5,54296
1f3mX 208 8,59771
4HAi3 200 7,89636
5hSBw 216 6,97007
5iol1 216 5,00617
6S5rc 184 5,58724
6S5vI 224 5,30503
6S61t 223 5,32879
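The cluster summary (9 proteins, average length 213,9, average pI 6,25) can be checked against the table. This is a small sketch that assumes the averages include this protein (7seUM, 240 AA, pI 6,04) alongside the eight listed members, as the "incl. this protein" wording suggests; the variable names are illustrative.

```python
# Members listed in the table, with pI kept in the record's decimal-comma form.
MEMBERS = [
    ("3X7Ao", 214, "5,54296"),
    ("1f3mX", 208, "8,59771"),
    ("4HAi3", 200, "7,89636"),
    ("5hSBw", 216, "6,97007"),
    ("5iol1", 216, "5,00617"),
    ("6S5rc", 184, "5,58724"),
    ("6S5vI", 224, "5,30503"),
    ("6S61t", 223, "5,32879"),
]

# This protein is not in the table but is counted in the totals.
THIS_PROTEIN = ("7seUM", 240, "6,04")


def to_float(text: str) -> float:
    """Parse a decimal-comma number such as '5,54296'."""
    return float(text.replace(",", "."))


all_proteins = MEMBERS + [THIS_PROTEIN]
total = len(all_proteins)
avg_length = sum(length for _, length, _ in all_proteins) / total
avg_pi = sum(to_float(pi) for _, _, pi in all_proteins) / total

print(total, round(avg_length, 1), round(avg_pi, 2))
```

With the values from this record, the computed figures agree with the listed summary.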
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_13775 (6UeIZ) 1 29,6% 192 2.475E-21
2 phalp2_11524 (PVZf) 4 34,9% 186 2.936E-20
3 phalp2_40313 (3epNU) 22 23,6% 190 1.026E-15
4 phalp2_14782 (6W4yG) 33 26,0% 150 5.215E-12
5 phalp2_33817 (1joRs) 29 29,0% 148 2.369E-11
6 phalp2_29729 (1mI3z) 7 20,1% 149 3.205E-11
7 phalp2_29414 (1iBPp) 693 30,0% 133 3.458E-07
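To work with the similar-cluster hits programmatically, the rows can be held as records and filtered by E-value. The sketch below is illustrative only: the `Hit` type and the cutoff of 1e-10 are assumptions made for the example, not thresholds used by the database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    protein_id: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


def pct(text: str) -> float:
    """Parse an identity value such as '29,6%' into 29.6."""
    return float(text.rstrip("%").replace(",", "."))


# Values copied from the pHMM search table in this record.
HITS = [
    Hit("phalp2_13775", "6UeIZ", 1, pct("29,6%"), 192, float("2.475E-21")),
    Hit("phalp2_11524", "PVZf", 4, pct("34,9%"), 186, float("2.936E-20")),
    Hit("phalp2_40313", "3epNU", 22, pct("23,6%"), 190, float("1.026E-15")),
    Hit("phalp2_14782", "6W4yG", 33, pct("26,0%"), 150, float("5.215E-12")),
    Hit("phalp2_33817", "1joRs", 29, pct("29,0%"), 148, float("2.369E-11")),
    Hit("phalp2_29729", "1mI3z", 7, pct("20,1%"), 149, float("3.205E-11")),
    Hit("phalp2_29414", "1iBPp", 693, pct("30,0%"), 133, float("3.458E-07")),
]

EVALUE_CUTOFF = 1e-10  # hypothetical cutoff chosen for illustration

# Keep hits below the cutoff, then rank them by percent identity.
strong = [h for h in HITS if h.evalue < EVALUE_CUTOFF]
by_identity = sorted(strong, key=lambda h: h.identity_pct, reverse=True)

for h in by_identity:
    print(h.cluster, h.protein_id, h.identity_pct, h.evalue)
```

Ranking by identity rather than E-value changes the order, because the hit with the lowest E-value is not the one with the highest identity.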

Domains

Regions shown: Unannotated, Disordered region
Representative sequence (used for alignment): 3X7Ao (214 AA)
Member sequence: 7seUM (240 AA)
Axis: 1 to 214 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome [NCBI]
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
7seUM
Method AlphaFoldv2
Resolution 68.44
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
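The confidence bands above can be written as a small lookup. This is a sketch under one stated assumption: the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so the function below places each boundary value in the lower band. The function name `plddt_band` is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, since the legend itself leaves them unassigned.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 82.5, 61.0, 42.0):
        print(score, plddt_band(score))
```

Applied per residue, the same function gives the colour band for each position in a predicted model.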

The structures below correspond to the cluster representative (3X7Ao) rather than this protein.
PDB ID
3X7Ao
Method AlphaFoldv2
Resolution 61.48
Chain position -