Protein

Protein accession: 6TrEY [EnVhog]
Representative: 5nEcp
Source: EnVhog (cluster: phalp2_28888)
Protein name: 6TrEY
Lysin probability: 99%
PhaLP type: endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MEVVAVAGKHAAPRKKWGRKAAVATAVGAAVVVPVALAPPVSAETSQSIPKMTKQRIAVQYAISKIGLGAYLWGGNGPTRFDCSGLTSQAWKAAGVSIPRTSQAQLAGLPRVSRANIQPGDLVVWSFRSHADHVSIYTGPIGPGGADLVDTASSHPGGGVGWSSMNRRGGTIAGIVRPAPVSSTPSPKSPAAGTYSVKPGDTLSGIARTYRVKGGWKALHRMNKSKVHNPNLIYPGQKLRIR
Physico-chemical properties
Protein length: 242 AA
Molecular weight: 25165.7 Da
Isoelectric point: 11.07
Hydropathy: -0.16
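The page does not say how the hydropathy value is derived. One common definition is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that definition; `gravy` is a hypothetical helper name, and the example peptide is only the first ten residues of the sequence above, so its score is not expected to match the value listed for the full protein.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the page's "hydropathy" is a GRAVY score. Non-standard
    residue codes raise KeyError instead of being silently skipped.
    """
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


# First ten residues of 6TrEY, for illustration only.
print(round(gravy("MEVVAVAGKH"), 2))
```

Passing the full 242-residue sequence instead of the ten-residue prefix gives a figure that can be compared with the listed hydropathy, provided the database uses the same scale.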
Representative Protein Details
Accession: 5nEcp
Protein name: 5nEcp
Sequence length: 229 AA
Molecular weight: 23388.32100 Da
Isoelectric point: 10.00532
Sequence
VCGPRAGLTRDSPAPALAAVRAAPRIATAKVQSSSAASVAVAFVRARVGGSYRYGGNGPAYDCSGLTQAAWAAAGVDIPRTSQEQLASLPRVSLSALQPGDIVGYFGGSHVTVYVGGGMVVGAENPVTGIALIPLSWGHQVPYTAVRPSGGVVTRTVNVPATRAQGAPQAALPHKALAKGTHTVRSGEWLSSIAREYHVAGGWQKLYQLNKGVVGGNADLIYPGQVLTL
Other Proteins in cluster: phalp2_28888
Total (incl. this protein): 3; Avg length: 231.3 AA; Avg pI: 10.41

Protein ID Length (AA) pI
5nEcp 229 10.00532
5Ete8 223 10.15682
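The cluster averages include this protein (6TrEY, 242 AA, pI 11.07) in addition to the two members listed in the table. A minimal sketch of that arithmetic, assuming plain arithmetic means (the page does not state how the averages are computed):

```python
from statistics import mean

# (protein ID, length in AA, pI) for every member of cluster phalp2_28888.
# The 6TrEY values come from its own physico-chemical properties block.
members = [
    ("6TrEY", 242, 11.07),
    ("5nEcp", 229, 10.00532),
    ("5Ete8", 223, 10.15682),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"members={len(members)} avg_length={avg_length:.1f} avg_pI={avg_pi:.2f}")
# members=3 avg_length=231.3 avg_pI=10.41
```

Both results agree with the summary line once rounded to the precision shown on the page.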
Similar Clusters (pHMM search)
# | Cluster (protein ID) | # Members | Identity (%) | Alignment Length | E-value
1 | phalp2_37839 (4Jmb8) | 51 | 50.2% | 237 | 8.142E-66
2 | phalp2_40559 (4Obnr) | 7 | 36.7% | 147 | 2.869E-28
3 | phalp2_13476 (4LGW5) | 2 | 23.2% | 198 | 3.179E-11
4 | phalp2_39637 (6Kn72) | 20 | 29.9% | 147 | 6.335E-10
5 | phalp2_25107 (1opzQ) | 11 | 32.1% | 193 | 8.536E-10
6 | phalp2_22955 (3fPKW) | 30 | 26.9% | 204 | 8.536E-10
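To work with these hits programmatically, the identity and E-value fields first need to be parsed into numbers. The sketch below is one way to do that and then keep only the stronger matches. `parse_number` is a hypothetical helper that accepts either a decimal comma or a decimal point, and `E_VALUE_CUTOFF` is an illustrative threshold, not a value taken from the page.

```python
def parse_number(text: str) -> float:
    """Parse values such as '50,2%', '50.2' or '8.142E-66' into a float."""
    return float(text.strip().rstrip("%").replace(",", "."))


# (cluster, members, identity, alignment length, E-value) as listed above.
hits = [
    ("phalp2_37839", 51, "50,2%", 237, "8.142E-66"),
    ("phalp2_40559", 7, "36,7%", 147, "2.869E-28"),
    ("phalp2_13476", 2, "23,2%", 198, "3.179E-11"),
    ("phalp2_39637", 20, "29,9%", 147, "6.335E-10"),
    ("phalp2_25107", 11, "32,1%", 193, "8.536E-10"),
    ("phalp2_22955", 30, "26,9%", 204, "8.536E-10"),
]

E_VALUE_CUTOFF = 1e-20  # hypothetical threshold chosen for illustration

# Keep hits at or below the cutoff, with identity and E-value as floats.
strong = [
    (cluster, parse_number(identity), parse_number(e_value))
    for cluster, _, identity, _, e_value in hits
    if parse_number(e_value) <= E_VALUE_CUTOFF
]

for cluster, identity, e_value in strong:
    print(cluster, identity, e_value)
```

With this cutoff only the first two clusters remain; a looser cutoff such as 1e-9 would keep all six.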

Domains

Disordered region
NLPC_P60
LysM
Representative sequence (used for alignment): 5nEcp (229 AA)
Member sequence: 6TrEY (242 AA)
Axis: residues 1 to 229 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00877, PF01476

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 6TrEY
Method: AlphaFoldv2
Resolution: 78.03
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
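The legend's bands can be expressed as a small lookup. This is a sketch under two assumptions that the page does not confirm: that the figure shown under "Resolution" for an AlphaFold model is a mean pLDDT, and that scores falling exactly on 90, 70 or 50 (which the strict inequalities leave unassigned) belong to the lower band. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence bands in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, because the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


print(plddt_band(78.03))  # High
```

Under the same assumptions, the value 74.99 reported for the cluster representative also falls in the "High" band.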

The structures below correspond to the cluster representative (5nEcp) rather than this protein.
PDB ID: 5nEcp
Method: AlphaFoldv2
Resolution: 74.99
Chain position: -