Protein

Protein accession
6CGer [EnVhog]
Representative
4Rzy7
Source
EnVhog (cluster: phalp2_11024)
Protein name
6CGer
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTALNREASISKTLTYEGGYTNDPRDPGGATNWGITIFDAR
Physico‐chemical
properties
protein length:41 AA
molecular weight:4461,8 Da
isoelectric point:4,86
hydropathy:-0,66
Representative Protein Details
Accession
4Rzy7
Protein name
4Rzy7
Sequence length
42 AA
Molecular weight
4650,14830 Da
Isoelectric point
4,91529
Sequence
MRSTFDAYMPELLRHEGGYVDHPRDPGGATNMGVTIGTLAEW
Other Proteins in cluster: phalp2_11024
Total (incl. this protein): 10 Avg length: 43,5 Avg pI: 6,16

Protein ID Length (AA) pI
4Rzy7 42 4,91529
2FdVf 53 9,18276
4Prpz 36 6,80968
50f4j 47 9,52129
5ukGU 33 5,95828
6Dp70 51 5,52767
6E3la 32 4,78052
6I2db 58 5,65437
7cbA7 42 4,42727
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_16057
3VaLH
19 57,1% 35 2.153E-12
2 phalp2_30012
2gpBx
1 56,7% 37 9.909E-11
3 phalp2_36246
7zwyO
9 57,1% 35 4.893E-10
4 phalp2_4474
3fztb
11 54,8% 31 6.310E-09
5 phalp2_18668
8Bsq6
8 48,1% 27 1.196E-08
6 phalp2_5022
1k5kd
7 54,2% 35 1.647E-08
7 phalp2_11918
2A96W
10 46,8% 32 2.268E-08
8 phalp2_3692
4ZEzq
3 54,2% 35 4.302E-08
9 phalp2_12777
80F1a
12 42,8% 42 1.548E-07
10 phalp2_23645
8DwFC
19 60,7% 28 4.047E-07

Domains

Domains
Representative sequence (used for alignment): 4Rzy7 (42 AA)
Member sequence: 6CGer (41 AA)
1 42 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05838

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Rzy7) rather than this protein.
PDB ID
4Rzy7
Method AlphaFoldv2
Resolution 87.20
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50