Protein

Protein accession: 2cDjg [EnVhog]
Representative: 4RtkZ
Source: EnVhog (cluster: phalp2_31925)
Protein name: 2cDjg
Lysin probability: 99%
PhaLP type: endolysin (probability: 93%, predicted by ML model)
Protein sequence
MYKKSGTLDDVLREYESIVRMLWGKNPSGTEMSQYEEPSQNTAYQSPLKGDSKLVSRHHPGVPTKTHAAGHFGLDLKQPRGSEVYAIGPGVVARTGTGAKKGGNWVTTHHEDGKVSAYYAHLDSINVSPGDKVDNNTVIGTLGDTGNARYFPHLHYQVKVDGAWVDPLKINGKEVGSLSKG
Physico-chemical properties
protein length: 181 AA
molecular weight: 19615,6 Da
isoelectric point: 8,66
hydropathy: -0,67
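The length and hydropathy values above can be recomputed from the sequence. The sketch below assumes that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue); the record does not name the scale, so the result may differ slightly from the reported -0,67. The names `KD`, `SEQ_2CDJG` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy value for each of the 20 standard residues.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Member sequence 2cDjg, copied from the record.
SEQ_2CDJG = (
    "MYKKSGTLDDVLREYESIVRMLWGKNPSGTEMSQYEEPSQNTAYQSPLKGDSKLVSRHHP"
    "GVPTKTHAAGHFGLDLKQPRGSEVYAIGPGVVARTGTGAKKGGNWVTTHHEDGKVSAYYA"
    "HLDSINVSPGDKVDNNTVIGTLGDTGNARYFPHLHYQVKVDGAWVDPLKINGKEVGSLSKG"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"protein length: {len(SEQ_2CDJG)} AA")
    # Assumption: the database reports GRAVY on the Kyte-Doolittle scale.
    print(f"hydropathy (GRAVY): {gravy(SEQ_2CDJG):.2f}")
```

Molecular weight and isoelectric point depend on the residue mass table and pKa set used, which the record does not specify, so they are left out of this sketch.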
Representative Protein Details
Accession: 4RtkZ
Protein name: 4RtkZ
Sequence length: 167 AA
Molecular weight: 17860,68210 Da
Isoelectric point: 6,03132
Sequence
MNDTQDLVNAIADLLGVTLPSSGSDEGQAERYQDPTRGAGEHKYPGDYSPNMATDPRHPTGHRGIDLFAPRGTPVYPLGPGKIIKKMTGSKSGKMIIIQDNNDVRSSYMHLDSFGKFNVGDEVGMNDVIGYVGDTGNAKGTSPHLHFEVRSGGSLINPTSIFGKSIN
Other Proteins in cluster: phalp2_31925
Total (incl. this protein): 5
Avg length: 195,4 AA
Avg pI: 6,83

Protein ID   Length (AA)   pI
4RtkZ        167           6,03132
30uVS        193           7,14264
38w3G        225           6,44926
4HkSP        211           5,86478
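The cluster averages include this protein (2cDjg, 181 AA, pI 8,66), which is not listed among the four other members. A minimal check of that arithmetic, assuming the averages are plain arithmetic means and that numbers use a decimal comma (`parse_decimal` is a hypothetical helper):

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '6,03132' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) as given in the record.
members = [
    ("4RtkZ", 167, "6,03132"),
    ("30uVS", 193, "7,14264"),
    ("38w3G", 225, "6,44926"),
    ("4HkSP", 211, "5,86478"),
    ("2cDjg", 181, "8,66"),  # this protein, taken from its own property block
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

print(f"Avg length: {avg_length:.1f}")  # 195.4
print(f"Avg pI: {avg_pi:.2f}")          # 6.83
```

Both values agree with the reported averages, which confirms that the totals count all five proteins.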
Similar Clusters (pHMM search)
#    Cluster (protein ID)     # Members   Identity (%)   Alignment Length   E-value
1    phalp2_19812 (57uCy)     24          40,3           161                5.225E-37
2    phalp2_21998 (4ZChj)     22          31,7           148                1.088E-19
3    phalp2_35648 (1pm6e)     9           33,3           123                3.638E-16
4    phalp2_40139 (8i9Qd)     8           32,7           122                4.967E-16
5    phalp2_5764 (7GfkS)      23          28,4           109                4.389E-15
6    phalp2_27075 (2pChT)     5           33,0           121                4.389E-15
7    phalp2_2246 (4s1ni)      19          28,5           126                1.116E-14
8    phalp2_11252 (6Gz7I)     49          29,7           141                1.339E-13
9    phalp2_12793 (8gvZT)     1           27,2           143                1.339E-13
10   phalp2_24104 (8fsA2)     7           36,5           115                6.311E-13
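For downstream filtering, the pHMM hits can be loaded into typed records. This is a sketch only: it assumes each hit has first been written as one whitespace-separated line (rank, cluster, protein ID, members, identity, alignment length, E-value), which is a hypothetical export format and not one the database is known to provide. `ClusterHit` and `parse_hit` are illustrative names.

```python
from dataclasses import dataclass


@dataclass
class ClusterHit:
    rank: int
    cluster: str
    protein_id: str
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float


def parse_hit(row: str) -> ClusterHit:
    """Parse one whitespace-separated hit line into a ClusterHit."""
    rank, cluster, protein_id, members, identity, aln_len, e_value = row.split()
    return ClusterHit(
        rank=int(rank),
        cluster=cluster,
        protein_id=protein_id,
        members=int(members),
        # Identity uses a decimal comma and may carry a trailing percent sign.
        identity_pct=float(identity.rstrip("%").replace(",", ".")),
        alignment_length=int(aln_len),
        # E-values already use a decimal point and exponent notation.
        e_value=float(e_value),
    )


# Three hits from the record, in the assumed one-line format.
rows = [
    "1 phalp2_19812 57uCy 24 40,3% 161 5.225E-37",
    "2 phalp2_21998 4ZChj 22 31,7% 148 1.088E-19",
    "10 phalp2_24104 8fsA2 7 36,5% 115 6.311E-13",
]
hits = [parse_hit(r) for r in rows]
best = min(hits, key=lambda h: h.e_value)
print(best.cluster, best.e_value)
```

Note that identity and E-value use different decimal conventions in the record, so each field needs its own conversion.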

Domains

Representative sequence (used for alignment): 4RtkZ (167 AA)
Member sequence: 2cDjg (181 AA)
[Domain architecture figure: the axis spans residues 1 to 167 of the representative. Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis. Legend categories: EAD, CBD, Linker, Disordered, Unannotated.]
Pfam accessions: PF01551

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4RtkZ) rather than this protein.
PDB ID: 4RtkZ
Method: AlphaFold v2
Resolution: 86.76
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
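The confidence bands can be expressed as a small lookup. The legend uses strict inequalities and does not say where the exact boundary values 90, 70 and 50 belong; this sketch assumes they fall into the lower band. `plddt_band` is an illustrative name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: exactly 50 is treated as "Very low", like values below 50.
    return "Very low"


for score in (95.0, 86.76, 60.0, 30.0):
    print(score, plddt_band(score))
```

If the 86.76 listed under "Resolution" is a mean pLDDT for the AlphaFold model (the record does not say so), it would fall in the High band.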