Protein

Protein accession
30uVS [EnVhog]
Representative
4RtkZ
Source
EnVhog (cluster: phalp2_31925)
Protein name
30uVS
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MAIENVKNELEELARSLGITLPGQRAESNTQTYRMPTEGQIISQHHPISQGGFATPTHPQGHFGLDILGQVGTPIYAIGPGVVSQIYNESNNPKGGNAVKIYHEDGAVVSYYAHLDKVDVSAGDEVNQNTQIGTMGTSGMIYNGKKRHTAPHLHYQVKINGTDVNPSMIASKPIGSFSRVARLAEEFRKKLGF
Physico-chemical properties
protein length: 193 AA
molecular weight: 20924.3 Da
isoelectric point: 7.14
hydropathy: -0.46
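The record does not say how the hydropathy value is computed. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues); the function name `gravy` is hypothetical and not part of the database:

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the 'hydropathy' field is a GRAVY score; the source
    record does not state which scale or method it uses.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("sequence is empty")
    # A KeyError here signals a non-standard residue code (e.g. X, B, Z).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

If the assumption holds, calling `gravy` on the sequence listed under "Protein sequence" should give a value close to the one reported here. A negative score indicates a protein that is hydrophilic on average.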
Representative Protein Details
Accession
4RtkZ
Protein name
4RtkZ
Sequence length
167 AA
Molecular weight
17860.68210 Da
Isoelectric point
6.03132
Sequence
MNDTQDLVNAIADLLGVTLPSSGSDEGQAERYQDPTRGAGEHKYPGDYSPNMATDPRHPTGHRGIDLFAPRGTPVYPLGPGKIIKKMTGSKSGKMIIIQDNNDVRSSYMHLDSFGKFNVGDEVGMNDVIGYVGDTGNAKGTSPHLHFEVRSGGSLINPTSIFGKSIN
Other Proteins in cluster: phalp2_31925
Total (incl. this protein): 5; Avg length: 195.4; Avg pI: 6.83

Protein ID   Length (AA)   pI
4RtkZ        167           6.03132
2cDjg        181           8.65548
38w3G        225           6.44926
4HkSP        211           5.86478
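The cluster summary counts this protein (30uVS) along with the four members listed in the table. A short sketch that recomputes the averages from the values given in this record; the variable names are illustrative only:

```python
from statistics import mean

# (protein ID, length in AA, isoelectric point), copied from this record.
# 30uVS is the protein described on this page; the rest come from the table.
members = [
    ("30uVS", 193, 7.14),
    ("4RtkZ", 167, 6.03132),
    ("2cDjg", 181, 8.65548),
    ("38w3G", 225, 6.44926),
    ("4HkSP", 211, 5.86478),
]

avg_length = mean([length for _, length, _ in members])
avg_pi = mean([pi for _, _, pi in members])

print(f"{len(members)} members, avg length {avg_length:.1f} AA, avg pI {avg_pi:.2f}")
```

The recomputed values agree with the summary line only when 30uVS is included, which is consistent with the "incl. this protein" note.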
Similar Clusters (pHMM search)
#    Cluster         Protein ID   # Members   Identity (%)   Alignment Length   E-value
1    phalp2_19812    57uCy        24          40.3           161                5.225E-37
2    phalp2_21998    4ZChj        22          31.7           148                1.088E-19
3    phalp2_35648    1pm6e        9           33.3           123                3.638E-16
4    phalp2_40139    8i9Qd        8           32.7           122                4.967E-16
5    phalp2_5764     7GfkS        23          28.4           109                4.389E-15
6    phalp2_27075    2pChT        5           33.0           121                4.389E-15
7    phalp2_2246     4s1ni        19          28.5           126                1.116E-14
8    phalp2_11252    6Gz7I        49          29.7           141                1.339E-13
9    phalp2_12793    8gvZT        1           27.2           143                1.339E-13
10   phalp2_24104    8fsA2        7           36.5           115                6.311E-13

Domains

Representative sequence (used for alignment): 4RtkZ (167 AA)
Member sequence: 30uVS (193 AA)
Domain architecture figure (axis: residues 1 to 167 of the representative). Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

        Name                      Taxonomy ID             Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG [NCBI]   No lineage information
Host    No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 30uVS
Method: AlphaFoldv2
Resolution: 88.32
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
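The confidence legend can be read as a simple threshold lookup. A minimal sketch, with two assumptions the record does not confirm: that the "Resolution" value reported for an AlphaFoldv2 model is a mean pLDDT, and that scores falling exactly on a boundary (90, 70, 50) belong to the lower band, since the legend uses strict inequalities. The function name `plddt_band` is hypothetical:

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band; the legend itself leaves them unspecified.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumption: the 'Resolution' fields in this record hold mean pLDDT values.
for protein_id, score in [("30uVS", 88.32), ("4RtkZ", 86.76)]:
    print(protein_id, plddt_band(score))
```

Under these assumptions, both the model for this protein and the model for the cluster representative fall in the "High" band.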

The structures below correspond to the cluster representative (4RtkZ) rather than this protein.
PDB ID: 4RtkZ
Method: AlphaFoldv2
Resolution: 86.76
Chain position: -