Protein

Protein accession
5KwV [EnVhog]
Representative
35O74
Source
EnVhog (cluster: phalp2_25472)
Protein name
5KwV
Lysin probability
94%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
LIKPILILFAVACYLAPLQVEALTDDFYDYRQMQKEEVIEPERTFTEQDVNLLARVLWAEARGECREGQLMVAQTILDRMNYGEWGNTLESVVFASGQFVVGRYVNDYLLEVARAALNGERYNENAIIFHFRMTASMCDWWSPRLGRIGAHTYYGWER
Physico‐chemical
properties
protein length:158 AA
molecular weight:18398,8 Da
isoelectric point:4,86
hydropathy:-0,14
Representative Protein Details
Accession
35O74
Protein name
35O74
Sequence length
143 AA
Molecular weight
16249,32700 Da
Isoelectric point
6,43062
Sequence
MKNAITVLCFILILAGLSISPTAESEAELLARLLYAECRGESDEGRLAVAQCVLDRVSTDHRDFRRQDTIRKVITAKNQFAKPGELTDELLDVAKRAIAGERAFPDHEVLFFRATKSTDDWWDKYIGHIGRHAFFGRERIKND
Other Proteins in cluster: phalp2_25472
Total (incl. this protein): 5 Avg length: 144,2 Avg pI: 7,26

Protein ID Length (AA) pI
35O74 143 6,43062
1rHMR 135 8,71885
35Ocs 145 6,82911
5OiDg 140 9,43857
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_34576
4ZO4j
9 38,3% 120 1.171E-19
2 phalp2_30147
3lcU7
46 36,5% 126 7.704E-16
3 phalp2_17998
1gOqM
126 32,2% 124 1.442E-15
4 phalp2_26540
88MeS
20 27,6% 141 1.442E-15
5 phalp2_39855
13daP
6 30,9% 126 1.973E-15
6 phalp2_3540
4yvcs
35 31,9% 122 1.293E-14
7 phalp2_14872
6oPp
1 35,0% 117 8.466E-14
8 phalp2_18913
23yi7
11 27,8% 104 7.563E-13
9 phalp2_15984
7KpQ6
448 29,5% 159 7.563E-13
10 phalp2_19006
135l5
10 23,2% 125 1.034E-12

Domains

Domains
Disordered region
Hydro_2
Representative sequence (used for alignment): 35O74 (143 AA)
Member sequence: 5KwV (158 AA)
1 143 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (35O74) rather than this protein.
PDB ID
35O74
Method AlphaFoldv2
Resolution 85.03
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50