Protein

Protein accession
6wW7I [EnVhog]
Representative
5Ie4j
Source
EnVhog (cluster: phalp2_27593)
Protein name
6wW7I
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MATKIVHSSDLFYVNRVGFIKKCTVFKWLLSIILILSTFTYYHFIDLDKSKFDSMIVTSENIVKEDIKQETLLDRVEDYIHSKNPNLDDDTKSALAKTIVDESISKKISLELILGLVYVESKFDQYAVSSSGALGFFQVKPGVHRDKIAMQDNRDLYDPVTNTKIGLKVLGDCMRIHSGIRKSLSCYNGSGNDESESYANKVIKSAPKHV
Physico-chemical properties
protein length: 210 AA
molecular weight: 23711,0 Da
isoelectric point: 7,69
hydropathy: -0,17
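The length, molecular weight and hydropathy figures above can be recomputed from the protein sequence. The following is a minimal sketch, assuming average residue masses and the Kyte-Doolittle hydropathy scale; the page does not state which tables or tool it used, so the last digits may differ. The function names are illustrative only. The isoelectric point is left out because it depends on the chosen pKa set.

```python
# Average residue masses in Da (amino acid minus one water); assumed table.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.1086, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the free termini

# Kyte-Doolittle hydropathy values; assumed to be the scale behind "hydropathy".
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def molecular_weight(sequence: str) -> float:
    """Sum of average residue masses plus one water molecule, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


def gravy(sequence: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Toy input to show the calls; replace with the full sequence from the record.
toy = "MATK"
print(len(toy), round(molecular_weight(toy), 2), round(gravy(toy), 2))
```

Both functions raise `KeyError` on non-standard residue letters, which is a deliberate choice here so that ambiguous codes are not silently ignored.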
Representative Protein Details
Accession
5Ie4j
Protein name
5Ie4j
Sequence length
199 AA
Molecular weight
21816,44340 Da
Isoelectric point
9,85788
Sequence
MTKILKRSQLVYLPKIGFVSRLKLIVSALLVVLAVASCVAPTTAHARPNTMTVGNSLQDKMVAYMKRTNKSISNKVAYDLANAMLIQHYKYGIPVEVMLGQSTIESRFDQFAIGDQGELGFFQVHTKWHADRLRGLMKVGVIKTKNIYDPLTNTSVGMSKLGECMRKHGGSVQKGLMCYNGTGDGAVAYSKKVIAASKE
Other Proteins in cluster: phalp2_27593
Total (incl. this protein): 4; Avg length: 205,8; Avg pI: 9,26

Protein ID Length (AA) pI
5Ie4j 199 9,85788
3RSkq 211 9,69426
6IoaM 203 9,80856
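The cluster averages can be checked by combining the three members listed here with this protein (6wW7I, 210 AA, pI 7,69), since the total of 4 includes it. A short sketch of that arithmetic, assuming a plain arithmetic mean and converting the decimal commas used on the page; the helper name `parse_decimal` is illustrative only:

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '9,85788' to a float."""
    return float(text.replace(",", "."))


# (accession, length in AA, pI as printed on the page)
cluster_members = [
    ("5Ie4j", 199, "9,85788"),
    ("3RSkq", 211, "9,69426"),
    ("6IoaM", 203, "9,80856"),
    ("6wW7I", 210, "7,69"),  # this protein, taken from its own record
]

avg_length = sum(length for _, length, _ in cluster_members) / len(cluster_members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in cluster_members) / len(cluster_members)

print(f"Avg length: {avg_length:.2f}")  # 823 / 4
print(f"Avg pI: {avg_pi:.2f}")
```

The mean length is 823 / 4 = 205.75 and the mean pI is about 9.26, consistent with the summary values 205,8 and 9,26.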
Similar Clusters (pHMM search)
#   Cluster              # Members  Identity (%)  Alignment Length  E-value
1   phalp2_39107 2mFzR   21         28,9%         207               3.102E-24
2   phalp2_1583 83Ew7    17         31,6%         136               5.784E-24
3   phalp2_7857 eRMX     1          27,6%         141               2.743E-16
4   phalp2_9885 2tkXx    10         26,5%         147               5.082E-16
5   phalp2_25221 2aelP   7          28,2%         138               9.410E-16
6   phalp2_31447 3o77A   181        31,7%         123               8.110E-15
7   phalp2_31682 3e47a   166        26,6%         135               5.042E-12
8   phalp2_31802 4igse   1          26,4%         155               3.138E-11
9   phalp2_21025 8KK9    3          27,2%         125               3.566E-10
10  phalp2_38948 3NAZh   14         22,0%         127               8.849E-10
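Each hit in the similar-cluster listing carries seven whitespace-separated fields: rank, cluster ID, a protein accession shown with the cluster, member count, identity, alignment length and E-value. A minimal parsing sketch that reads the fields by position, so it does not matter how a row is wrapped across lines. The `ClusterHit` type and `parse_hits` function are hypothetical names introduced here and are not part of any PhaLP tooling.

```python
from typing import List, NamedTuple


class ClusterHit(NamedTuple):
    rank: int
    cluster: str
    accession: str
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float


FIELDS_PER_HIT = 7


def parse_hits(text: str) -> List[ClusterHit]:
    """Group whitespace-separated tokens into seven-field hit records."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_HIT != 0:
        raise ValueError("token count is not a multiple of seven")
    hits = []
    for start in range(0, len(tokens), FIELDS_PER_HIT):
        rank, cluster, accession, members, identity, aln_len, e_value = (
            tokens[start:start + FIELDS_PER_HIT]
        )
        hits.append(ClusterHit(
            rank=int(rank),
            cluster=cluster,
            accession=accession,
            members=int(members),
            # identity is printed with a decimal comma and a percent sign
            identity_pct=float(identity.rstrip("%").replace(",", ".")),
            alignment_length=int(aln_len),
            e_value=float(e_value),
        ))
    return hits


# First row wrapped over three lines, second row on a single line.
sample = """1 phalp2_39107
2mFzR
21 28,9% 207 3.102E-24
2 phalp2_1583 83Ew7 17 31,6% 136 5.784E-24"""

hits = parse_hits(sample)
strong = [h.cluster for h in hits if h.e_value < 1e-20]
print(strong)
```

The header line must be stripped before parsing, since its labels contain spaces and would throw off the positional grouping.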

Domains

Annotated features: Disordered region, SLT
Representative sequence (used for alignment): 5Ie4j (199 AA)
Member sequence: 6wW7I (210 AA)
Domain positions follow the representative sequence above (axis 1 to 199 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01464

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  [NCBI] UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
6wW7I
Method AlphaFoldv2
Resolution 87.35
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
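The confidence legend above maps a pLDDT score to one of four bands. A small sketch of that mapping follows. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption made here, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Return the legend's confidence band for a pLDDT score (0 to 100)."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is defined on the range 0 to 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"   # assumption: exactly 90 falls here
    if score > 50:
        return "Low"    # assumption: exactly 70 falls here
    return "Very low"   # assumption: exactly 50 falls here


for example_score in (95.0, 80.0, 60.0, 30.0):
    print(example_score, plddt_band(example_score))
```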

The structures below correspond to the cluster representative (5Ie4j) rather than this protein.
PDB ID
5Ie4j
Method AlphaFoldv2
Resolution 82.23
Chain position -