Protein

Protein accession
5ksoF [EnVhog]
Representative
6G0iF
Source
EnVhog (cluster: phalp2_11249)
Protein name
5ksoF
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSRFRSDEITLLLPSFQPYVQAVLDDMTAQGFKPILFDGLRTPAEALRNAKRGTGKVQSPHLYGLAADVICDDHGWSCREKKCKFYAKLVAAVRSRGLITGADFHNAQGKPMVDEPHFQGLPPAQERKVRELGMSDESIPERDAIVAAWLKQHAKL
Physico‐chemical
properties
protein length:156 AA
molecular weight:17387,8 Da
isoelectric point:8,81
hydropathy:-0,38
Representative Protein Details
Accession
6G0iF
Protein name
6G0iF
Sequence length
141 AA
Molecular weight
15577,70940 Da
Isoelectric point
8,80014
Sequence
MHPQSDTALLVPAFWERLEPALRELRGQGFHPVVHETLRSLARSEALVAAGKSKSVGPSMHCYGLAADVVCGVHGYDCKRHHCPFFTEYGLAVERHRLTWGGRWKTLVDSPHSQAIPVALQNKARAMPADQLDAFVRSVLG
Other Proteins in cluster: phalp2_11249
Total (incl. this protein): 24 Avg length: 152,9 Avg pI: 8,46

Protein ID Length (AA) pI
6G0iF 141 8,80014
16Lo2 152 9,35238
1FTCH 168 10,18119
24Sr7 145 8,32346
34znX 189 9,50453
40UE1 146 10,31238
4A1Dn 147 9,21551
4IJT4 122 7,16486
4N52F 164 7,20056
4gO9B 151 8,51326
5DmDH 164 6,05144
5GLza 120 8,01963
5H2Ti 176 8,66837
5HcPg 148 6,76346
6FEq3 153 9,12996
6FSLp 151 9,35032
6Fc42 185 8,83173
6G0XN 154 9,46655
6G0yC 158 8,66528
6G1DU 151 6,36667
6M3z2 132 8,22393
7Vvl6 145 9,61999
84rhP 152 6,40527
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40042
2aaM9
1 35,3% 133 2.386E-37
2 phalp2_19594
49p7r
66 28,9% 145 1.537E-29
3 phalp2_1337
15l6q
1516 31,9% 122 5.161E-22
4 phalp2_35851
4JEbx
5 29,6% 118 1.817E-21
5 phalp2_27726
6US3d
23 33,3% 126 2.252E-20
6 phalp2_9691
244zs
82 34,4% 125 2.788E-19
7 phalp2_26569
8vSQ6
2 30,6% 124 2.788E-19
8 phalp2_35414
f7cN
32 32,5% 120 2.517E-18
9 phalp2_21870
4B18v
3 27,0% 100 7.970E-17
10 phalp2_19514
3nrhB
1 35,1% 108 3.829E-16

Domains

Domains
Representative sequence (used for alignment): 6G0iF (141 AA)
Member sequence: 5ksoF (156 AA)
1 141 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6G0iF) rather than this protein.
PDB ID
6G0iF
Method AlphaFoldv2
Resolution 94.85
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50