Protein

Protein accession
1SgIF [EnVhog]
Representative
1EXQd
Source
EnVhog (cluster: phalp2_5084)
Protein name
1SgIF
Lysin probability
99%
PhaLP type
VAL
Probability: 92% (predicted by ML model)
Protein sequence
MKQYYGDDVRWFVGVVTNNLDPLQLGRVQVRIFGIHSRKVSEIPNHALPWATVLQPSTSGGTSGIGMMPQILPGAQVFGVFLDGKASQLPCIFGVMPKIEIPSEQQLSNQQDKAIQYDIGYGEGQVDPALARAAGLNTYSSPASLGPVVPVGTTRVEQAFNFFTARGFTEKQSAGIVGNLIAEVGPNLPEHGPRGDGGRAAGIAQWHPGRRNIFEQVYGKPWQDSTFTDQLQFIVWELSNSDSYSGNLNKRAGDLLKGTDSVAMAATIFDEKYERSSGIARQKRINYAHSVYDQFARV
Physico‐chemical
properties
protein length:298 AA
molecular weight:32519,2 Da
isoelectric point:6,97
hydropathy:-0,30
Representative Protein Details
Accession
1EXQd
Protein name
1EXQd
Sequence length
219 AA
Molecular weight
23736,67140 Da
Isoelectric point
5,39649
Sequence
MTLTIPTEFYGDNVRWFLGVVEDINDPLNLGRVRVRVFGIHDENLQNISEADLPWASVVVPIIDPGVSGMVQPFGVQPGAQVFGMFMDGKTSQVPLVLGSIPHRDDLTVENTSVDTTFVPLANPDHQITKPNRRANITKKPIPLVEGGNEEKAFNFLKSYLETKGAAYAAEGAAGLVGNFLHEAGAGLKPDTQERKPIAGRGGLGLAQWTGPRRVQLEN
Other Proteins in cluster: phalp2_5084
Total (incl. this protein): 22 Avg length: 274,7 Avg pI: 6,47

Protein ID Length (AA) pI
1EXQd 219 5,39649
141Y9 222 6,30449
1Oq7W 177 6,27584
1RNxk 215 6,83525
1RsCR 295 5,99887
1T6WY 293 5,30452
1TduR 297 5,23177
1UDzO 181 6,17029
1UXx8 293 5,46072
1V3Kk 311 8,57618
1VAMb 294 6,33047
1VyA5 292 6,32865
1gi2S 297 5,43053
1wv2j 295 8,57444
22CLY 293 7,85310
22pfY 296 6,65786
2QRBR 297 6,44693
6Brft 294 7,02566
6JqQ7 292 6,31654
6Ovlu 295 6,52827
QhJG 297 6,38651
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14925
8EnYA
487 43,3% 217 1.881E-65
2 phalp2_30820
8E2Jx
328 34,0% 226 1.432E-44
3 phalp2_14881
bzzZ
20 33,3% 219 6.733E-30
4 phalp2_27931
xkUJ
41 32,0% 215 2.641E-25
5 phalp2_18785
18JLC
4 30,7% 208 2.105E-21
6 phalp2_27118
2NbEK
1 25,7% 214 9.424E-13
7 phalp2_23711
Fj44
3 27,3% 205 2.338E-12

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1EXQd) rather than this protein.
PDB ID
1EXQd
Method AlphaFoldv2
Resolution 80.21
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50