Protein

Protein accession: 6IGh1 [EnVhog]
Representative: 4KM2Z
Source: EnVhog (cluster: phalp2_15940)
Protein name: 6IGh1
Lysin probability: 97%
PhaLP type: endolysin (probability: 98%, predicted by ML model)
Protein sequence:
MFKPKDGNNFVLPVKHRTQIDNNGGTGYRECFLTCATMLADYLLDGQLTDTAANMGSSEPEDVYAQELAKHGDTTDWTAEINTLRAFGIEAYASTTASLNDVAHALQCGVPVILGTAYKGSGHMVLAVGRSPLGFTILCPNGIRDGASNDWIVRFYSESEAKPDQFSWDMLKRVFVDMGSESGWSVFVTAVNGEKTGVKSGL
Physico‐chemical properties
protein length: 202 AA
molecular weight: 21829,3 Da
isoelectric point: 4,91
hydropathy: -0,17
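The length and hydropathy figures above can be cross-checked directly from the sequence. The sketch below assumes the hydropathy value is a GRAVY score (mean per-residue hydropathy) on the Kyte-Doolittle scale; the record does not state which scale it uses, so treat that as an assumption. The names `KD_SCALE`, `SEQ` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumption: the record's "hydropathy" is a GRAVY score on this scale.
KD_SCALE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 6IGh1 as listed in this record, split only for readability.
SEQ = (
    "MFKPKDGNNFVLPVKHRTQIDNNGGTGYRECFLTCATMLADYLLDGQLTD"
    "TAANMGSSEPEDVYAQELAKHGDTTDWTAEINTLRAFGIEAYASTTASLN"
    "DVAHALQCGVPVILGTAYKGSGHMVLAVGRSPLGFTILCPNGIRDGASND"
    "WIVRFYSESEAKPDQFSWDMLKRVFVDMGSESGWSVFVTAVNGEKTGVKSGL"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD_SCALE[aa] for aa in seq) / len(seq)


length = len(SEQ)
score = gravy(SEQ)
print(f"length: {length} AA, GRAVY: {score:.2f}")
```

Molecular weight and isoelectric point are left out because they depend on the residue mass table and pKa set used, neither of which is given here.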
Representative Protein Details
Accession: 4KM2Z
Protein name: 4KM2Z
Sequence length: 190 AA
Molecular weight: 20942,99820 Da
Isoelectric point: 6,50212
Sequence:
MPVIPVSYYYQTDNLSGQGYRECSSTSNAALANHLLGNQFDRDAAIKGISQPEQIYIERLRKYGDTTDHNANTRCLQSFGIESAFFTNLTNADYFKSIANNIPMVLGLIYKGGGHIVLGVGHSENRKSIIINDPFGSRDGKSDNWLSTSPESGKGDVYSLNTFNMLWEGHLGQGYGRRVYSVNGISTNLS
Other Proteins in cluster: phalp2_15940
Total (incl. this protein): 6; avg length: 193,7 AA; avg pI: 5,62

Protein ID Length (AA) pI
4KM2Z 190 6,50212
4XOsi 206 7,08262
5Hdir 195 4,69129
6VxMi 205 5,43457
7WJzd 164 5,09660
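The cluster averages can be reproduced from the five members listed above plus this protein (6IGh1, 202 AA, pI 4,91). This is a minimal sketch; `parse_decimal` and `members` are illustrative names, and the decimal-comma handling simply mirrors how numbers are written in this record.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '6,50212' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as written in the record)
members = [
    ("6IGh1", 202, "4,91"),      # this protein, from its own properties
    ("4KM2Z", 190, "6,50212"),
    ("4XOsi", 206, "7,08262"),
    ("5Hdir", 195, "4,69129"),
    ("6VxMi", 205, "5,43457"),
    ("7WJzd", 164, "5,09660"),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(parse_decimal(pi) for _, _, pi in members)

print(f"members: {len(members)}")
print(f"avg length: {avg_length:.1f} AA")
print(f"avg pI: {avg_pi:.2f}")
```

Including 6IGh1 is what makes the total six; the table itself lists only the other five members.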
Similar Clusters (pHMM search)
#   Cluster                 # Members  Identity (%)  Alignment Length  E-value
1   phalp2_6070 (4WjHW)     2          33,1%         172               1.298E-42
2   phalp2_6277 (6MCzG)     1588       28,9%         183               3.689E-31
3   phalp2_29834 (276so)    38         27,3%         183               5.885E-19
4   phalp2_16156 (4uGeP)    55         25,9%         166               3.788E-18
5   phalp2_15789 (45LhM)    112        24,4%         180               1.848E-15
6   phalp2_10798 (3VYNI)    16         24,2%         173               1.017E-13
7   phalp2_5654 (4rudF)     12         30,0%         183               2.558E-13
8   phalp2_23206 (7Ccqz)    29         26,7%         146               1.188E-12
9   phalp2_16130 (4iX7k)    10         24,0%         166               3.327E-09
10  phalp2_27700 (6MLNR)    2          22,2%         175               8.262E-09

Domains

Representative sequence (used for alignment): 4KM2Z (190 AA)
Member sequence: 6IGh1 (202 AA)
Domain positions follow the representative sequence above (axis: residues 1 to 190); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF13529

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 6IGh1
Method: AlphaFoldv2
Resolution: 94.67
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
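The confidence bands above amount to a simple threshold lookup. The sketch below assumes that the value listed under "Resolution" for these AlphaFold models is a mean pLDDT score, which the record does not state. The legend also leaves the exact boundaries (90, 70, 50) unassigned; here they fall into the lower band, which is an arbitrary choice. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) are placed in
    the lower band, since the legend does not say where they belong.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Values listed in this record for 6IGh1 and the representative 4KM2Z,
# read as mean pLDDT (an assumption, see above).
for protein_id, value in [("6IGh1", 94.67), ("4KM2Z", 95.18)]:
    print(protein_id, plddt_band(value))
```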

The structures below correspond to the cluster representative (4KM2Z) rather than this protein.
PDB ID: 4KM2Z
Method: AlphaFoldv2
Resolution: 95.18
Chain position: -