Protein

Protein accession: 7YxkC [EnVhog]
Representative: a1g4
Source: EnVhog (cluster: phalp2_24879)
Protein name: 7YxkC
Lysin probability: 99%
PhaLP type: endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MALFGIDVSAWQGKIDWEKVAKAGVKFAMLKAGGADDGYYEDSRFKYNYEQCKKLGIPVGAYYFTSKNCITEAQGRAEAKKFLAIIKGCQFEYPVAIDVEAQRASKSVITTAVIAFCDEMEKAGYYVVVYGSTVAGFHDRMDTSRLTRFDKWVADYRGKCYYTLPYGIWQFSSKQKINGISGNVDGDYSYKDYPTIIKNAKLNGFGKETPSEKPKEEEAKTEEVYYKIQKGDTLSAIAKKFGTTAKKIQKLNPDKIKDINLIYAGDTIRVK
Physico-chemical properties
protein length: 271 AA
molecular weight: 30491,5 Da
isoelectric point: 9,04
hydropathy: -0,46
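The page does not say how these values were computed. As a minimal sketch, the length, molecular weight, and hydropathy of a sequence could be estimated as below, assuming that "hydropathy" means the GRAVY score (mean Kyte-Doolittle index) and that molecular weight is the sum of average residue masses plus one water. The function names and the `FRAGMENT` constant are illustrative only, and results may differ slightly from the listed values if the database uses other mass tables or rounding.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the chain termini


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle index."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustration only: the first 20 residues of the 7YxkC sequence.
FRAGMENT = "MALFGIDVSAWQGKIDWEKV"
print(len(FRAGMENT), round(molecular_weight(FRAGMENT), 2), round(gravy(FRAGMENT), 2))
```

Passing the full protein sequence instead of `FRAGMENT` gives values that can be compared against the properties listed above.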
Representative Protein Details
Accession: a1g4
Protein name: a1g4
Sequence length: 151 AA
Molecular weight: 16323,21740 Da
Isoelectric point: 5,20159
Sequence
MKNGIDVSVYQGDIDWKAIKNSGIEFAIIKAGGSDAGFYKDSKFEKNYTNAKAVGMPVGAYYFVGSGCTSKADGIADAKRFLEIIKGKTFEYPVYIDLEATSPSAKAGATEACIGFCETMENAGYYCGIYASDVSGFNDRLDLSRLSKFDK
Other Proteins in cluster: phalp2_24879
Total (incl. this protein): 16; Avg length: 225,5; Avg pI: 7,23

Protein ID Length (AA) pI
a1g4 151 5,20159
1cd0e 258 5,18226
1gJIU 167 8,81968
1qvVR 255 4,95866
24Fvo 171 6,50473
38HDa 171 6,06997
3dQzW 206 8,80498
3dVOD 218 8,81226
3xRtR 336 9,54649
3zZcJ 267 6,61728
71nSR 255 5,20579
87Rry 256 7,68204
8bslj 211 9,16923
8fvBW 208 9,18579
8hdi9 207 4,94939
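The table lists the 15 other members; the cluster summary also counts 7YxkC itself (271 AA, pI 9,04). A small sketch that re-derives the summary figures from the listed values is shown here, with decimal commas written as points. The variable names are illustrative, and the rounding to two decimals is an assumption about how the page formats the average pI.

```python
from statistics import mean

# (protein ID, length in AA, pI) for the other cluster members, as listed.
OTHER_MEMBERS = [
    ("a1g4", 151, 5.20159), ("1cd0e", 258, 5.18226), ("1gJIU", 167, 8.81968),
    ("1qvVR", 255, 4.95866), ("24Fvo", 171, 6.50473), ("38HDa", 171, 6.06997),
    ("3dQzW", 206, 8.80498), ("3dVOD", 218, 8.81226), ("3xRtR", 336, 9.54649),
    ("3zZcJ", 267, 6.61728), ("71nSR", 255, 5.20579), ("87Rry", 256, 7.68204),
    ("8bslj", 211, 9.16923), ("8fvBW", 208, 9.18579), ("8hdi9", 207, 4.94939),
]
# The protein described on this page, taken from its physico-chemical properties.
THIS_PROTEIN = ("7YxkC", 271, 9.04)

members = OTHER_MEMBERS + [THIS_PROTEIN]
total = len(members)
avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(total, avg_length, round(avg_pi, 2))  # prints: 16 225.5 7.23
```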
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3425 (3TK4D) 923 54,2% 153 8.827E-53
2 phalp2_14273 (3kLGA) 7 48,3% 149 2.436E-46
3 phalp2_4390 (2mvmf) 118 48,1% 133 1.617E-45
4 phalp2_10753 (3yWC4) 1 33,5% 140 3.334E-36
5 phalp2_29946 (8mgNr) 1 45,9% 137 4.151E-35
6 phalp2_5668 (4yhqW) 7 36,1% 155 1.655E-32
7 phalp2_11218 (6q99m) 1 32,8% 152 2.058E-31
8 phalp2_3380 (3qQTc) 8 47,0% 100 2.102E-28
9 phalp2_25924 (66tGM) 6 40,3% 104 2.880E-28
10 phalp2_34287 (3mtX2) 6 36,4% 107 2.953E-22
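The identity values use decimal commas while the E-values use decimal points, so both need normalising before the hits can be compared numerically. The following is a rough sketch of that parsing plus a simple filter; the thresholds (45% identity, E-value of 1e-30) and the names `parse_identity`, `HITS`, and `filter_hits` are hypothetical choices for illustration, not cutoffs used by the database.

```python
def parse_identity(text: str) -> float:
    """Convert a decimal-comma percentage such as '54,2%' to a float."""
    return float(text.rstrip("%").replace(",", "."))


# (cluster, members, identity, alignment length, E-value) as listed above.
HITS = [
    ("phalp2_3425", 923, "54,2%", 153, "8.827E-53"),
    ("phalp2_14273", 7, "48,3%", 149, "2.436E-46"),
    ("phalp2_4390", 118, "48,1%", 133, "1.617E-45"),
    ("phalp2_10753", 1, "33,5%", 140, "3.334E-36"),
    ("phalp2_29946", 1, "45,9%", 137, "4.151E-35"),
    ("phalp2_5668", 7, "36,1%", 155, "1.655E-32"),
    ("phalp2_11218", 1, "32,8%", 152, "2.058E-31"),
    ("phalp2_3380", 8, "47,0%", 100, "2.102E-28"),
    ("phalp2_25924", 6, "40,3%", 104, "2.880E-28"),
    ("phalp2_34287", 6, "36,4%", 107, "2.953E-22"),
]


def filter_hits(hits, min_identity=45.0, max_evalue=1e-30):
    """Keep hits at or above min_identity and at or below max_evalue."""
    return [
        hit for hit in hits
        if parse_identity(hit[2]) >= min_identity and float(hit[4]) <= max_evalue
    ]


for cluster, *_ in filter_hits(HITS):
    print(cluster)
```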

Domains

Representative sequence (used for alignment): a1g4 (151 AA)
Member sequence: 7YxkC (271 AA)
Axis: positions 1 to 151 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome [NCBI] UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (a1g4) rather than this protein.
PDB ID a1g4
Method AlphaFoldv2
Resolution 97.02
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
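The confidence bands above map directly onto a small lookup function. This is only a sketch: the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption, and `plddt_band` is a hypothetical name.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0 to 100) to its confidence band."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:  # assumption: exactly 90 falls into "High"
        return "High"
    if score > 50:  # assumption: exactly 70 falls into "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls into "Very low"


for value in (95.0, 80.0, 60.0, 40.0):
    print(value, plddt_band(value))
```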