Protein

Protein accession
7Kwug [EnVhog]
Representative
38KGa
Source
EnVhog (cluster: phalp2_32863)
Protein name
7Kwug
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTETQLRQKYIQGLLDIEGKCAEGGANHSALLAVYNTLDPLPRGVKMLPSYDWCAATITANAIRQDMADIFAKECSCTLQIRQWQRMRRWVERDDYVPQTGDIIYYAWDANGSGDWAKSVDHVGAVVRCEGGYITAIEGNYKNNVSRRRIPINYKFIRGFAVPDYASLATEGNDMTRYRKIEDIPKGYQAETQELIDLGFNGYSDERGLYVTEDMLRTMIVNLRMCKALIAAIPDIDKESLFEEFKKNLKLNIAVEVE
Physico-chemical properties
protein length: 258 AA
molecular weight: 29391,2 Da
isoelectric point: 5,31
hydropathy: -0,38
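The length, molecular weight and hydropathy values above can be approximated directly from the sequence. The sketch below is a minimal illustration, not the database's own pipeline: it assumes hydropathy means the GRAVY score on the Kyte-Doolittle scale and that molecular weight is the sum of average residue masses plus one water. The function names `gravy` and `molecular_weight` are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # mass of the water added back for the free termini

# Sequence of 7Kwug, copied from the record above.
SEQ = (
    "MTETQLRQKYIQGLLDIEGKCAEGGANHSALLAVYNTLDPLPRGVKMLPSYDWCAATITANAIRQDMADI"
    "FAKECSCTLQIRQWQRMRRWVERDDYVPQTGDIIYYAWDANGSGDWAKSVDHVGAVVRCEGGYITAIEGN"
    "YKNNVSRRRIPINYKFIRGFAVPDYASLATEGNDMTRYRKIEDIPKGYQAETQELIDLGFNGYSDERGLY"
    "VTEDMLRTMIVNLRMCKALIAAIPDIDKESLFEEFKKNLKLNIAVEVE"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da: residue masses plus one water."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


if __name__ == "__main__":
    print("length:", len(SEQ), "AA")
    print("molecular weight:", round(molecular_weight(SEQ), 1), "Da")
    print("hydropathy (GRAVY):", round(gravy(SEQ), 2))
```

If the database uses different mass tables or another hydropathy scale, the printed values will differ slightly from those in the record. The isoelectric point is left out because it depends on the chosen pKa set.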
Representative Protein Details
Accession
38KGa
Protein name
38KGa
Sequence length
268 AA
Molecular weight
30289,01030 Da
Isoelectric point
9,62257
Sequence
MTEMELRKKYVSTIKSWHGRKESNGTHKPIIDIYNNDKPLPRSYKVKYTDSWCATTVSAAAIKAEKETGIKFTSIIPKECSCNYQIHLFQKLGRWEERDSYVPQIGDIIYYDWNDNGVGDDKGSSEHVGVVIEVTSTHIKVEEGNKSDAVGTRTIAINGRYIRGFGKPNYKSLATSSKKKDDSKDKKKSDPKKKEITKKTPKKGDKIVLKNASLYKASTSTKASDHITGTYYFWDAKNYNGKTRITTSPKYVGNTKQITGWISAVHLK
Other Proteins in cluster: phalp2_32863
Total (incl. this protein): 2; Avg length: 263,0; Avg pI: 7,47

Protein ID Length (AA) pI
38KGa 268 9,62257
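The cluster averages are consistent with the two members listed in this record. A minimal check, assuming plain arithmetic means over this protein (7Kwug) and the representative (38KGa); the `members` dictionary and `decimal_comma` helper are hypothetical names:

```python
from statistics import mean

# (length in AA, isoelectric point) for each cluster member, taken from this record.
members = {
    "7Kwug": (258, 5.31),
    "38KGa": (268, 9.62257),
}

avg_length = mean(length for length, _ in members.values())
avg_pi = mean(pi for _, pi in members.values())


def decimal_comma(value: float, digits: int) -> str:
    """Format a number with a decimal comma, as the record does."""
    return f"{value:.{digits}f}".replace(".", ",")


if __name__ == "__main__":
    print("Avg length:", decimal_comma(avg_length, 1))
    print("Avg pI:", decimal_comma(avg_pi, 2))
```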
Similar Clusters (pHMM search)
#  Cluster (protein accession)  Members  Identity (%)  Alignment length  E-value
1  phalp2_15661 (2WFec)         420      62,7          172               2.915E-65
2  phalp2_25577 (3Q2DF)         166      60,7          181               4.905E-64
3  phalp2_30942 (13AWP)         189      63,9          161               4.797E-58
4  phalp2_35905 (7DH9G)         6        50,7          189               5.011E-50
5  phalp2_27238 (3TMvJ)         21       52,6          192               6.850E-50
6  phalp2_29481 (1Jfam)         8        40,8          191               1.257E-27
7  phalp2_19272 (7YZUV)         4        35,4          175               1.477E-17
8  phalp2_14480 (4LcTU)         3        29,6          162               7.985E-11
9  phalp2_4086 (1oyct)          1        27,2          165               8.834E-08
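The similarity table mixes decimal commas (identity) with dot-notation E-values, so it needs light parsing before it can be filtered. The sketch below shows one way to do that. The cut-offs `MIN_IDENTITY` and `MAX_EVALUE` are hypothetical values chosen for illustration; they are not thresholds used by the database.

```python
# Rows from the pHMM search table: cluster, protein accession, members,
# identity, alignment length, E-value (strings kept as they appear in the record).
ROWS = [
    ("phalp2_15661", "2WFec", 420, "62,7%", 172, "2.915E-65"),
    ("phalp2_25577", "3Q2DF", 166, "60,7%", 181, "4.905E-64"),
    ("phalp2_30942", "13AWP", 189, "63,9%", 161, "4.797E-58"),
    ("phalp2_35905", "7DH9G", 6, "50,7%", 189, "5.011E-50"),
    ("phalp2_27238", "3TMvJ", 21, "52,6%", 192, "6.850E-50"),
    ("phalp2_29481", "1Jfam", 8, "40,8%", 191, "1.257E-27"),
    ("phalp2_19272", "7YZUV", 4, "35,4%", 175, "1.477E-17"),
    ("phalp2_14480", "4LcTU", 3, "29,6%", 162, "7.985E-11"),
    ("phalp2_4086", "1oyct", 1, "27,2%", 165, "8.834E-08"),
]

MIN_IDENTITY = 50.0  # hypothetical cut-off, percent identity
MAX_EVALUE = 1e-30   # hypothetical cut-off


def parse_percent(text: str) -> float:
    """Convert a decimal-comma percentage such as '62,7%' to a float."""
    return float(text.rstrip("%").replace(",", "."))


def strong_hits(rows):
    """Return cluster names passing both the identity and E-value cut-offs."""
    return [
        cluster
        for cluster, _acc, _members, identity, _length, evalue in rows
        if parse_percent(identity) >= MIN_IDENTITY and float(evalue) <= MAX_EVALUE
    ]


if __name__ == "__main__":
    for name in strong_hits(ROWS):
        print(name)
```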

Domains
Unannotated (both sequence bars)
Representative sequence (used for alignment): 38KGa (268 AA)
Member sequence: 7Kwug (258 AA)
Axis: 1 to 268 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID 7Kwug
Method AlphaFoldv2
Resolution 90.11
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
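The confidence legend can be expressed as a small lookup. This is a minimal sketch under two assumptions that the record does not confirm: that the value listed under "Resolution" for an AlphaFold model is a mean pLDDT score, and that scores falling exactly on a boundary (90, 70, 50) belong to the lower band, since the legend leaves those cases open. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70 or 50) are assigned to the lower band;
    this is an assumption, since the legend uses strict inequalities only.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Values listed under "Resolution" in this record, read here as mean pLDDT.
    for accession, score in [("7Kwug", 90.11), ("38KGa", 90.41)]:
        print(accession, plddt_band(score))
```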

The structures below correspond to the cluster representative (38KGa) rather than this protein.
PDB ID 38KGa
Method AlphaFoldv2
Resolution 90.41
Chain position -