Protein

Protein accession
H9NC94 [UniProt]
Representative
7QhUZ
Source
UniProt (cluster: phalp2_35050)
Protein name
Gp217
Lysin probability
98%
PhaLP type
endolysin
Probability: 87% (predicted by ML model)
Protein sequence
MFSSKNEKSDQKHPILKLIESIKKFFDFDTVLSIIIMVFILWLLLASKGSAKAEMLSTDSDIVDVNLPEKFNEKSTSSKIKNKEDLYVYMKKAGIKFPELVWSQAMMESAWMNSAVFNRSNNLFGMKKSGKRDHLQIHKRNDSYSHFETIKKCVLDYKLHQEQVLKIENIKNESQYMRRLIKTGYCPDEGYARKILAIRKNTDFNELKKSS
Physico‐chemical
properties
protein length:211 AA
molecular weight:24648,4 Da
isoelectric point:9,47
hydropathy:-0,52
Representative Protein Details
Accession
7QhUZ
Protein name
7QhUZ
Sequence length
211 AA
Molecular weight
24648,38450 Da
Isoelectric point
9,47371
Sequence
MFSSKNEKSDQKHPILKLIESIKKFFDFDTVLSIIIMVFILWLLLASKGSAKAEMLSTDSDIVDVNLPEKFNEKSTSSKIKNKEDLYVYMKKAGIKFPELVWSQAMMESAWMNSAVFNRSNNLFGMKKSGKRDHLQIHKRNDSYSHFETIKKCVLDYKLHQEQVLKIENIKNESQYMRRLIKTGYCPDEGYARKILAIRKNTDFNELKKSS
Other Proteins in cluster: phalp2_35050
Total (incl. this protein): 4 Avg length: 210,5 Avg pI: 9,30

Protein ID Length (AA) pI
7QhUZ 211 9,47371
3AWZZ 211 9,14860
3fvRU 209 9,08464
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19706
4IQe4
2 28,1% 149 3.011E-11
2 phalp2_12135
5Tp7E
102 28,4% 130 5.050E-09
3 phalp2_3127
7ZEo1
1 23,9% 163 6.815E-09
4 phalp2_26188
8FkTo
1 26,0% 192 3.042E-08
5 phalp2_28793
7FgqF
2 22,5% 173 2.628E-06
6 phalp2_34258
3dxLm
725 26,0% 146 2.895E-04
7 phalp2_34940
4B1SD
1 22,6% 137 2.895E-04
8 phalp2_31056
1HUjD
3 19,7% 162 9.295E-04

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7QhUZ (211 AA)
Member sequence: H9NC94 (211 AA)
1 211 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Sphingomonas phage PAU
[NCBI]
1150991 No lineage information
Host Sphingomonas paucimobilis
[NCBI]
13689 Proteobacteria > Alphaproteobacteria > Sphingomonadales > Sphingomonadaceae > Sphingomonas >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JQ362498 [NCBI]
CDS location
range 158288 -> 158923
strand +
CDS
ATGTTTAGTTCAAAAAATGAAAAATCAGATCAGAAACACCCAATCTTAAAATTAATCGAATCTATAAAGAAATTCTTTGATTTCGATACAGTACTGTCAATTATTATAATGGTATTCATACTATGGTTACTATTAGCTTCTAAAGGTTCAGCAAAGGCTGAAATGCTGTCAACAGATTCTGATATTGTTGACGTAAATCTCCCAGAAAAATTCAATGAAAAATCTACTTCAAGTAAAATCAAGAACAAAGAGGATTTGTATGTTTACATGAAGAAAGCTGGAATTAAGTTTCCTGAGCTTGTATGGTCTCAAGCTATGATGGAATCTGCTTGGATGAATTCAGCAGTTTTTAATCGTTCTAATAATCTTTTTGGAATGAAGAAATCTGGAAAACGTGACCATTTACAAATACATAAACGTAACGATTCTTATTCACATTTCGAAACAATCAAGAAGTGTGTATTAGATTACAAGTTACATCAAGAACAAGTTCTTAAAATCGAAAACATTAAAAACGAATCTCAATATATGAGAAGGTTAATCAAAACTGGTTATTGTCCTGATGAAGGATATGCTAGAAAGATATTAGCTATTCGTAAGAATACAGATTTTAACGAATTGAAAAAATCTAGTTAG

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0002573377_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7QhUZ) rather than this protein.
PDB ID
7QhUZ
Method AlphaFoldv2
Resolution 80.74
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50