Protein

Protein accession
A0A8S5U601 [UniProt]
Representative
6pGHJ
Source
UniProt (cluster: phalp2_24713)
Protein name
Peptidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MPNIKGFFSKTSPSCWKQDGFFMHYQNYHEEWPWPNFKPKEVACKHCGELWEGEKPMPKWFHESMEALQYLRELWGKSLIINSGHRCAEHNAAVGGATSSQHLRIAFDCRIPKKEQREFKELAEEAGFRGVGYYSNFIHIDMGPRRTWLGKY
Physico‐chemical
properties
protein length:152 AA
molecular weight:17901,2 Da
isoelectric point:8,57
hydropathy:-0,76
Representative Protein Details
Accession
6pGHJ
Protein name
6pGHJ
Sequence length
190 AA
Molecular weight
21966,64740 Da
Isoelectric point
6,81559
Sequence
VVPAEYETTVAQTAAIFLYGSITETHSGGETRIQSLEIMPNIKGFFSKTSPSCWKQDGFFMHYQNYHEEWPWPNFKPKEVACKHCGELWEGEKPMPKWFHESMEALQYLRELWGKSLIINSGHRCAEHNAAVGGATSSQHLRIAFDCRIPKKEQREFKELAEEAGFRGVGYYSNFIHIDMGPRRTWLGKY
Other Proteins in cluster: phalp2_24713
Total (incl. this protein): 3 Avg length: 154,0 Avg pI: 7,99

Protein ID Length (AA) pI
6pGHJ 190 6,81559
A0A8S5P454 120 8,59649
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37118
1hl31
667 44,2% 131 1.158E-29
2 phalp2_34760
3hjNi
68 42,7% 138 4.050E-29
3 phalp2_33636
DFQX
10 42,6% 143 1.884E-25
4 phalp2_32287
4aKr
1 41,1% 124 7.984E-24
5 phalp2_11062
4US9Y
716 44,3% 124 6.687E-20
6 phalp2_11545
XXz6
1240 35,8% 134 9.602E-18
7 phalp2_1834
3nSHh
7 38,1% 118 1.559E-16
8 phalp2_15919
4GXbY
30 37,9% 116 8.648E-15
9 phalp2_31755
45JWG
1 35,8% 131 4.038E-14
10 phalp2_34553
7OdA8
10 34,3% 128 7.281E-10

Domains

Domains [InterPro]
Disordered region
PET_M15
Representative sequence (used for alignment): 6pGHJ (190 AA)
Member sequence: A0A8S5U601 (152 AA)
1 190 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Siphoviridae sp. ctwHj1
[NCBI]
2825727 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
BK016018 [NCBI]
CDS location
range 18948 -> 19406
strand +
CDS
ATGCCCAACATTAAGGGATTTTTTTCAAAAACAAGTCCATCTTGCTGGAAACAGGATGGATTTTTTATGCACTATCAAAATTACCATGAGGAATGGCCTTGGCCCAACTTCAAACCGAAAGAAGTTGCTTGCAAGCACTGTGGGGAATTATGGGAAGGCGAAAAGCCCATGCCCAAGTGGTTCCATGAAAGCATGGAAGCCTTGCAATATCTCCGCGAACTGTGGGGAAAGTCCCTCATCATCAACTCCGGACATCGATGCGCTGAACACAACGCCGCAGTGGGCGGGGCGACTTCTTCTCAGCATCTACGTATCGCTTTTGACTGCCGTATCCCCAAGAAAGAACAGCGCGAGTTCAAAGAGCTTGCAGAAGAAGCGGGATTCCGAGGCGTAGGGTACTATTCCAATTTTATTCATATCGACATGGGGCCGCGCCGCACGTGGCTCGGAAAGTACTAG

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6pGHJ) rather than this protein.
PDB ID
6pGHJ
Method AlphaFoldv2
Resolution 78.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50