Protein
- Protein accession
- A0A8S5U601 [UniProt]
- Representative
- 6pGHJ
- Source
- UniProt (cluster: phalp2_24713)
- Protein name
- Peptidase
- Lysin probability
- 100%
- PhaLP type
- endolysin (probability: 98%, predicted by ML model)
- Protein sequence
-
MPNIKGFFSKTSPSCWKQDGFFMHYQNYHEEWPWPNFKPKEVACKHCGELWEGEKPMPKWFHESMEALQYLRELWGKSLIINSGHRCAEHNAAVGGATSSQHLRIAFDCRIPKKEQREFKELAEEAGFRGVGYYSNFIHIDMGPRRTWLGKY
- Physico-chemical properties
- protein length: 152 AA; molecular weight: 17901.2 Da; isoelectric point: 8.57; hydropathy: -0.76
Representative Protein Details
- Accession
- 6pGHJ
- Protein name
- 6pGHJ
- Sequence length
- 190 AA
- Molecular weight
- 21966.64740 Da
- Isoelectric point
- 6.81559
- Sequence
-
VVPAEYETTVAQTAAIFLYGSITETHSGGETRIQSLEIMPNIKGFFSKTSPSCWKQDGFFMHYQNYHEEWPWPNFKPKEVACKHCGELWEGEKPMPKWFHESMEALQYLRELWGKSLIINSGHRCAEHNAAVGGATSSQHLRIAFDCRIPKKEQREFKELAEEAGFRGVGYYSNFIHIDMGPRRTWLGKY
Other Proteins in cluster: phalp2_24713
Total (incl. this protein): 3 | Avg length: 154.0 | Avg pI: 7.99

| Protein ID | Length (AA) | pI |
|---|---|---|
| 6pGHJ | 190 | 6.81559 |
| A0A8S5P454 | 120 | 8.59649 |
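The cluster averages can be reproduced from the three member values given in this record. The following is a minimal sketch, assuming the averages are plain arithmetic means and using the rounded pI (8.57) listed for A0A8S5U601; the variable names are illustrative only.

```python
# (length in AA, isoelectric point) for each cluster member, copied from this record.
members = {
    "A0A8S5U601": (152, 8.57),     # this protein (pI as rounded in the record)
    "6pGHJ": (190, 6.81559),       # cluster representative
    "A0A8S5P454": (120, 8.59649),
}

# Assumption: the reported averages are simple arithmetic means over all members.
avg_length = sum(length for length, _ in members.values()) / len(members)
avg_pi = sum(pi for _, pi in members.values()) / len(members)

print(f"Avg length: {avg_length:.1f} | Avg pI: {avg_pi:.2f}")
```

Under that assumption the result agrees with the reported summary (154.0 and 7.99).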
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_37118 (1hl31) | 667 | 44.2% | 131 | 1.158E-29 |
| 2 | phalp2_34760 (3hjNi) | 68 | 42.7% | 138 | 4.050E-29 |
| 3 | phalp2_33636 (DFQX) | 10 | 42.6% | 143 | 1.884E-25 |
| 4 | phalp2_32287 (4aKr) | 1 | 41.1% | 124 | 7.984E-24 |
| 5 | phalp2_11062 (4US9Y) | 716 | 44.3% | 124 | 6.687E-20 |
| 6 | phalp2_11545 (XXz6) | 1240 | 35.8% | 134 | 9.602E-18 |
| 7 | phalp2_1834 (3nSHh) | 7 | 38.1% | 118 | 1.559E-16 |
| 8 | phalp2_15919 (4GXbY) | 30 | 37.9% | 116 | 8.648E-15 |
| 9 | phalp2_31755 (45JWG) | 1 | 35.8% | 131 | 4.038E-14 |
| 10 | phalp2_34553 (7OdA8) | 10 | 34.3% | 128 | 7.281E-10 |
Domains
Domains [InterPro]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Siphoviridae sp. ctwHj1 [NCBI] | 2825727 | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
CDS Source ID
CDS Source
BK016018 [NCBI]
CDS location
range 18948 -> 19406
strand +
CDS
ATGCCCAACATTAAGGGATTTTTTTCAAAAACAAGTCCATCTTGCTGGAAACAGGATGGATTTTTTATGCACTATCAAAATTACCATGAGGAATGGCCTTGGCCCAACTTCAAACCGAAAGAAGTTGCTTGCAAGCACTGTGGGGAATTATGGGAAGGCGAAAAGCCCATGCCCAAGTGGTTCCATGAAAGCATGGAAGCCTTGCAATATCTCCGCGAACTGTGGGGAAAGTCCCTCATCATCAACTCCGGACATCGATGCGCTGAACACAACGCCGCAGTGGGCGGGGCGACTTCTTCTCAGCATCTACGTATCGCTTTTGACTGCCGTATCCCCAAGAAAGAACAGCGCGAGTTCAAAGAGCTTGCAGAAGAAGCGGGATTCCGAGGCGTAGGGTACTATTCCAATTTTATTCATATCGACATGGGGCCGCGCCGCACGTGGCTCGGAAAGTACTAG
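As a consistency check, the CDS coordinates can be related to the protein length reported above. This is a minimal sketch, assuming the range 18948 -> 19406 is 1-based and inclusive and that the final codon is a stop codon (the CDS shown ends in TAG); the variable names are illustrative only.

```python
# CDS coordinates from this record (assumed 1-based, inclusive).
start, end = 18948, 19406

cds_length = end - start + 1              # nucleotides spanned by the range
codons, remainder = divmod(cds_length, 3) # whole codons and leftover bases

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(cds_length, remainder, protein_length)
```

Under these assumptions the range spans 459 nucleotides, i.e. 153 codons with no remainder, which corresponds to the 152 AA protein length listed in the physico-chemical properties.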
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (6pGHJ) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50