Protein

Protein accession
A0AAE9C7R1 [UniProt]
Representative
2Sg1U
Source
UniProt (cluster: phalp2_22894)
Protein name
CHAP-domain endopeptidase
Lysin probability
100%
PhaLP type
VAL
Probability: 70% (predicted by ML model)
Protein sequence
MDTEPAVQEALAHLGAEYVWGATGPDTFDCSGLVQYAFKKAGFDMTRTTYTQVLQGDPVTGAPQRGDLVFPDAGHVGIALGGDQMVHAPQTGDVVKISNYWTTPFAVRRMGPNSGMVGDTTVSNMYHAVGGPPSLSNMIPGLGTVQSQIDNLNNAVEQSTSVLGNISDVTKAVSMFLNILMSEQGWLRISKVILGTVAVLIGTGLLMKDFVGEVM
Physico-chemical properties
protein length: 215 AA
molecular weight: 22736,7 Da
isoelectric point: 4,75
hydropathy: 0,10
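The page does not say how the hydropathy value is derived. A minimal sketch, assuming it is a GRAVY-style score (the mean Kyte-Doolittle value over all residues); both the choice of scale and the helper name `gravy` are assumptions, not something the page confirms:

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumed scale: the page does not name the one it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (hypothetical helper)."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Calling `gravy` on the protein sequence listed above gives a value to compare with the reported hydropathy; a mismatch would suggest the page uses a different scale or normalisation.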
Representative Protein Details
Accession
2Sg1U
Protein name
2Sg1U
Sequence length
219 AA
Molecular weight
23220,50730 Da
Isoelectric point
10,69319
Sequence
MFTPRRYHILAIFVLVYSFVVTLLNPSLDTASAPPHSAQASFNPLWAPSVAAPVPDSGRLFTFRKHVEPPQITPAVPAPAVVATTPPRQVVLVSAAVAAPPRTPAPSRSTATETAIAYAMSKLGRPYVWGAAGPNTFDCSGLVQAAFRSAGIDLPRTTHTIIGRGTPVSRGALQRGDLVWPSSGHIGIYLGNGKIIHAPQPGDVVKISTLWSFYAGRRL
Other Proteins in cluster: phalp2_22894
Total (incl. this protein): 2
Avg length: 217,0
Avg pI: 7,72

Protein ID   Length (AA)   pI
2Sg1U        219           10,69319
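The cluster averages can be reproduced from the two members: the representative listed in the table and the protein described on this page. A short sketch, with the values copied from the page and decimal commas written as decimal points:

```python
from statistics import mean

# (protein ID, length in AA, isoelectric point) for the two cluster members.
# Values are taken from this page; the variable names are illustrative.
members = [
    ("2Sg1U", 219, 10.69319),
    ("A0AAE9C7R1", 215, 4.75),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"Avg length: {avg_length:.1f}")  # Avg length: 217.0
print(f"Avg pI: {avg_pi:.2f}")          # Avg pI: 7.72
```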
Similar Clusters (pHMM search)
#   Cluster                # Members   Identity (%)   Alignment Length   E-value
1   phalp2_30781 / bmLo    6           40,9%          176                3.293E-22
2   phalp2_24984 / CFMj    3           38,3%          172                4.611E-20
3   phalp2_27467 / 7KKUL   3           38,3%          146                2.168E-17
4   phalp2_12851 / 1ltcQ   3           37,2%          161                1.276E-12
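Numeric fields on this page mix notations: identities use a decimal comma and a percent sign, while E-values use a decimal point and scientific notation. A minimal sketch of a parser that handles both, assuming a comma is always a decimal separator and never a thousands separator; `parse_number` and the 1e-15 cutoff are illustrative choices:

```python
def parse_number(text: str) -> float:
    """Parse values such as '40,9%', '10,69319' or '3.293E-22' into a float.

    Assumes a comma is a decimal separator, never a thousands separator.
    """
    cleaned = text.strip().rstrip("%").replace(",", ".")
    return float(cleaned)


# (cluster, identity, E-value) as printed in the similar-clusters table.
raw_hits = [
    ("phalp2_30781", "40,9%", "3.293E-22"),
    ("phalp2_24984", "38,3%", "4.611E-20"),
    ("phalp2_27467", "38,3%", "2.168E-17"),
    ("phalp2_12851", "37,2%", "1.276E-12"),
]

hits = [(name, parse_number(ident), parse_number(evalue))
        for name, ident, evalue in raw_hits]

# Keep only hits below an arbitrary, illustrative E-value cutoff.
strong_hits = [name for name, _, evalue in hits if evalue < 1e-15]
```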

Domains

Domains [InterPro]
Disordered region
NLPC_P60
Representative sequence (used for alignment): 2Sg1U (219 AA)
Member sequence: A0AAE9C7R1 (215 AA)
Axis: 1 to 219 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF00877

Taxonomy

        Name                           Taxonomy ID   Lineage
Phage   Rhodococcus phage P19 [NCBI]   2900137       No lineage information
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
OL656105 [NCBI]
CDS location
range 11802 -> 12449
strand +
CDS
ATGGACACCGAACCAGCAGTACAAGAAGCACTGGCACACCTCGGTGCCGAATACGTATGGGGCGCAACGGGACCGGACACGTTCGACTGTTCGGGACTGGTCCAGTACGCATTCAAAAAGGCGGGGTTCGACATGACACGAACCACGTACACCCAAGTGCTGCAAGGTGATCCTGTGACCGGCGCACCGCAGCGCGGTGACTTGGTGTTCCCCGATGCAGGGCACGTGGGGATCGCACTCGGTGGGGATCAGATGGTTCATGCTCCGCAGACCGGCGACGTGGTGAAGATCAGTAACTACTGGACTACACCGTTTGCCGTTCGTCGGATGGGGCCGAACAGTGGAATGGTCGGTGACACAACGGTATCCAACATGTACCACGCAGTCGGGGGACCACCCTCACTGTCGAACATGATACCGGGACTTGGCACCGTCCAATCGCAGATAGACAACTTGAACAATGCAGTGGAGCAAAGCACTTCCGTTCTCGGTAATATCTCCGATGTCACCAAAGCGGTATCCATGTTTCTCAATATCCTTATGTCGGAACAGGGTTGGTTGAGAATCTCGAAAGTGATACTGGGAACTGTCGCTGTGCTGATCGGCACAGGGCTACTGATGAAAGACTTTGTAGGAGAGGTAATGTAA
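The CDS coordinates can be checked against the protein length. A small sketch, assuming a 1-based inclusive range, no introns, and a single terminal stop codon (the sequence above ends in TAA); `cds_summary` is a hypothetical helper name:

```python
def cds_summary(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, encoded residues) for a 1-based inclusive range.

    Assumes an intron-free CDS that ends with exactly one stop codon.
    """
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return nt_length, nt_length // 3 - 1


print(cds_summary(11802, 12449))  # (648, 215)
```

The 215 residues agree with the protein length reported for A0AAE9C7R1.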

Gene Ontology

GO ID        Description                        Category             Evidence (source)
GO:0008234   cysteine-type peptidase activity   molecular function   None (UniProt)
GO:0016020   membrane                           cellular component   None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2Sg1U) rather than this protein.
PDB ID 2Sg1U
Method AlphaFoldv2
Resolution 75.12
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
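The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping; the legend uses strict inequalities, so assigning the exact boundary values 90, 70 and 50 to the lower band is an assumption, and `plddt_band` is a hypothetical helper name:

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"   # includes exactly 90 (assumed boundary handling)
    if plddt > 50:
        return "Low"    # includes exactly 70 (assumed boundary handling)
    return "Very low"   # includes exactly 50 (assumed boundary handling)
```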