Protein

Protein accession
A0AAU8B3E2 [UniProt]
Representative
82GE1
Source
UniProt (cluster: phalp2_11792)
Protein name
Peptidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MDFKQNITEHFKLSEFVTTTFDRDKLRLEFLAENDVLFRKGELDMIYRFGRLAGALEYIRKFCGAPIIILSCYRSSRVNKLVHGSKTSQHRTGSAVDFFCPSMRADDFFSLVKRALEFHNIEYDQLIFYSRRGFVHLGLFRYNIGTSIRRPSRRMIIYK
Physico‐chemical
properties
protein length:159 AA
molecular weight:18827,6 Da
isoelectric point:9,81
hydropathy:-0,24
Representative Protein Details
Accession
82GE1
Protein name
82GE1
Sequence length
134 AA
Molecular weight
15843,11150 Da
Isoelectric point
6,19581
Sequence
MLYYNHLEPIEHFSFNELTRTDTGLFNKPDSWKKIFNLINLAYKLEEIREYCGFPIVVNSAYRSTEVNSAVGGVEMSMHTLGCAADIRPLFQSDWPKLVKCCEEFYKLKHLNRCIVYRDKKFIHIENPCIEWDI
Other Proteins in cluster: phalp2_11792
Total (incl. this protein): 5 Avg length: 146,8 Avg pI: 7,68

Protein ID Length (AA) pI
82GE1 134 6,19581
A0A873W7X0 138 6,13079
A0A976N1H7 144 6,49308
A0AAU8AV33 159 9,77375
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_25470
34SMU
72 34,3% 128 4.489E-17
2 phalp2_24996
FFK4
307 35,7% 123 7.606E-16
3 phalp2_23143
4GhkT
33 33,6% 119 6.862E-15
4 phalp2_40257
2Q5es
36 37,1% 132 1.041E-12
5 phalp2_34181
2HC1f
9 26,1% 126 2.665E-12
6 phalp2_11062
4US9Y
716 30,9% 126 3.646E-12
7 phalp2_25304
8lXTH
6 32,4% 108 3.646E-12
8 phalp2_9595
1rj6b
47 29,3% 133 1.277E-11
9 phalp2_36915
7TgS2
18 33,3% 84 1.746E-11
10 phalp2_11058
4UnMN
68 31,2% 96 3.266E-11

Domains

Domains [InterPro]
Representative sequence (used for alignment): 82GE1 (134 AA)
Member sequence: A0AAU8B3E2 (159 AA)
1 134 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Dulem virus 264
[NCBI]
3145741 Petitvirales > Microviridae > Microvirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
PP511523 [NCBI]
CDS location
range 4170 -> 4649
strand +
CDS
ATGGATTTCAAACAAAACATTACAGAGCATTTTAAGCTTTCAGAATTTGTAACCACAACTTTTGATAGAGATAAGCTTCGTTTAGAATTTCTTGCGGAGAATGATGTACTTTTCCGTAAAGGTGAGCTTGATATGATTTATAGGTTTGGTCGTCTTGCTGGCGCTTTAGAGTATATAAGAAAATTTTGTGGTGCGCCTATTATTATTCTGTCTTGTTACCGTTCTTCTCGTGTCAATAAATTGGTTCATGGAAGTAAAACGTCTCAACATAGAACAGGATCTGCGGTCGATTTCTTTTGTCCATCTATGCGTGCCGATGATTTTTTTTCACTTGTTAAGCGTGCATTAGAATTTCATAATATTGAATATGACCAACTCATTTTTTACAGTAGACGTGGTTTTGTTCATCTTGGTTTATTTCGGTATAATATAGGTACTTCTATTAGGAGGCCATCTCGCAGAATGATAATTTACAAATAA

CDS Source ID
CDS Source
PP511707 [NCBI]
CDS location
range 4170 -> 4649
strand +
CDS
ATGGATTTCAAACAAAACATTACAGAGCATTTTAAGCTTTCAGAATTTGTAACCACAACTTTTGATAGAGATAAGCTTCGTTTAGAATTTCTTGCGGAGAATGATGTACTTTTCCGTAAAGGTGAGCTTGATATGATTTATAGGTTTGGTCGTCTTGCTGGCGCTTTAGAGTATATAAGAAAATTTTGTGGTGCGCCTATTATTATTCTGTCTTGTTACCGTTCTTCTCGTGTCAATAAATTGGTTCATGGAAGTAAAACGTCTCAACATAGAACAGGATCTGCGGTCGATTTCTTTTGTCCATCTATGCGTGCCGATGATTTTTTTTCACTTGTTAAGCGTGCATTAGAATTTCATAATATTGAATATGACCAACTCATTTTTTACAGTAGACGTGGTTTTGTTCATCTTGGTTTATTTCGGTATAATATAGGTACTTCTATTAGGAGGCCATCTCGCAGAATGATAATTTACAAATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (82GE1) rather than this protein.
PDB ID
82GE1
Method AlphaFoldv2
Resolution 93.41
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50