Protein

Protein accession
A0A976N1H7 [UniProt]
Representative
82GE1
Source
UniProt (cluster: phalp2_11792)
Protein name
Peptidase
Lysin probability
98%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MANPLTYCTENFTIKEFLRTSTGFYNIPEDDIIFENLKHIMSELQYCRSRIGVPIRVNSGYRSTAVNAAVGGSPTSMHLLGLAADVRCSRPLDTLKLLVTMLTSTDYHQIGIYYCPDGSINRIHFSSYADSNLNTEKSVYYHLA
Physico-chemical properties
Protein length: 144 AA
Molecular weight: 16197.2 Da
Isoelectric point: 6.49
Hydropathy: -0.13
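The length and hydropathy values above can be recomputed from the sequence. The sketch below assumes that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue). The page does not say which scale it uses, so the result may not match the listed value exactly. The names `KYTE_DOOLITTLE`, `SEQUENCE`, and `gravy` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (an assumption: the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of A0A976N1H7, copied from this entry.
SEQUENCE = (
    "MANPLTYCTENFTIKEFLRTSTGFYNIPEDDIIFENLKHIMSELQYCRSR"
    "IGVPIRVNSGYRSTAVNAAVGGSPTSMHLLGLAADVRCSRPLDTLKLLVT"
    "MLTSTDYHQIGIYYCPDGSINRIHFSSYADSNLNTEKSVYYHLA"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy per residue (GRAVY)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


print(f"length: {len(SEQUENCE)} AA")
print(f"GRAVY (Kyte-Doolittle, assumed scale): {gravy(SEQUENCE):.2f}")
```

Molecular weight and isoelectric point depend on the residue mass table and pKa set chosen, so they are left out of this sketch.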
Representative Protein Details
Accession
82GE1
Protein name
82GE1
Sequence length
134 AA
Molecular weight
15843.11150 Da
Isoelectric point
6.19581
Sequence
MLYYNHLEPIEHFSFNELTRTDTGLFNKPDSWKKIFNLINLAYKLEEIREYCGFPIVVNSAYRSTEVNSAVGGVEMSMHTLGCAADIRPLFQSDWPKLVKCCEEFYKLKHLNRCIVYRDKKFIHIENPCIEWDI
Other Proteins in cluster: phalp2_11792
Total (incl. this protein): 5
Avg length: 146.8 AA
Avg pI: 7.68

Protein ID Length (AA) pI
82GE1 134 6.19581
A0A873W7X0 138 6.13079
A0AAU8AV33 159 9.77375
A0AAU8B3E2 159 9.81120
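The table lists only the four other members, while the averages are stated to include this protein (A0A976N1H7, 144 AA, pI 6.49). A minimal sketch of that arithmetic, using the values shown on this page and a hypothetical `MEMBERS` list:

```python
from statistics import mean

# (protein ID, length in AA, isoelectric point), copied from this entry.
# The first row is this protein; the other four come from the cluster table.
MEMBERS = [
    ("A0A976N1H7", 144, 6.49),
    ("82GE1", 134, 6.19581),
    ("A0A873W7X0", 138, 6.13079),
    ("A0AAU8AV33", 159, 9.77375),
    ("A0AAU8B3E2", 159, 9.81120),
]

avg_length = mean(length for _, length, _ in MEMBERS)
avg_pi = mean(pi for _, _, pi in MEMBERS)

print(f"members: {len(MEMBERS)}")
print(f"avg length: {avg_length:.1f} AA")
print(f"avg pI: {avg_pi:.2f}")
```

With all five members included, the rounded results agree with the listed averages of 146.8 AA and 7.68.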
Similar Clusters (pHMM search)
# Cluster Representative Members Identity (%) Alignment length E-value
1 phalp2_25470 34SMU 72 34.3% 128 4.489E-17
2 phalp2_24996 FFK4 307 35.7% 123 7.606E-16
3 phalp2_23143 4GhkT 33 33.6% 119 6.862E-15
4 phalp2_40257 2Q5es 36 37.1% 132 1.041E-12
5 phalp2_34181 2HC1f 9 26.1% 126 2.665E-12
6 phalp2_11062 4US9Y 716 30.9% 126 3.646E-12
7 phalp2_25304 8lXTH 6 32.4% 108 3.646E-12
8 phalp2_9595 1rj6b 47 29.3% 133 1.277E-11
9 phalp2_36915 7TgS2 18 33.3% 84 1.746E-11
10 phalp2_11058 4UnMN 68 31.2% 96 3.266E-11

Domains

Domains [InterPro]
Representative sequence (used for alignment): 82GE1 (134 AA)
Member sequence: A0A976N1H7 (144 AA)
Domain positions follow the representative sequence above (residues 1 to 134); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF08291

Taxonomy

Phage: Sigmofec virus UA08Rod_4043 [NCBI]
Taxonomy ID: 2929393
Lineage: Petitvirales > Microviridae >
Host: No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
OM869578 [NCBI]
CDS location
range 3606 -> 4040
strand +
CDS
ATGGCTAATCCTTTAACTTATTGTACTGAAAACTTTACTATTAAAGAGTTTCTTCGTACGTCTACCGGTTTCTATAATATCCCTGAGGATGATATTATCTTTGAAAATCTTAAGCATATAATGTCTGAGTTACAGTATTGCCGCTCCCGTATAGGCGTACCAATTCGTGTCAACTCCGGATACCGTTCAACGGCAGTCAATGCTGCCGTTGGCGGTAGTCCCACCTCTATGCATTTATTAGGCCTTGCAGCGGATGTCCGCTGTTCAAGGCCGTTGGACACCCTCAAGCTTCTTGTGACGATGCTTACATCCACTGACTACCATCAGATAGGCATTTATTATTGTCCGGATGGCTCTATAAACAGGATTCATTTTTCATCGTATGCCGATAGTAATCTTAACACTGAAAAGAGCGTCTACTATCATCTTGCCTGA
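The CDS can be checked against the protein sequence and the stated location. The sketch below assumes the standard codon assignments and treats the range 3606 -> 4040 as 1-based and inclusive; neither is stated on this page. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed for this phage).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein sequence copied from this entry.
CDS = (
    "ATGGCTAATCCTTTAACTTATTGTACTGAAAACTTTACTATTAAAGAGTTTCTTCGTACG"
    "TCTACCGGTTTCTATAATATCCCTGAGGATGATATTATCTTTGAAAATCTTAAGCATATA"
    "ATGTCTGAGTTACAGTATTGCCGCTCCCGTATAGGCGTACCAATTCGTGTCAACTCCGGA"
    "TACCGTTCAACGGCAGTCAATGCTGCCGTTGGCGGTAGTCCCACCTCTATGCATTTATTA"
    "GGCCTTGCAGCGGATGTCCGCTGTTCAAGGCCGTTGGACACCCTCAAGCTTCTTGTGACG"
    "ATGCTTACATCCACTGACTACCATCAGATAGGCATTTATTATTGTCCGGATGGCTCTATA"
    "AACAGGATTCATTTTTCATCGTATGCCGATAGTAATCTTAACACTGAAAAGAGCGTCTAC"
    "TATCATCTTGCCTGA"
)
PROTEIN = (
    "MANPLTYCTENFTIKEFLRTSTGFYNIPEDDIIFENLKHIMSELQYCRSR"
    "IGVPIRVNSGYRSTAVNAAVGGSPTSMHLLGLAADVRCSRPLDTLKLLVT"
    "MLTSTDYHQIGIYYCPDGSINRIHFSSYADSNLNTEKSVYYHLA"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


start, end = 3606, 4040              # CDS location from this entry
span = end - start + 1               # inclusive coordinates (assumption)
print("span matches CDS length:", span == len(CDS))
print("translation matches protein:", translate(CDS) == PROTEIN + "*")
```

Under these assumptions the span covers 145 codons: 144 residues plus the terminal TGA stop codon.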

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (82GE1) rather than this protein.
PDB ID
82GE1
Method AlphaFoldv2
Resolution 93.41
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
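The confidence bands above can be expressed as a small lookup. This is a sketch with a hypothetical function name, `plddt_band`. The legend does not assign the exact boundary values 90, 70, and 50 to a band, so placing them in the lower band is an assumption. The value 93.41, listed under "Resolution" for this AlphaFold model, is treated as a mean pLDDT purely for illustration; the page does not confirm that reading.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend.

    Boundary values (90, 70, 50) fall into the lower band; the legend
    leaves them unassigned, so this choice is an assumption.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# 93.41 is the figure shown under "Resolution"; reading it as a mean pLDDT
# is an assumption made for this example only.
print(plddt_band(93.41))
```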