Protein

Protein accession
A0A0F6WE88 [UniProt]
Representative
8qH8D
Source
UniProt (cluster: phalp2_10559)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGSENGWEPARLAPDSPLLVWKIVPGTNPPVHLQVMRGFPEVFLIAWAADWNEFIEPLRDADSACYTPTNSVSTSNHLNATAEDLNWKNHPFKQRGSLNAAQMAVLKEMEDFYEGWMFWAGRWQNPIDEMHSQCGYDTWNDNDRGFDFIHRKIRSDGRSTFRRGSLPGENLPAATPAFPPIAPAHDDEVAATVLYDAVPIIDMNRARKLLPLVRAGLVAARCDTPRKIAMYLAQVGWESDGFNATEEYAKNGRYAPFIGRTWIMVTWQSNYAAFGRWCYDRHLVSDPDVFVKNPRKLADDEWAGLGPAWYITDARPNINAMADAGDLLGVTRAINGGTNGLEDPRPGVPGRRTRWNQAIALGERLLELINHPDTEDVMPGLTDDEQRELLVNTRWLREQLEVSRPDWSADADLGTDSQGRPNTLRTAVAKILRLVDKGKPATSVANANPPATS
Physico‐chemical
properties
protein length:453 AA
molecular weight:50656,3 Da
isoelectric point:5,26
hydropathy:-0,50
Representative Protein Details
Accession
8qH8D
Protein name
8qH8D
Sequence length
222 AA
Molecular weight
24825,60210 Da
Isoelectric point
9,28985
Sequence
MVQVGVESENGWRPARATPDLTEWITVPGTNVTLQLMKGWPLQILRAWAADYNAFIEPLRDPDSAAWTPTNSVATSNHLNGTACDLNWNTHPFRVRGTFTASQMATLRQMLDFYEGTVFWAGDWNDPIDEMHHQMGYGTWNNPKTGDFVKRKVRADGYSTFRRGAVPPSDPDAGGGRPLPRDGRPPVVGPLPPATPGRLRVAESLRVHHRRPDRHVVRADRA
Other Proteins in cluster: phalp2_10559
Total (incl. this protein): 11 Avg length: 452,3 Avg pI: 5,87

Protein ID Length (AA) pI
8qH8D 222 9,28985
A0A482JE75 464 5,83352
A0A0F6WE39 451 5,19568
A0A5J6TC65 467 5,90883
A0A345KV15 565 5,14480
A0A481W362 450 6,11874
A0A222YY59 451 5,26218
A0A649V5U4 494 5,33669
A0A649VF04 494 5,27479
A0AA48Y404 464 5,91963
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_17747
7t2Ng
129 43,1% 167 1.896E-29
2 phalp2_5098
1IyeS
221 38,9% 159 8.386E-26
3 phalp2_12264
727mo
153 35,7% 207 5.396E-25
4 phalp2_40429
4f3sa
2 25,4% 173 1.251E-10
5 phalp2_34934
4zLgX
5 25,8% 143 4.105E-06
6 phalp2_38558
1NKac
1 24,0% 158 1.326E-05

Domains

Domains [InterPro]
PET_M15
Disordered region
Representative sequence (used for alignment): 8qH8D (222 AA)
Member sequence: A0A0F6WE88 (453 AA)
1 222 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage FlagStaff
[NCBI]
1647304 Avocadovirus > Avocadovirus flagstaff
Host Mycobacterium smegmatis str. MC2 155
[NCBI]
246196 Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KR080197 [NCBI]
CDS location
range 25277 -> 26638
strand +
CDS
ATGGGTAGCGAGAACGGCTGGGAACCGGCGCGACTGGCGCCGGATTCCCCCCTGCTGGTCTGGAAGATAGTCCCCGGCACGAACCCGCCGGTTCACCTACAGGTGATGCGCGGGTTCCCCGAAGTGTTCCTGATCGCGTGGGCCGCGGACTGGAACGAGTTCATCGAACCGCTGCGCGACGCCGATTCCGCCTGCTACACGCCAACGAACAGCGTGTCGACGTCGAACCACCTCAACGCGACCGCCGAAGACCTGAACTGGAAAAACCACCCGTTCAAGCAGCGCGGATCGCTCAACGCGGCGCAGATGGCAGTCCTCAAGGAAATGGAGGACTTCTACGAGGGCTGGATGTTCTGGGCCGGCCGCTGGCAAAACCCGATCGACGAAATGCACAGCCAGTGCGGTTACGACACCTGGAACGACAACGATCGCGGGTTCGACTTCATCCACCGGAAGATCCGGTCCGACGGCCGTTCGACGTTCCGTCGCGGATCGCTGCCTGGCGAGAACCTGCCGGCCGCGACGCCGGCGTTCCCGCCGATCGCGCCGGCGCACGACGACGAAGTGGCCGCGACCGTGCTCTACGACGCGGTTCCGATCATCGACATGAACCGGGCCCGCAAGCTGCTGCCGCTGGTCCGCGCCGGCCTGGTCGCTGCGCGCTGCGACACCCCGCGCAAGATCGCGATGTATCTCGCGCAAGTGGGCTGGGAATCCGACGGGTTCAACGCGACCGAGGAATACGCGAAGAACGGCCGGTACGCGCCGTTCATCGGCCGGACGTGGATCATGGTCACCTGGCAGTCGAACTACGCGGCGTTCGGCCGCTGGTGCTACGACCGGCACCTGGTGAGCGACCCCGACGTGTTCGTGAAGAACCCGCGGAAGCTGGCCGACGACGAATGGGCCGGCCTGGGTCCGGCCTGGTACATCACCGACGCGCGGCCGAACATCAACGCCATGGCCGACGCCGGCGATCTGCTCGGAGTGACGCGCGCGATCAACGGCGGCACGAACGGGCTCGAAGACCCGCGACCGGGCGTTCCCGGCCGGCGGACACGATGGAATCAGGCGATCGCGCTGGGCGAACGTCTGCTGGAACTGATCAACCACCCCGATACGGAGGACGTTATGCCTGGTCTGACTGACGACGAACAGCGCGAACTGCTGGTGAACACGCGCTGGCTGCGCGAGCAACTGGAAGTCTCGCGCCCCGACTGGTCCGCGGACGCCGATCTGGGCACCGACTCGCAGGGCCGGCCGAACACGCTGCGGACCGCGGTCGCGAAGATCCTGCGGCTGGTCGACAAGGGCAAGCCGGCGACCAGCGTCGCGAACGCGAACCCGCCGGCGACGTCGTGA

Gene Ontology

Description Category Evidence (source)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000624e13f_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (8qH8D) rather than this protein.
PDB ID
8qH8D
Method AlphaFoldv2
Resolution 80.66
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50