Protein

Protein accession
H7BV73 [UniProt]
Representative
6rym4
Source
UniProt (cluster: phalp2_8091)
Protein name
Peptidoglycan binding-like domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MGRNPHSFTLEEVMAIIRNTYTDALFNGLMAAGCTIYGACGAMGNIYAESGANPRNLENLCEKRLNYKYTDDTYTTAVDSGEITRDLFLHPLGDFRQYGYGFCQWTSAGRKAGLYDLVKSKGVSIGDPDTQVEFMLKELQQSYKSVLQVLKTATSVQEASDIFLVKFEVPANTGSEVKKARSSYREQYLKIYKDIEKEETNMSLISNSGHDENGKYSGGKAGDQTGTEWALIPWYNRPWKCVLRHPDAKVRAKLAELGIKAAKNDLVGYDQGQRGTYWEHLKASNYDPSQITIACEGDCSAGVIANIKAAGYLLGIDALKNINATYTGNLRSGAKAAGFQVLTESKYLTGPDYLLAGDILLNDSHHTATNVQDGSKSGGAGTSGSGSTNSGSGTISGGNSKTANIKNGQQWLNSNYGDKLIQFCGAKLEVDGSYGPASRWGALAIWKDLMNRKYGTKLTPTNKNFYGSCKEVAGKAGVHSGTVGTFTFIAQFILSAKGYYTGAMDASCGSKLVEAITAFQKANGLDADGWCGADTWYALFN
Physico‐chemical
properties
protein length:541 AA
molecular weight:58464,8 Da
isoelectric point:7,86
hydropathy:-0,39
Representative Protein Details
Accession
6rym4
Protein name
6rym4
Sequence length
308 AA
Molecular weight
33782,74510 Da
Isoelectric point
6,45869
Sequence
MKENLQMAIERNTYTDILFDALMAAGCTIYGACAAMGNIYAESGANPRNLENLCEKKLNYKYTDDTYTEAVDSGKITRALFLHPLGDSRQYGYGFCQWTSAGRKAGLYDLVKSRGVSIGDAKVQTEYMLSELQKSYKSVWKVLQTATSVQEASDIFLVKFEAPTNTGSAVKKARDSYGEQYLKIYQNQKKEENKVSKIENAVARAEAIALDDSHGYDQVDRWGNPNYDCSGLVIRSLEEAGILAKSSGATYTGNMPEVLPKIGFKDVVKSVDLATGSGMIRGDVLLGNGHTAFYCGNGKLVHASINEK
Other Proteins in cluster: phalp2_8091
Total (incl. this protein): 4 Avg length: 381,5 Avg pI: 7,61

Protein ID Length (AA) pI
6rym4 308 6,45869
3wcDS 310 7,54085
5TcWG 367 8,56619
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4542
3TDL2
7 31,1% 353 1.025E-46
2 phalp2_772
21fbP
17 34,1% 351 1.230E-45
3 phalp2_38048
6eiJ9
6 40,8% 225 2.739E-44
4 phalp2_9140
6h8Qx
4 38,8% 206 6.834E-35
5 phalp2_12591
1ntYC
157 35,5% 298 3.183E-34
6 phalp2_18242
4UeaR
8 40,4% 210 3.721E-33
7 phalp2_13483
4MI6v
2 23,1% 315 2.908E-06

Domains

Domains [InterPro]
Representative sequence (used for alignment): 6rym4 (308 AA)
Member sequence: H7BV73 (541 AA)
1 308 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacteriophage sp
[NCBI]
38018 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JQ680357 [NCBI]
CDS location
range 53073 -> 54698
strand -
CDS
GTGGGGAGAAATCCCCACTCTTTTACGTTGGAGGAAGTTATGGCAATTATACGAAACACCTATACAGACGCATTATTTAATGGTTTGATGGCTGCTGGATGCACAATATACGGAGCATGCGGAGCTATGGGGAATATTTACGCAGAATCCGGGGCAAATCCCCGTAATCTGGAGAACCTCTGCGAGAAGAGGCTGAATTACAAATACACCGATGACACGTACACAACGGCAGTAGACAGCGGAGAGATCACAAGAGATCTTTTCTTGCATCCGTTGGGAGATTTCAGACAATACGGTTATGGTTTTTGCCAGTGGACGTCCGCCGGAAGAAAAGCAGGACTGTACGATCTGGTTAAATCAAAAGGCGTGTCGATCGGAGATCCGGACACTCAGGTTGAGTTCATGCTGAAAGAATTACAGCAGAGCTACAAGAGTGTTCTGCAGGTATTGAAAACGGCAACCTCAGTCCAGGAGGCGTCAGATATCTTCCTGGTAAAATTCGAGGTTCCGGCAAATACCGGATCAGAAGTCAAAAAGGCAAGATCTTCCTACAGGGAGCAGTACCTGAAAATCTATAAAGACATCGAAAAGGAGGAAACAAACATGAGTTTAATTTCAAACAGCGGACATGATGAAAACGGAAAGTATTCAGGAGGAAAAGCCGGAGATCAGACCGGGACAGAATGGGCTTTGATTCCATGGTACAACAGACCCTGGAAGTGCGTTCTGAGGCATCCGGATGCAAAAGTTAGAGCGAAGCTGGCAGAGCTTGGGATCAAAGCTGCTAAAAATGATTTGGTCGGTTACGATCAGGGACAAAGAGGTACATACTGGGAGCACCTGAAAGCAAGTAATTACGATCCTTCACAGATCACAATCGCTTGCGAGGGGGATTGTTCTGCCGGAGTGATCGCTAATATTAAAGCGGCTGGTTATCTCCTGGGAATTGATGCACTGAAAAACATTAACGCCACATATACGGGCAATCTGAGATCCGGAGCAAAGGCGGCTGGATTCCAGGTATTAACAGAATCGAAGTATCTTACTGGCCCTGATTATCTTTTAGCCGGAGACATCCTTTTGAACGACAGCCACCATACGGCAACAAATGTCCAGGATGGTTCTAAGTCTGGAGGAGCCGGAACATCTGGCTCAGGATCAACAAACAGCGGATCCGGAACAATTTCCGGAGGCAACAGTAAGACAGCAAACATCAAGAACGGTCAGCAATGGTTGAATAGTAATTACGGCGATAAGCTGATCCAGTTCTGCGGAGCTAAACTGGAGGTGGATGGATCCTACGGTCCAGCTTCCAGATGGGGAGCCCTTGCGATCTGGAAAGATCTTATGAACCGCAAGTATGGAACAAAGCTCACTCCGACAAACAAGAATTTCTATGGTTCATGCAAGGAAGTAGCCGGAAAAGCCGGAGTTCACAGTGGAACGGTCGGTACATTCACATTCATAGCTCAGTTCATTTTATCCGCAAAGGGATATTACACCGGAGCCATGGATGCCAGCTGCGGAAGCAAGCTGGTAGAGGCGATCACAGCATTCCAGAAAGCGAATGGTCTGGATGCTGACGGATGGTGCGGAGCCGATACCTGGTACGCACTTTTTAACTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0002517161_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6rym4) rather than this protein.
PDB ID
6rym4
Method AlphaFoldv2
Resolution 93.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50