Protein

Protein accession
H7BV84 [UniProt]
Representative
2LVGE
Source
UniProt (cluster: phalp2_6929)
Protein name
CHAP domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDKQEFIKKIAGYVQKHAPAYGILVHSPIIAQAILESGWGESRLAAVYHNYFGLKCGTKWTGKSVNLSTMEEYTPGTLTQIKDNFRVYDNMEEGVKGYFEFIQLSRYQNLRGITDPETYLKTIKADGYATSSKYVDNTMRIVTQYDLQQYDVKGAGSMAKLASAVLAQARAWIGRNEADGTHKGIIDVYNGHKPLARGYKVKYTDAWCATFVSAVAIKCGLTGIIPTECGCGQMIALFKNLGEWQESDSRTPSPGDIIFYDWDDTGAGDCTGNPDHVGIVESVSGGKITVIEGNKNNAVGRRTLAVNGRYIRGYGVPRYDKENAGSGSQATKSVVAVAKEVIAGKWGNGEDRKNRLTAAGYDYKAVQDQVNALLKGTTAATKSVAAVAKEVIAGKWGNGKERKNRLEAAGYNYNEVQAKVNAMLR
Physico‐chemical
properties
protein length:425 AA
molecular weight:46387,9 Da
isoelectric point:9,04
hydropathy:-0,41
Representative Protein Details
Accession
2LVGE
Protein name
2LVGE
Sequence length
407 AA
Molecular weight
44252,93030 Da
Isoelectric point
8,29323
Sequence
MYSVSDLYKTAIQNNTRSFSWSGTITTSNGRVYPFENKDIVKGSGYVSRQCSGSSEIELGSVYAAELGIAQYTSKCSYKGKYGIWQYSSKGSVDGISGNVDLDYGYVDYPAIIKSGGFNGYTKDAFDDNTPAPATSSQRDQIIAQARAWLGKKESDGSHREIIDVYNSHKPLARGYAVKYTDAWCATFVSALAIKCGLTDIIPNECGCGQMVTLFQKLGEWVENDAYLPSPGDVIFYDWQDSGSGDNTGWPDHVGIVEAVSGKTLTIIEGNKSDSVSRRSLQVDGKNIRGYGVPKYNTGSVTPDPVAPGKTVDELAKEVLDGKWGNGTDRKNRLTAARYDYSAVQAKVNALVKAKSESAVFYTVKSGDTLSSIAQKYDTSVSAIQKLNPTLIKNVNLILTGWKIRVK
Other Proteins in cluster: phalp2_6929
Total (incl. this protein): 3 Avg length: 455,3 Avg pI: 8,79

Protein ID Length (AA) pI
2LVGE 407 8,29323
5C0uy 534 9,02250
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4916
6k3vw
208 56,6% 318 1.037E-109
2 phalp2_29393
1dfbb
105 52,4% 265 5.992E-78
3 phalp2_7368
4xoTh
314 47,4% 289 8.202E-73
4 phalp2_38736
13rhz
58 44,1% 317 3.882E-72
5 phalp2_12538
16lqa
19 36,0% 399 2.145E-65
6 phalp2_4009
11iJK
7 31,6% 446 1.492E-40
7 phalp2_33156
7DPYX
1 35,3% 266 1.793E-36
8 phalp2_9136
6d0dy
3 32,7% 287 1.914E-25
9 phalp2_2168
3VGN3
9 28,2% 368 1.638E-22

Domains

Domains [InterPro]
Disordered region
Unannotated
CHAP
LysM
Representative sequence (used for alignment): 2LVGE (407 AA)
Member sequence: H7BV84 (425 AA)
1 407 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257, PF08230|PF01476

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacteriophage sp
[NCBI]
38018 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JQ680358 [NCBI]
CDS location
range 7624 -> 8901
strand +
CDS
ATGGACAAGCAGGAGTTTATTAAAAAGATTGCCGGGTACGTGCAGAAACACGCCCCGGCATACGGGATTTTGGTACATAGTCCGATTATAGCCCAGGCGATACTTGAAAGCGGTTGGGGAGAAAGCCGCCTGGCCGCCGTGTATCACAATTATTTTGGGCTGAAATGTGGGACAAAATGGACCGGGAAAAGCGTAAACCTTTCCACCATGGAAGAATATACGCCGGGAACCCTTACGCAGATTAAGGACAATTTCCGGGTGTATGACAACATGGAAGAGGGTGTAAAAGGCTATTTTGAGTTTATCCAGTTATCCAGGTATCAGAATTTACGGGGCATTACGGACCCGGAAACGTACCTTAAAACCATTAAGGCGGACGGGTACGCAACCAGTAGCAAGTATGTGGACAATACCATGAGGATTGTTACACAGTACGATTTGCAGCAGTATGATGTGAAAGGAGCCGGAAGCATGGCAAAATTGGCAAGTGCAGTATTAGCCCAGGCAAGGGCGTGGATTGGCCGAAATGAAGCGGACGGCACCCACAAGGGCATTATTGACGTGTACAACGGCCACAAACCATTGGCGAGGGGTTACAAAGTCAAATATACAGACGCCTGGTGTGCCACCTTTGTTTCCGCCGTGGCTATCAAGTGCGGTTTGACTGGCATTATACCGACAGAGTGCGGTTGCGGCCAGATGATTGCATTATTCAAGAACCTGGGGGAATGGCAGGAAAGCGACAGCAGGACGCCAAGCCCTGGGGATATTATTTTTTACGATTGGGACGATACCGGGGCCGGGGATTGCACCGGGAACCCGGACCATGTGGGCATTGTTGAGAGCGTGAGCGGCGGAAAGATTACCGTTATCGAGGGCAATAAAAACAATGCCGTAGGCCGCAGGACATTGGCCGTAAATGGCCGCTATATCCGTGGTTATGGCGTGCCGAGATATGACAAAGAAAACGCCGGGAGCGGGTCCCAGGCCACAAAAAGCGTGGTAGCAGTAGCCAAGGAAGTAATTGCCGGAAAGTGGGGAAACGGAGAGGACAGAAAGAACCGCCTTACCGCCGCCGGGTACGATTACAAAGCCGTCCAGGACCAGGTAAACGCCTTACTGAAAGGCACCACCGCCGCCACAAAAAGCGTGGCAGCAGTAGCCAAGGAAGTAATTGCCGGGAAATGGGGAAACGGTAAAGAGAGAAAGAACCGCCTGGAAGCCGCAGGGTATAATTACAATGAGGTCCAGGCAAAGGTCAACGCTATGTTGAGATAG

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000251716c_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2LVGE) rather than this protein.
PDB ID
2LVGE
Method AlphaFoldv2
Resolution 78.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50