Protein

Protein accession
G9FHJ0 [UniProt]
Representative
7ebmV
Source
UniProt (cluster: phalp2_28062)
Protein name
Lysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MQEIDQTNISKNSSGRYGGRIRLFVIHTQEGNGTARSLAGYLQNPSSGVSYHYSIDNTECIAVVDTDRSAWAVLDANGYTINLCYAGSRASMSRDEWIAKYSNAIDFTGWLIARDAKQYGFTPYVIGHPEIRAGKQGTTDHYGITKGLGIGDHTDVGGGYPWDLLRKAIDKYMGNTAPLPPAPLPIVNPIDECRKANDWLGDALDGQKPCPDKKGQFRYFKNGAVYWTPETGARAIPSELLKKFADLKWETGPLGYPTNDRTILNGPNGRPWGVVQGFNGGNLYRKYGTDEAFWTHGEIGNRWARSGFENGPLGWPLSDELPFDGGVVQYFENGQIHWPGKLKTIALLTKDGADTPLADQGK
Physico‐chemical
properties
protein length:362 AA
molecular weight:39842,0 Da
isoelectric point:6,40
hydropathy:-0,57
Representative Protein Details
Accession
7ebmV
Protein name
7ebmV
Sequence length
528 AA
Molecular weight
57292,20990 Da
Isoelectric point
6,36991
Sequence
VSAILYPLAAGSYQISSGFGPRWGSQHNGLDFSAPLGTPIYAVADGTVVEGSERKAGTVSGFGNWVWIDHQPNLGVDTIYGHVRHGDIKVRAGQKVRAGDLIALVGNEGQSTGPHLHFEVWTPPGRIGGRPIDPAGWLAGAAHPGAPKPAPKPAPAPSPTPEKSTVQPDYREIIRFGPSSSSRNGARISNFLLHTEEGNASAEGLAAFCNNPGNGASYHYVVRDGIVVCINDTDRASWSVLNANPYTINLVFAGSKAAWSREQWLAREGDIRIAAWLAVQDAKKYGFSVEVNPRPYPGGRRDGFSDHGYVTRVLRIGNHTDTGDKFPWDRFAAFVDEFVNGKAPAPAPAPVVVNKIDEMATAATWLGKRLTGEETCPDNVGKFAHFENGSIYWTPSTEARPIPKYLMESYETYKWETGPLGYPIAYHTSLTDSAGKWEADVQAFERGVLYRRVGSPLGFFVTGRIGARWAREGYEKSAYGYPTSNEIHLADGTIIQNFEHGRIAWNPDGTLGLKLDVGDEQIIASQVH
Other Proteins in cluster: phalp2_28062
Total (incl. this protein): 12 Avg length: 471,6 Avg pI: 6,77

Protein ID Length (AA) pI
7ebmV 528 6,36991
1fRGI 368 4,77274
3lLU9 559 8,42152
4Lzw9 495 4,80985
7k0Gt 531 7,40592
7kfbc 521 6,33126
7ssKL 572 8,50501
7yPfS 529 5,98363
A0A1C9EHM2 472 5,91389
A0A9E7NI91 361 8,14037
A0AAE8XAF8 361 8,14037
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_35332
7m5Au
49 58,5% 355 1.330E-162
2 phalp2_30754
7C7wA
28 55,1% 397 2.743E-160
3 phalp2_30716
7i7AZ
2 38,7% 361 8.281E-90
4 phalp2_36825
6W23o
94 40,6% 349 1.446E-84
5 phalp2_2847
8LyZr
374 30,4% 394 2.979E-52

Domains

Domains [InterPro]
PET_M23
Unannotated
LGFP
Representative sequence (used for alignment): 7ebmV (528 AA)
Member sequence: G9FHJ0 (362 AA)
1 528 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551, PF08310

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhodococcus phage REQ1
[NCBI]
1109712 No lineage information
Host Rhodococcus equi
[NCBI]
43767 Actinobacteria > Actinobacteria > Corynebacteriales > Nocardiaceae > Rhodococcus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JN116825 [NCBI]
CDS location
range 49534 -> 50622
strand +
CDS
ATGCAAGAGATAGACCAGACCAATATCAGCAAGAACAGCAGCGGGCGATACGGCGGCCGGATCCGGCTGTTCGTCATCCACACCCAGGAGGGCAACGGCACCGCGCGCTCGCTGGCGGGGTACCTGCAGAACCCCTCGTCCGGTGTCAGCTACCACTACTCGATCGACAACACCGAGTGCATCGCGGTGGTCGACACGGACCGTTCCGCGTGGGCAGTGCTCGATGCGAACGGCTACACGATCAACCTGTGCTACGCCGGTAGCCGGGCCTCTATGTCACGTGACGAATGGATCGCCAAGTACAGCAACGCGATCGACTTCACCGGCTGGCTGATCGCACGCGATGCGAAGCAGTACGGCTTCACGCCCTACGTCATCGGGCACCCGGAGATCCGCGCGGGCAAGCAGGGCACGACCGACCACTACGGCATCACCAAGGGCCTGGGGATCGGCGACCACACCGACGTCGGCGGCGGCTACCCGTGGGACCTGCTGCGCAAGGCGATCGACAAGTACATGGGTAACACGGCCCCGCTGCCGCCCGCCCCGCTGCCGATCGTCAACCCGATCGACGAGTGCCGGAAGGCGAACGACTGGCTCGGCGACGCGCTCGATGGCCAGAAGCCCTGCCCGGACAAGAAGGGCCAGTTCCGGTACTTCAAGAACGGCGCGGTCTACTGGACCCCGGAGACCGGCGCGCGGGCGATCCCGTCGGAGCTGCTCAAGAAGTTCGCCGACCTGAAGTGGGAGACCGGCCCGCTCGGCTACCCGACGAACGACCGCACCATCCTCAACGGGCCGAACGGCCGTCCGTGGGGCGTCGTGCAGGGATTCAACGGCGGCAACCTGTACCGCAAGTACGGCACCGACGAGGCCTTCTGGACGCACGGCGAGATCGGCAACCGCTGGGCGCGTAGCGGATTCGAGAACGGTCCGCTCGGCTGGCCGCTGTCCGACGAGCTGCCGTTCGACGGCGGAGTCGTCCAGTACTTCGAGAACGGCCAGATCCACTGGCCCGGCAAGCTCAAGACCATCGCGCTCCTGACCAAGGACGGCGCGGATACACCCCTGGCTGACCAGGGTAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00023eec75_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7ebmV) rather than this protein.
PDB ID
7ebmV
Method AlphaFoldv2
Resolution 87.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50