Protein

Protein accession
A0A0U4K3I2 [UniProt]
Representative
6Fcrq
Source
UniProt (cluster: phalp2_19946)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSKTMTAAQFLAALKAEGVPVRERKGWRTHNRNHKGASGPHNGVMVHHTASGSDGIESFVSNGNSALPGPLCHGLIEKSGRVVLIGWGRANHAGGGDPDVLAAVRAERWPVPKTNEHDGSAGSVDGNAHFVGYECVNQGNGKDPWPAIQLEGIARACAAVCRFHGWTVDSVIRHLDWSDWKVDPRGIDWNKMRARITTILKGKPNATPFASWADGDGGDAEQPEQPQPEQPDADAYPGAAAFGPGKSGLHVTRLGQMLVKRGGSRFYSVGPGPRWGEADRKATEAFQRAQGWSGSDADGIPGGTTWRLLVTGTGKSIPAAAPAKKPKVSLANIIAASRKDPGAPQGKTSYPADVRLVEAALKKLGFLGATYASDGAYGTVTVAAYNAFRRSIGLKGADATGDPGRLSLGTLGSRSGLFTV
Physico‐chemical
properties
protein length:420 AA
molecular weight:44058,0 Da
isoelectric point:9,55
hydropathy:-0,42
Representative Protein Details
Accession
6Fcrq
Protein name
6Fcrq
Sequence length
445 AA
Molecular weight
46445,37780 Da
Isoelectric point
7,19038
Sequence
MPDRWMPGAEIHDIGDHAPTDGGPAKAIAHITWDKNASAGAPQDWVSFDALVNYFTGSGAGAAPHIIWDPFSGRIAQLVPADSRSKSVVDSAGGTRTNRAGSVVIQVEAVFFPYCRKGGQVYPRLVDTPCAGWDRLHAWIASWGVPDMWPMGRPVDFTSHRSESVWESQGGWYAHAHVPENDHQDPGSWPAFSSSPSPAPPVPATTRVTVRVGQTLTAIAAAAGVALAVILGLNPDVARHPDAIRPGDSIVVPAVPGQVPVPSQDPVPAPSAPGGGFPGASTFGPGASNANVTLLGQMLVARGAARFYAVGPGPAWGDADRRATEAFQLAQGWTGSDADGIPGATTWDYLLTGKGHDIPAAAKVASAPAFPGAAKFGPGQSNAYVTQLGQQLVRKGYGRYYTKGPGPTWGKADRLNVQAFQRAQGWRGSGADGIPGPRTWALLFS
Other Proteins in cluster: phalp2_19946
Total (incl. this protein): 4 Avg length: 435,0 Avg pI: 8,67

Protein ID Length (AA) pI
6Fcrq 445 7,19038
7uicA 453 8,44447
A0A0E3JQE4 422 9,49395
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3990
7zjtr
195 50,7% 349 4.340E-117
2 phalp2_16564
7umlh
28 35,6% 451 2.417E-73
3 phalp2_19394
2oOR4
46 32,0% 321 9.517E-27
4 phalp2_8051
5Ickg
15 26,5% 463 2.181E-14

Domains

Domains [InterPro]
Unannotated
Unannotated
Unannotated
Unannotated
Representative sequence (used for alignment): 6Fcrq (445 AA)
Member sequence: A0A0U4K3I2 (420 AA)
1 445 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptomyces phage Maih
[NCBI]
1775283 Woodruffvirus > Woodruffvirus TP1604
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KU189325 [NCBI]
CDS location
range 25551 -> 26813
strand +
CDS
ATGTCGAAGACGATGACCGCCGCGCAGTTCCTCGCGGCACTGAAGGCCGAGGGCGTCCCGGTCCGTGAGCGCAAGGGCTGGCGTACCCACAACCGGAACCACAAGGGCGCGTCCGGCCCGCACAACGGCGTCATGGTGCACCACACCGCCTCGGGCTCGGACGGTATCGAGTCGTTCGTCAGCAACGGCAACAGCGCGCTGCCGGGGCCGCTGTGCCACGGGCTGATCGAGAAGTCCGGGCGCGTGGTCCTGATCGGGTGGGGACGCGCGAACCACGCGGGCGGCGGTGACCCTGACGTCCTGGCCGCTGTCCGTGCCGAGCGCTGGCCGGTGCCGAAGACGAACGAGCACGACGGGTCCGCCGGATCGGTGGACGGCAACGCGCACTTCGTCGGGTACGAGTGCGTCAACCAGGGAAACGGGAAGGACCCGTGGCCGGCCATCCAGCTGGAAGGCATCGCGCGCGCCTGCGCGGCGGTGTGCCGCTTCCACGGCTGGACCGTGGACAGCGTGATTCGTCACCTGGACTGGTCGGACTGGAAGGTGGACCCGCGCGGCATCGACTGGAACAAGATGCGGGCCCGTATCACGACCATCCTGAAGGGGAAGCCGAACGCCACGCCGTTCGCGTCGTGGGCCGATGGCGACGGGGGCGACGCCGAGCAGCCCGAGCAGCCGCAGCCCGAGCAGCCGGACGCTGACGCCTACCCCGGCGCTGCTGCCTTCGGCCCCGGCAAGAGCGGCCTGCACGTGACGCGCCTCGGGCAGATGCTGGTGAAGCGCGGCGGTAGCCGCTTCTACAGCGTGGGTCCCGGTCCCCGGTGGGGTGAGGCGGACCGGAAGGCCACCGAGGCGTTCCAGCGGGCGCAGGGCTGGTCCGGCAGCGACGCGGACGGCATCCCCGGCGGGACCACGTGGCGGCTGCTGGTCACCGGGACCGGCAAGAGCATCCCCGCAGCCGCGCCCGCGAAGAAGCCGAAGGTGTCCCTGGCCAACATCATCGCGGCGTCGCGGAAGGACCCCGGTGCCCCGCAGGGAAAGACGTCCTATCCGGCGGACGTGCGGCTGGTGGAAGCCGCGCTGAAGAAGCTCGGGTTCCTCGGGGCCACGTACGCGTCCGATGGCGCGTACGGAACCGTGACGGTGGCCGCGTACAACGCGTTCCGCCGCAGCATCGGACTGAAGGGCGCGGACGCCACCGGCGACCCCGGCCGGCTGTCCCTCGGGACACTGGGCAGCCGCTCGGGTCTCTTCACCGTGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0005feb3ef_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6Fcrq) rather than this protein.
PDB ID
6Fcrq
Method AlphaFoldv2
Resolution 87.87
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50