Protein

Protein accession
A0A249XLC1 [UniProt]
Representative
4ciq2
Source
UniProt (cluster: phalp2_37621)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSTIQKSVVGCSVAAALGLVAGLFPNELRTSPEGLALIGKWESCTLTAYRDIVQVPTIGIGSTKGVCMGQVIDLKEVAYRFKEDVKEAEQCVDRWFNGKRMPGPVFDATVSLVYNNGCYGTRWNRKANRATQIASYASQQNWDRVCYHLGDFINAGGKPSKGLINRRTGEQAWCLTGVHY
Physico‐chemical
properties
protein length:180 AA
molecular weight:19785,4 Da
isoelectric point:8,99
hydropathy:-0,20
Representative Protein Details
Accession
4ciq2
Protein name
4ciq2
Sequence length
398 AA
Molecular weight
43405,22670 Da
Isoelectric point
8,77874
Sequence
MAIQINQAGLELIKHFEGCLLIAYTDPVGVPTIGYGHTNGVMLGQRITQAEAYSLLQKDLKYFEDAIAFWAMSMSVSLNENEFAALVSLAFNIGMGAFAGSTVANELIAGNRAAASTAFLMWVRGDDKILPGLERRRAAERDLFLKRANPMTTTANFTLTFKFSSFLKSRPASSKDLSAAEMIAFSKGAILNIAAIALEKEAGHHLITLGQVNGKQLEYLGRNTLWVFGKDVEISKGGKLYANNPVDSVGAVTGKEVFIPRIGKVTTDTLIIPIMGSGGFLTWGMATHNGTRIPGGSHVGNIMEIARRFEQEVKPLLQKRTKQQIQITSWYRPEPWNSYAGGVSNSTHLSGGAIDFWVDPLSSVEAYRLLDPIWDGGLGFYANGISHLDTGANRRWQL
Other Proteins in cluster: phalp2_37621
Total (incl. this protein): 6 Avg length: 218,8 Avg pI: 8,89

Protein ID Length (AA) pI
4ciq2 398 8,77874
A0A0H4TH14 185 8,88067
A0A1K2IXT0 185 8,84798
A0A9X9E5C5 180 9,12397
A0A9E7E0L1 185 8,70370
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37664
4mZY2
1 31,2% 253 2.799E-20
2 phalp2_16330
5ni24
119 28,4% 401 6.911E-17

Domains

Domains [InterPro]
GH24
Unannotated
PET_M15
Representative sequence (used for alignment): 4ciq2 (398 AA)
Member sequence: A0A249XLC1 (180 AA)
1 398 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959, PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Aeromonas phage CF7
[NCBI]
2507411 Autographiviridae > Ahphunavirus > Ahphunavirus CF7
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MF683623 [NCBI]
CDS location
range 27282 -> 27824
strand +
CDS
ATGTCCACTATTCAGAAATCGGTAGTGGGCTGTTCAGTGGCAGCAGCTCTCGGCCTAGTGGCCGGCCTCTTCCCGAACGAGCTAAGGACCTCTCCCGAGGGCCTAGCCCTTATCGGTAAGTGGGAGAGCTGCACACTGACAGCCTACCGCGACATCGTGCAGGTGCCAACTATCGGCATCGGCTCGACCAAGGGCGTCTGCATGGGGCAGGTCATCGACCTCAAGGAGGTGGCCTACCGGTTCAAGGAAGACGTGAAGGAGGCGGAGCAGTGCGTGGACCGTTGGTTCAACGGGAAGCGTATGCCGGGCCCTGTGTTCGACGCTACAGTGTCCCTCGTCTACAATAACGGCTGCTATGGCACTCGCTGGAACCGCAAGGCTAATCGGGCCACACAGATTGCCAGCTACGCCTCCCAGCAGAACTGGGACCGGGTATGCTACCATCTGGGTGACTTCATCAATGCCGGCGGTAAGCCCAGCAAGGGCCTCATCAACCGGCGGACAGGAGAGCAAGCATGGTGCTTGACTGGCGTGCACTACTAG

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
A0A249XLC1
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
A0A9E8K2Y8
Method SMR
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
A0AAT9TU08
Method SMR
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi000bbfa1d6_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4ciq2) rather than this protein.
PDB ID
4ciq2
Method AlphaFoldv2
Resolution 87.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50