Protein

Protein accession
A0A3G2KFZ3 [UniProt]
Representative
40ve1
Source
UniProt (cluster: phalp2_4234)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MSGFTLPTSRAITQPWAAEFDDWDGDGVVDIPGGFYHSIGWFGHNGIDYGCFEGDPVHAIADGVIEFAGDAASHYLLSGGGNAILQHIPGYGVWAEYLHLSRFAVTNGQSVKKDQVIAYSGKTGSSTAAHLHLGMFASNAPNQWDGWRGRIDPTPYLYGNLNSDYTIKAHSAAIQAEKETEMPAVLHKVVSPAMNRRLAKGKAAVITTDATNTAHQNFAVGGVGTYDIQAYIAGTGLPDGQRIKGRFLIVEKGKSPSGYYPFQIDGTFDGEFNGLVGGRFKVNSGTVIWIELTSSVDTAYVASVDGSVIVHPFK
Physico-chemical properties
protein length: 314 AA
molecular weight: 33592.1 Da
isoelectric point: 5.79
hydropathy: -0.14
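The record does not say which scale the hydropathy value comes from. A minimal sketch, assuming it is a GRAVY score (mean Kyte-Doolittle hydropathy over all residues); the function name `gravy` and the choice of scale are assumptions made for illustration, not something the record confirms:

```python
# Kyte-Doolittle hydropathy index per residue.
# ASSUMPTION: the record does not name its scale; this one is used for illustration.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean hydropathy over all residues of a protein sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Illustrative fragment: the first 16 residues of the protein sequence above.
fragment = "MSGFTLPTSRAITQPW"
print(round(gravy(fragment), 2))
```

Passing the full 314-residue sequence to `gravy` gives a value that can be compared with the one listed above, provided the scale assumption holds.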
Representative Protein Details
Accession
40ve1
Protein name
40ve1
Sequence length
255 AA
Molecular weight
27492.62610 Da
Isoelectric point
6.36974
Sequence
MSGFTLPTSRPITQAWAAEFDDWDGDGVVDYPGGFYNSIGWNGHNGIDYGCHEGDPIEAIADGVVAYADWAGNHWLLSGGGIAVLIEHPAYGIQSEYLHLSRTDLQPGQRVKKGQVIGYGGSTGASTAAHLHLGILPLTGINLNNRMRGRIDPTPYLYGALNPDYAPKIQTQATVKGFLMALSDKQQTDLYNRVMRYVDSKVSDVPKRVWGTPIRRGGKQISALQELADAKTLIGKQQATIDALSEAVRKLGGAA
Other Proteins in cluster: phalp2_4234
Total (incl. this protein): 8; Avg length: 308.0; Avg pI: 6.06

Protein ID   Length (AA)   pI
40ve1        255           6.36974
A0A3G3M3K7   315           5.97119
A0A649VKK0   312           5.79169
A0A976U8L0   315           6.10248
A0A9E7U0G2   315           6.10248
A0A9X9K526   327           6.62972
A0AAE9GQR0   311           5.72774
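The cluster averages can be reproduced from the member table plus the protein described on this page (314 AA, pI 5.79). A small sketch, assuming the averages are plain arithmetic means over all eight members; the variable names are illustrative:

```python
# (protein ID, length in AA, pI) for the seven members listed in the table.
members = [
    ("40ve1", 255, 6.36974),
    ("A0A3G3M3K7", 315, 5.97119),
    ("A0A649VKK0", 312, 5.79169),
    ("A0A976U8L0", 315, 6.10248),
    ("A0A9E7U0G2", 315, 6.10248),
    ("A0A9X9K526", 327, 6.62972),
    ("A0AAE9GQR0", 311, 5.72774),
]
# The protein this page describes, taken from its physico-chemical properties.
this_protein = ("A0A3G2KFZ3", 314, 5.79)

cluster = members + [this_protein]
# ASSUMPTION: the reported averages are unweighted arithmetic means.
avg_length = sum(length for _, length, _ in cluster) / len(cluster)
avg_pi = sum(pi for _, _, pi in cluster) / len(cluster)

print(len(cluster), round(avg_length, 1), round(avg_pi, 2))  # 8 308.0 6.06
```

The result agrees with the totals line, which supports reading "Total (incl. this protein)" as the seven listed members plus A0A3G2KFZ3.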
Similar Clusters (pHMM search)
#   Cluster                # Members   Identity (%)   Alignment Length   E-value
1   phalp2_10612 (2eP5N)   1           26.3           190                3.664E-11
2   phalp2_5936 (6ajyZ)    15          31.3           172                6.625E-11

Domains

Domains [InterPro]
[Figure: domain architecture. Segments labeled PET_M23, Unannotated, Unannotated; axis 1 to 255 AA (representative); legend: EAD, CBD, Linker, Disordered, Unannotated]
Representative sequence (used for alignment): 40ve1 (255 AA)
Member sequence: A0A3G2KFZ3 (314 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Pfam accessions: PF01551

Taxonomy

        Name                             Taxonomy ID   Lineage
Phage   Arthrobacter phage Faja [NCBI]   2419957       Fajavirus > Fajavirus faja
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MH834612 [NCBI]
CDS location
range 17929 -> 18873
strand +
CDS
ATGAGCGGATTCACTCTGCCCACCTCTCGCGCCATCACACAGCCATGGGCTGCGGAGTTTGACGACTGGGATGGTGACGGCGTCGTGGACATCCCCGGTGGCTTCTACCATTCCATCGGATGGTTCGGGCACAACGGCATCGACTACGGATGCTTCGAGGGCGACCCGGTGCACGCGATCGCCGATGGCGTCATCGAGTTCGCTGGGGACGCCGCCAGCCACTACCTGCTCTCCGGGGGCGGGAACGCGATACTGCAGCACATCCCCGGATACGGGGTGTGGGCCGAGTACCTGCACCTGTCCCGGTTCGCGGTCACCAACGGGCAGTCCGTGAAGAAGGACCAGGTGATCGCTTACTCCGGCAAGACCGGATCCTCAACCGCAGCGCACCTGCACCTGGGCATGTTCGCCTCCAACGCCCCGAACCAGTGGGACGGCTGGCGTGGCCGGATCGACCCCACCCCGTACCTCTACGGCAACCTCAACTCCGACTACACCATCAAGGCACACTCCGCCGCTATCCAAGCCGAGAAGGAAACCGAAATGCCTGCCGTATTGCACAAGGTTGTATCGCCCGCTATGAACCGTCGCCTCGCCAAGGGCAAGGCCGCCGTCATCACGACCGACGCCACCAACACCGCGCATCAAAACTTCGCCGTCGGCGGGGTCGGGACCTACGACATCCAGGCGTACATCGCCGGGACCGGGCTGCCGGACGGCCAACGCATCAAGGGCCGGTTCCTGATCGTGGAGAAGGGCAAATCCCCCTCCGGATACTACCCGTTCCAGATCGACGGCACTTTCGACGGGGAGTTCAACGGCCTCGTCGGCGGGCGCTTCAAGGTCAATTCCGGCACGGTCATCTGGATCGAACTGACCTCCAGCGTCGACACCGCCTATGTGGCCAGTGTCGACGGTTCCGTTATCGTCCACCCGTTCAAGTAA
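The CDS coordinates and the protein length can be cross-checked with a few lines of code. A minimal sketch, assuming the standard genetic code and inclusive 1-based coordinates; the helper `translate` is illustrative, and only the first 48 nucleotides of the CDS are used to keep the example short:

```python
# Standard genetic code in NCBI order: bases T, C, A, G at each codon position.
# ASSUMPTION: the record does not state which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# CDS location from the record, treated as inclusive coordinates.
start, end = 17929, 18873
cds_length = end - start + 1          # nucleotides, including the stop codon
protein_length = cds_length // 3 - 1  # codons minus the stop codon

# First 48 nucleotides (16 codons) of the CDS above.
cds_prefix = "ATGAGCGGATTCACTCTGCCCACCTCTCGCGCCATCACACAGCCATGG"

print(cds_length, protein_length, translate(cds_prefix))
# 945 314 MSGFTLPTSRAITQPW
```

The 945 nt span corresponds to 314 codons plus a stop codon, matching the listed protein length, and the translated prefix matches the start of the protein sequence.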

Gene Ontology

GO ID       Description                           Category            Evidence (source)
GO:0004222  metalloendopeptidase activity         molecular function  None (UniProt)
GO:0031640  killing of cells of another organism  biological process  None (UniProt)
GO:0042742  defense response to bacterium         biological process  None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (40ve1) rather than this protein.
PDB ID
40ve1
Method AlphaFold v2
Resolution 79.71
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
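The confidence bands above can be expressed as a small lookup. A sketch under one stated assumption: the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so this version places each boundary in the lower band. The function name `plddt_band` is hypothetical:

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    # ASSUMPTION: exact boundary values (90, 70, 50) fall into the lower band.
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Illustrative scores, not taken from the record.
for score in (95.0, 82.5, 60.0, 35.0):
    print(score, plddt_band(score))
```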