Protein

Protein accession
A0A6G5XZM2 [UniProt]
Representative
1m78P
Source
UniProt (cluster: phalp2_10315)
Protein name
NlpC/P60 domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 90% (predicted by ML model)
Protein sequence
MSNQSLYAMYVIGKVESNNNWSSVNFHDPITIGMMQWYGTRAYGLLNRGRSADPTGWSNFKNSANSLASQVESNSANWNLRYLTQSEGNAWIEWSKRVENHAFQQAQWEEDYHDYSGVCDRYGFPASNVKERIFFMSMYHQSPVSAFRVIGSVSSTANINLLHSAALNDQVLGNYKNRYNTVYDLLKNWDGQSAPPDFGQSGDVDSSPGGNAPTISGTSDKTAWIHVKQNTIYLHDNGKIRTFYPSSAQNYIEKFNQGTPVDNNNQTEHGSDTGSGSSSKVVEWVKARIGKYDYSQGPGRLNPDSTGYTDCSGLWWRAYMDVTGVDVGTYTGAMAQKGTLIADSNNTTIQEAISKVKSGDLLLLGKRPIFDHVEGFVSDGQAQTLSHGGPGKGPNYQNAIEEIPYFNNSWQIRRYV
Physico‐chemical
properties
protein length:416 AA
molecular weight:46266,2 Da
isoelectric point:6,06
hydropathy:-0,66
Representative Protein Details
Accession
1m78P
Protein name
1m78P
Sequence length
263 AA
Molecular weight
N/A Da
Isoelectric point
9,31473
Sequence
MSNQSMYAMYVIGTVESNCNWASVNYNDPITLGMMQWYGNRAADLIRLGAQSDPSGYAAFKSAAPTLAQQVENNSVDFPSRYVTQEEGNAWVSWAQRKENHQFQQAQWETDYTNYSQICDAQGFPGGNIRERIFFMSMYHQSPQRAFNVLRSCSATASLDLLYTTCLNDGILGKYRNRYNTVYNLLKNWDGQSAPPDFGXXXXGNPATRQREAIPHRFSQPPRTRLGLANKGIVFICTRAALRYNSRKARLRIGFRAAMEEYR
Other Proteins in cluster: phalp2_10315
Total (incl. this protein): 8 Avg length: 326,9 Avg pI: 6,90

Protein ID Length (AA) pI
1m78P 263 9,31473
5Vhn5 263 6,43897
6lGoh 342 6,50911
89wPr 249 10,53815
8oOIg 258 4,39294
A0A6G5XVW6 416 5,69995
A0A6G5YG97 408 6,20451
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9264
70WV0
16 32,3% 204 1.990E-41
2 phalp2_8094
6uNZy
1 34,1% 199 8.026E-28
3 phalp2_9457
z6kh
1 27,6% 177 1.914E-19

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 1m78P (263 AA)
Member sequence: A0A6G5XZM2 (416 AA)
1 263 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacteriophage sp
[NCBI]
38018 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN855676 [NCBI]
CDS location
range 13979 -> 15229
strand +
CDS
ATGTCGAACCAATCTTTGTATGCTATGTACGTTATAGGAAAAGTTGAATCAAACAACAACTGGTCTTCTGTGAATTTTCACGACCCCATCACTATCGGCATGATGCAATGGTACGGCACAAGAGCATATGGACTGCTTAATCGCGGGCGTTCCGCTGATCCTACGGGATGGTCTAATTTTAAAAATTCTGCAAACAGTCTCGCAAGCCAAGTGGAATCGAACTCAGCGAACTGGAATCTGCGATACCTGACGCAATCCGAGGGGAACGCTTGGATAGAATGGTCAAAAAGAGTTGAAAATCATGCGTTTCAGCAAGCACAATGGGAAGAAGATTATCACGACTATTCTGGCGTTTGCGACAGGTACGGTTTCCCCGCTTCAAACGTTAAAGAGCGAATATTTTTTATGAGCATGTACCACCAGTCGCCCGTATCCGCTTTTCGCGTGATAGGATCGGTAAGCTCTACGGCTAATATCAATCTTCTGCATTCAGCCGCGCTAAATGACCAAGTTTTAGGAAATTACAAGAATCGTTATAATACTGTGTATGATCTGCTTAAAAATTGGGACGGGCAGAGCGCACCCCCCGATTTTGGACAGTCAGGAGATGTAGACAGCAGTCCGGGCGGCAATGCGCCGACGATCAGCGGAACGAGCGATAAGACCGCATGGATACACGTAAAGCAGAATACGATATACTTGCATGATAATGGAAAAATTCGTACATTTTATCCATCGTCTGCGCAAAATTATATTGAAAAATTCAACCAAGGCACACCGGTAGACAACAACAATCAAACCGAGCACGGATCAGATACAGGATCGGGTTCAAGTTCAAAAGTTGTTGAATGGGTGAAAGCACGGATCGGCAAGTACGACTACTCGCAAGGCCCCGGGCGGCTTAATCCCGACTCTACAGGATACACGGACTGCTCAGGCTTGTGGTGGCGAGCCTACATGGATGTCACCGGCGTTGATGTCGGAACATACACTGGAGCCATGGCGCAAAAAGGAACGCTTATAGCAGACTCAAATAATACAACAATACAAGAAGCTATATCAAAAGTCAAGTCCGGCGATCTTCTGCTTTTAGGCAAACGCCCCATTTTTGACCATGTAGAGGGGTTTGTTTCCGACGGTCAGGCTCAGACGCTTTCGCATGGCGGCCCCGGCAAAGGGCCGAACTATCAAAACGCCATAGAGGAAATACCGTATTTTAACAATTCTTGGCAGATCAGGAGATATGTCTGA

Gene Ontology

Description Category Evidence (source)
GO:0008234 cysteine-type peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1m78P) rather than this protein.
PDB ID
1m78P
Method AlphaFoldv2
Resolution 82.81
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50