Protein

Protein accession
A0A889IRK7 [UniProt]
Representative
1gMiR
Source
UniProt (cluster: phalp2_6589)
Protein name
DUF1906 domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTIYFDASAAYPPAEAVKAAGVSGVVAYLSGRRPGAEWMNAKPLTRDVADSYRAAGLEIVSNYQFGKGSTADWLGGEPAGIYHATLALAFHQQAGGPDTAPIYAPVDDGFGPGYPYSVDHWNNHVAPFLRGWQKVLGKERTGVYCNYRCIDWALEDGLGTFFWQHGWGKPSYDTPHQAAHLLQYEIDKRQIGGVGVDLNRVLKENHGQWSAYADNTWDAAYVQLMGPIR
Physico‐chemical
properties
protein length:229 AA
molecular weight:25153,8 Da
isoelectric point:5,86
hydropathy:-0,35
Representative Protein Details
Accession
1gMiR
Protein name
1gMiR
Sequence length
697 AA
Molecular weight
77386,97550 Da
Isoelectric point
5,87109
Sequence
MDKKGVRIMADYTRYPEAFTAAGIPFTEEPGWRDRGHGDVTDTRFIVIHHTASANNDAAGIGPVRDGVAGLEGPLSQLCLKRDGVPHIIAAGVSWHAYGTISYRGVAPKMGNYYSIGIEGIDSGYNTWTDQQRDMYPKVVAALLKDMGLPADAWIFHRDYQPGEKIDPGGFDKAWFDRQVRAAYNGIITETAIQAKRRENPWLGNRVIKEEESPTLDGIGRYAQYESGYIYWHPNTGAVTINLDFWPRFEQERWEQGDLGYPVKDAQAVEGGSFQVFQNGNVLVLNGATKVVSRMYGMIGGRWNALGGVKSELGFPVREEIVLPDGEGRLAQFDHGHIYWHPRTAVAKDILNDGPGGTWEEFVRLDYEKGPLGYPVGDSILSLDGRAKIQAFEHGTIYNLFKTEKIDAHAVWGQIFAMYAQLGYENGRLGLPISDVYRNGEVLRSDFEAGSIEWNDKTNDIYMVISGKRVDIPLPKPDPKPEPKPEPPIQGDPNLVGKTLLDFSVNQVPAKDIKAAGHVGVIHYVSDPREKWMKAKPVTKWYASNLTANGLLNVSNFQYGKGSTSDWRTGYANGVKCAQRALELHKAAGGPDSAPIYMSIDDNPTDNELTDLIKPYLEGAQSVLGKDRMGVYGNRKTIEFANKNGLGTYYWQHFWNGTGDRTVSPIANITQDRIDKDTVGGIGVDVNTIRKAYFGQW
Other Proteins in cluster: phalp2_6589
Total (incl. this protein): 2 Avg length: 463,0 Avg pI: 5,87

Protein ID Length (AA) pI
1gMiR 697 5,87109
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27885
8LlFs
25 43,6% 474 5.529E-101
2 phalp2_7159
2QxCN
1 29,1% 466 6.689E-24

Domains

Domains [InterPro]
Representative sequence (used for alignment): 1gMiR (697 AA)
Member sequence: A0A889IRK7 (229 AA)
1 697 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510, PF08310, PF08924

Taxonomy

  Name Taxonomy ID Lineage
Phage Nocardia phage NC1
[NCBI]
2805752 Zierdtviridae >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MW452562 [NCBI]
CDS location
range 46227 -> 46916
strand +
CDS
GTGACCATCTACTTCGACGCGAGCGCCGCGTACCCTCCCGCAGAAGCCGTGAAGGCCGCTGGCGTGAGCGGGGTCGTGGCGTACCTCTCTGGCCGCCGACCCGGCGCGGAGTGGATGAACGCCAAGCCGCTGACCCGTGACGTCGCCGACTCCTACCGCGCGGCCGGTCTGGAGATCGTCTCGAACTACCAGTTCGGCAAAGGGTCCACGGCCGACTGGCTCGGCGGCGAGCCAGCGGGGATCTACCACGCCACTCTGGCACTCGCGTTCCACCAGCAGGCCGGTGGCCCCGACACTGCTCCCATCTACGCACCGGTCGACGACGGTTTCGGTCCCGGCTATCCCTACTCGGTCGACCACTGGAACAACCACGTCGCGCCGTTCCTGCGCGGGTGGCAGAAGGTCCTGGGCAAGGAACGCACCGGCGTCTACTGCAACTACCGCTGCATCGACTGGGCTCTGGAGGACGGCCTCGGCACGTTCTTCTGGCAGCACGGCTGGGGTAAGCCCTCCTACGACACGCCGCACCAGGCTGCGCATCTGCTCCAGTACGAGATCGACAAGCGCCAGATCGGCGGGGTCGGGGTGGACTTGAACCGCGTGCTCAAGGAGAATCACGGGCAGTGGTCCGCGTACGCGGACAACACCTGGGACGCGGCATACGTGCAGCTCATGGGTCCCATCCGATAG

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gMiR) rather than this protein.
PDB ID
1gMiR
Method AlphaFoldv2
Resolution 88.21
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50