Protein

Protein accession
A0A2Z2U806 [UniProt]
Representative
7vQnK
Source
UniProt (cluster: phalp2_12320)
Protein name
Endopeptidase
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MMAETKELRHAGIGIDLNVNGLEEFRKANSMLDEFMRSFHEMTGQADKLKESLGSGLNISRDVNQSKESMAGFQSEFRKTAQQADIFKHNLDFSNVGAKDTESMRKLNDQVGKLRSDKITQIKTEMQGSNKSTNDGSEAIKKYSDHVDEAHHRMRRLHDIIFGSFVGTAISNGLQNMTSGIQNVVKSGYELAEGGEQIRNQWKDIGLSKAQAKGMTDQIGEIRSKSNMAGSAIDAMQKKFYAVTNSVPQAKKFTNEIAAFGAAANKSSQQIQQISMGVAKLAGSKKVSAGFFQRSIGQLPAFQKAIISASGMTTKAFNDQLKNGKLTGAKLQQYMTTAAKMSSKEWANFSKTTKGQLAGIEGTWQNLKAKFAGPLVEGVAKALESVDSKKGGLGDVKKQLQGIAEALGAKMGNYIGEAIKFLVKNRKALSEIASSVFTIGKNLAIGAWKPIASIIKTIGGQSGKASKGLRGFGDALNAISKHKSAIQSVGKVLMGMFAAKKLLDMGSGIRGLRKHILEFTSSTRLMGAAIKLLPWALWIAGIAAAIAILVKLYQHDKKFRKFVNGIMASVRKMAKSFKNLWGDAKGIFKNGFKTIESIVNVGIDVLTGDWKGFKKDGVKLIKSFWSLAKDVFKADFDFINDLTGGKLEKMTKAFSNTWKDIGKGWKSFWNGISDWFGDLWKGIVKHVQDGINNVIKVLNSGISGIDSVIHAFGGSSKAIGTINPVHLATGTGALSGQRRAITKPTMAMLNDGHDSPETGNREMLIHPNGMSELIKGTNVMRMLEPGAEVLNATEAKMAMSMQHFASGTGFFSNLWKGTKKVAADAVGGVESGISGIGNFASKAWHGATHLLSTIQKIIAGPGKYLNSLMGKKPSGQGTILSDFAGGFYNSMKKQASTWWSSLWSMASGVLDSSGTGGSWRHDPGLTKTNGFGASRSFGSHDGVDFSGSLGSPILAVHGGKVTHTGRPLHGWPYSQLGDVITVASDDGYQEIYQEFGGMNNIKTSTGDIIKTGQKIATLGHLNGAGSGSHVHIGVSHGSLWDHGGSNTSGWYDVTKMHGKDNGSSKLSHSHTGGAMHKLIQQETGGMMGWIKKHLSPLMDDGGGSMGNLGGAGVQRWRSYVKKALSALNLSTSGSMVDRILRQINTESSGNPKAMGGTDGLSDGHAEGLMQVKPGTFAANKLSGHGNIWNGYDNILAGLNYAKHRYGSGLSFLGNGHGYAKGGKIPKGQLSVVGEKGWELFQPNTSGTVIPHEASERLINGSGKGKVSISAPTKVVIQGNADKSAIDELDSRLEKRNDDLVEKLRELWGLNDEGGLTV
Physico‐chemical
properties
protein length:1317 AA
molecular weight:140855,0 Da
isoelectric point:9,74
hydropathy:-0,33
Representative Protein Details
Accession
7vQnK
Protein name
7vQnK
Sequence length
1662 AA
Molecular weight
179317,71990 Da
Isoelectric point
10,04722
Sequence
LQSEGKTYQANKAQVEAYRSAVKTLTNQQEKLEQSLNKIAETSGKTSEEYRAQQILVNKNATEINRFKSSINSLNDEMRRSNPTFMDKVRSKLTATNTEAEKTHSLFKSMFAANIATNAVTSGFNIVRNHLGGMIRSAHEYNIQQQTMDATWLTLTGHANKGKAMVEQIDNMAAAAQNDTHMVDTLSQKFYAINKSPEQTAKLTKSILTLQDAFGKTDPEVENFATQFSQMMANQKVSAQDMLSFVNVFPEYRLELLKTEQQQTHNSKLTMKQFNKLMSAGKISSKMAIDTLERMSNKYKNATDNFTKTIPGMIRTVKSQVPRLVSAFDEPFTKMENPLIKQVSDWATSKETEKAFGRLGKTVSTGFNRVMTHTFTVGQKRPKPLTRAKRAQMDRFLAANAYSGQLPKNQLQKVMRELPKSERNSISILNRKLPHSVLEPVMNGNKSSKPGTLAGVGNEHLKKQLSYYQGLVKAEKSYRAGQNKTVTLTDILNKGIDKLNHGLGQLFNYLSKHGKDIKDIGKSLFSITGTIARGVWKDFSAIAINIGKSLGLIGKNADKNGGSLHAIAEMLNNIAKNKSALKKVSDIIVAISAAKLFKQTSTPFIGLAKGSYKAFLRVRGLARGLKGVNDAAKLSKMGDIEKTFFNIGSNARKAAGSVKDFAKNFKSLSNLKKFGKGLFSAKGGAGKLSGLLQSAHSAKGFKNLSTAGKWGTGLAAAGVAVDAGASLIDAVKNRHSATKRSKAIGKSIGSALGGGIGLWFGGPLGAALGAKIGGIVGKWGGSAVNKFTKGWQRKKPPKKFWSLANLGWSAHSMWNGFTSSVGKTINWFKKHWQVVGSFLLHPFATGFGLLYKYNKGFHKWIDGLTGYFRKKFAPLSDWFHDHIAKPIGDISGKVADFFTGGHGSKGGSKRAKAHARGGLMTSTHGALVGEAGPELAYKPYANHVRLLGANGPQFAKVHAGEKILNARDTHKVMTGGLGRGLVLKGYANGNTGLAKTTKNVSRDYKKINKTSTSQLNSLSKKSKQTWSGITSHTTKQAEKTRKAAISKYTSMRKGVHKQMDAMHDGVISLAGTTAKGFGKELDHMTKYAHSAMSDTIGQINGGIRGIDKVLGQFGGNTSVIKPVKFAKGTDANGRLTQSTYAMVNDATTGPRQEALISDKNEVFMPQGRNVRMVIPKGWGVLNGTQAQQAGLTHFAKGSGIGHNQLKKIAERAGNNPAKSFAEMYSKFIKPAGADLKEGAESLAKNASTHYGNPWSGAMWSVINNAINDGGGATGKASGLLKAVEKNGEGHRYVWGAEGPNTFDCSGLVKYTLEHDFGIDYPHFSGSQYSRTQHISKSEARMGDLVFWGAGGSEHVGVYAGGNKYFSAQSPAQGIHMNTLASVRNEGAPMFGRVKGLKAPKAAAKKAKRPDSRLTALAKRELGPSALKWIKDNLGDTAGGSFGNPAGDGVQRWKPLVKKALGVLNLSTSPSMVSRVLAQIATESSGNPKAMGGTDGLSDGHAEGLMQVKPPTFNAYKLKGHGNIWNGYDNMLAGLNYAKHRYGQSLYYLGQGHGYAKGGKPKAHTPFIAGEQGPELITADGPVKVDTHEQTKRKFAELGDLIKPPKVHKGGSGKPQLPPINININGPISSEKDANKVAQIVKRELATVLENIGDEFGGDPTVY
Other Proteins in cluster: phalp2_12320
Total (incl. this protein): 16 Avg length: 1695,8 Avg pI: 9,84

Protein ID Length (AA) pI
7vQnK 1662 10,04722
1FCvJ 1675 9,78058
1jEKX 2091 9,98469
2Fmf 1645 9,89553
61pR6 1599 9,63894
7WDgg 1672 9,69535
7vzxt 1645 9,87316
7xRob 1819 10,07700
A7DYB8 1733 9,73062
A0A0K0MWQ1 1718 9,73313
G7YZ66 1733 9,71450
A0A6G5Y172 1633 9,93092
A0A6G5Y1P1 1820 9,93859
A0A6M8EZM5 1733 9,72746
A0A6G5YDB8 1638 9,89946
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3742
5tUpG
27 46,8% 1271 9.010E-300
2 phalp2_32239
7pZ5j
148 29,4% 1986 5.178E-235
3 phalp2_39680
eQI
275 25,1% 1681 2.295E-166
4 phalp2_36199
7blf7
55 26,0% 1662 2.189E-157
5 phalp2_10036
7fPkw
316 24,6% 1599 2.264E-141
6 phalp2_7660
6v5eq
4 26,8% 1148 6.062E-128
7 phalp2_3948
7g15J
9 22,0% 1650 1.682E-119
8 phalp2_8739
8tQlt
110 22,7% 1898 6.413E-104
9 phalp2_38613
291uz
173 23,8% 1542 8.748E-96
10 phalp2_36980
8KjJ4
4 23,7% 1610 4.586E-75

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus phage Lb
[NCBI]
2048517 Heilongjiangvirus > Heilongjiangvirus Lb
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MG020111 [NCBI]
CDS location
range 4573 -> 8526
strand -
CDS
ATGATGGCAGAAACTAAAGAGTTGCGGCATGCCGGTATTGGCATTGATCTTAACGTCAATGGACTAGAGGAATTTCGTAAGGCAAACTCGATGCTTGACGAATTCATGCGTTCATTTCATGAAATGACTGGTCAGGCTGATAAACTAAAAGAATCACTAGGTTCCGGGCTTAACATATCTCGTGATGTTAATCAGTCTAAAGAAAGCATGGCTGGCTTTCAAAGTGAGTTTCGTAAGACGGCGCAGCAGGCTGATATTTTTAAGCACAATCTGGACTTTTCTAATGTAGGGGCTAAAGATACTGAATCGATGCGTAAGCTCAACGATCAAGTCGGCAAGCTACGTTCTGACAAGATTACGCAGATCAAGACTGAAATGCAGGGATCAAATAAGTCAACGAACGATGGTTCTGAGGCAATTAAAAAATATAGTGATCACGTAGATGAAGCTCATCATCGCATGCGTCGGCTACATGATATTATCTTCGGCAGCTTTGTAGGAACGGCCATTTCTAATGGATTGCAGAATATGACATCAGGTATTCAAAACGTCGTCAAATCGGGCTATGAATTAGCTGAAGGTGGCGAACAAATTCGCAATCAGTGGAAAGATATTGGGCTAAGCAAAGCACAAGCCAAGGGCATGACCGATCAGATTGGTGAGATTCGTAGTAAGTCAAATATGGCTGGTTCAGCAATTGATGCGATGCAAAAGAAGTTTTATGCAGTGACGAACAGTGTGCCACAGGCAAAGAAGTTTACGAACGAAATTGCAGCGTTCGGTGCAGCTGCAAACAAGTCCAGTCAGCAGATTCAGCAAATCTCTATGGGTGTTGCTAAATTAGCTGGATCTAAGAAAGTTTCGGCTGGATTTTTCCAACGTTCAATTGGACAGCTACCAGCATTCCAAAAAGCAATCATTTCAGCTAGTGGCATGACGACCAAGGCTTTTAACGATCAGTTGAAGAATGGCAAACTGACTGGTGCTAAGCTACAGCAGTACATGACAACTGCCGCTAAGATGAGCAGTAAGGAATGGGCCAACTTTAGCAAGACGACTAAGGGGCAGTTAGCCGGAATTGAAGGGACTTGGCAAAACTTAAAAGCAAAATTCGCTGGTCCTCTCGTTGAGGGCGTAGCTAAGGCTTTAGAATCGGTTGATAGCAAGAAAGGTGGCCTAGGCGACGTTAAAAAGCAATTGCAAGGCATTGCAGAAGCTTTGGGCGCTAAGATGGGCAATTATATCGGTGAGGCCATCAAATTCTTGGTTAAAAACCGGAAGGCTTTGTCTGAGATCGCTAGTTCTGTATTTACCATTGGTAAAAATTTAGCTATTGGGGCCTGGAAGCCTATTGCGTCGATAATCAAAACTATTGGTGGTCAAAGTGGCAAAGCTTCTAAAGGTTTGCGTGGCTTTGGAGACGCGTTAAATGCTATTTCTAAACACAAGAGTGCTATTCAGTCTGTAGGTAAGGTTCTTATGGGTATGTTTGCCGCCAAGAAGCTTTTAGACATGGGCAGTGGGATTCGGGGACTAAGAAAGCACATTTTAGAGTTCACATCATCAACGAGATTAATGGGAGCAGCCATTAAATTGCTTCCCTGGGCTTTGTGGATTGCAGGTATTGCCGCGGCAATTGCAATCTTAGTTAAGCTATACCAGCATGATAAAAAATTCCGCAAGTTTGTTAATGGCATTATGGCATCGGTTAGAAAGATGGCTAAGTCGTTTAAAAATTTGTGGGGAGACGCCAAAGGCATCTTTAAAAATGGATTTAAGACAATTGAAAGCATTGTTAATGTTGGAATTGATGTTCTAACTGGCGATTGGAAAGGATTTAAGAAAGACGGCGTTAAGCTGATCAAATCATTTTGGTCCTTAGCCAAAGACGTCTTTAAGGCTGACTTTGACTTTATCAATGATCTGACTGGTGGAAAATTAGAAAAAATGACTAAGGCATTTAGCAATACCTGGAAAGATATTGGCAAGGGCTGGAAATCATTTTGGAATGGGATATCTGATTGGTTTGGCGATCTCTGGAAAGGCATCGTTAAGCACGTTCAGGACGGTATCAACAATGTTATCAAAGTTCTCAACTCAGGGATCAGCGGTATTGATTCAGTCATTCATGCATTTGGTGGATCTAGCAAAGCAATTGGGACGATTAATCCAGTTCACTTAGCAACTGGGACCGGTGCTTTATCTGGTCAGCGTAGAGCAATTACTAAGCCAACCATGGCTATGTTGAATGATGGCCATGATTCGCCAGAAACTGGTAATCGAGAAATGCTAATTCACCCTAATGGTATGAGTGAACTGATTAAGGGAACCAATGTTATGCGCATGTTAGAGCCGGGCGCTGAAGTGCTGAATGCCACGGAAGCCAAAATGGCTATGAGCATGCAACACTTTGCTTCGGGTACTGGCTTCTTTAGTAATCTATGGAAGGGGACTAAAAAGGTGGCTGCTGACGCAGTCGGTGGTGTCGAATCAGGCATTTCAGGCATTGGTAACTTTGCGTCGAAAGCTTGGCATGGTGCGACACACTTGCTGAGCACGATTCAAAAGATTATTGCCGGCCCCGGCAAGTATTTAAATAGTCTTATGGGCAAGAAGCCATCAGGACAAGGCACTATTCTTAGTGACTTTGCCGGTGGCTTTTATAATTCCATGAAAAAGCAAGCCTCGACTTGGTGGTCCTCACTTTGGTCGATGGCGTCTGGAGTGCTAGATAGTAGTGGAACCGGTGGCAGTTGGCGTCATGATCCAGGATTGACTAAGACCAATGGATTTGGAGCATCTCGTAGCTTTGGATCACATGATGGCGTAGATTTTTCTGGATCGTTGGGATCTCCTATTTTAGCAGTTCATGGTGGTAAAGTTACACACACCGGTCGGCCATTACACGGATGGCCTTATAGTCAGCTTGGAGATGTTATCACAGTTGCTAGTGATGATGGATACCAAGAAATTTATCAAGAGTTTGGCGGAATGAACAATATTAAAACCAGTACGGGAGATATCATCAAGACTGGGCAGAAGATTGCTACTTTAGGTCACTTGAATGGGGCTGGTAGCGGATCACACGTTCATATTGGGGTGTCTCATGGTTCCCTTTGGGACCATGGTGGATCTAATACTAGTGGGTGGTATGACGTTACTAAGATGCATGGTAAGGATAATGGGTCATCAAAACTAAGCCACTCTCACACTGGTGGAGCTATGCATAAGCTCATTCAACAGGAAACTGGTGGAATGATGGGGTGGATTAAGAAACATCTATCACCGCTTATGGATGATGGTGGTGGCTCGATGGGCAATCTCGGTGGCGCCGGTGTTCAGCGTTGGAGATCTTATGTCAAAAAGGCCTTGAGTGCTTTGAATCTGTCTACTTCCGGATCAATGGTCGATAGAATTCTACGTCAGATCAATACGGAATCTAGTGGTAATCCAAAGGCTATGGGTGGTACAGATGGATTGAGTGATGGTCATGCAGAAGGACTTATGCAAGTAAAACCGGGAACTTTTGCTGCCAATAAATTATCCGGACATGGCAACATTTGGAATGGATATGACAATATCCTTGCTGGACTTAACTACGCCAAGCACCGATATGGAAGCGGCCTAAGCTTTTTGGGAAACGGACATGGTTATGCTAAGGGCGGAAAAATACCAAAGGGCCAACTTTCAGTTGTTGGGGAGAAAGGTTGGGAACTTTTCCAACCCAACACTTCGGGAACAGTGATTCCACACGAAGCTTCAGAGAGGTTGATTAATGGTAGCGGCAAAGGCAAAGTCTCAATTAGTGCACCCACTAAGGTAGTTATTCAAGGTAACGCTGACAAGTCAGCAATTGACGAGTTAGATAGCCGGTTAGAAAAACGTAATGATGATTTAGTTGAAAAGCTCCGTGAACTTTGGGGACTAAATGATGAAGGAGGGCTTACTGTCTGA

Gene Ontology

Description Category Evidence (source)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7vQnK) rather than this protein.
PDB ID
7vQnK
Method AlphaFoldv2
Resolution 54.51
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50