Protein

Protein accession
A0AAF0JQ65 [UniProt]
Representative
4UrxT
Source
UniProt (cluster: phalp2_31965)
Protein name
Lytic tail protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAQGLERRAQPVDNLRGAEAVEPVTRSVPSVGTPPTPSVPVESYGAQIAQQVAQFASGKLAEIQQKRQEQSMIDGQIAAMQGDSFESVEMGGDKWALEGYRVVTAQTMSAGLLRAQEQEIASGAYEQDPDAYRQRLVGRIDAMTSEIGDERTRELARENLMRQMPTLVDAHMRQNLAFREKQNFDALAQSVDVLSRDNSSTGALVAFATGQSEATAGLSIERRRTAVVQGVVNAFENDNPAAYAHLEAAGFFTTENLTASQLRTIRSAQASYHSRMEGTWNAEWHAENTRIQDAVRSGDLDPLVAVEQYAANNATHGRRTSAANAGQVYDAARAGVEFAEGTRGLNIQAAGAAGDYDLQARLMQAAVIHQESRGNPNAVSPVGATGIMQLMPGTAMSPGFGVRNIFQVARDMGVPVMGETEEVAQILMRNEEVNKAMGTEYLSAMLERYNGDVPRALAAYNWGAGNADNWDGDMASLPAETRGYIRNITGSWEDNIPDPQADRIAAERNLQRVREQAALTAYENATPQLDILDEQYRRGQIPHTQWIEQSREVMGQWGMEVDMQRLNHEAQINRSVIRGIADRAETAAATENEIALAADLRVAEAEYTQVVEAVRNGTMGQPELQTAMQNFMQTRRNLHEQYGVPLDAGTEIGQQDADVRAAIEAVRVGRVAAEERAVRERAAAMGTAGALPVEQQQRLVQENDARLRQQYTDAVAAGQIDPQLAESAFQQERMQFFAESGIVQPTQRRVINAGLDQEPMIDGQVNPAFQEAVQAYMTMRDVNPTVADRYIDPENRAVLDTIVGRVGTGGNVSAAIYGYANQMSRASTSPYRTPDEFVQDAGVQRRIASAVDNYIASSDIGMFQAVFWSDADVSQVWDRRSSFDLDVHQDALRTALEQEVAAGYQHNPNLRVADQVAMAAERVQRRYAFAGGDLIDVGQGNDAHELFFGNRGPEMSGQQDSINAAIMHYLRSDEFREEYPDIENSTGGEILGTVGLWNGFQGGLNSVFGTNFDTETSGLGPRGAFSTLVTGVRPFRVISNPMAGGQTRIAIEYNLPGGGFSQPIEIDPADIGARYLDHIRQNNR
Physico-chemical properties
protein length: 1084 AA
molecular weight: 118526.9 Da
isoelectric point: 4.75
hydropathy: -0.47
Representative Protein Details
Accession
4UrxT
Protein name
4UrxT
Sequence length
710 AA
Molecular weight
76114.96880 Da
Isoelectric point
5.04494
Sequence
MVSVPIARQTARKTQATGARTSGVVDISGGLNRVIGAVDENNQRIAKVKAQQADLEFDGELFEMLHGEDGYLSQRGQNAAEGINDFADRARALYDQKMSGLDGYTKQAAGPAFNARLRSALHGASRHATEQRGAWEKGLYEARVSQAVNGAVAARTPEEMQTNLLSGKIAIQNRGAEMGRSREEIEQDVLAFTSDVHASTALRMSADDPRASLAYVEENRDAMLPQVADQVVLKLSGEAARRNGMDVAQGAWMEAHSAGPVNAALQSGDYQAAAASILGGLVKVESGGNPNAVSPVGAIGLTQVMPATARAPGYGIADVFSLAESMGVSFADRSDASVEALLKNPEIALQYGENYLAAMLHEYDGNLPMALAAYNAGPGKVDEWIESIGDPRDGAISASEWVDQIPYDETRNYVPSVMAKARGGAADPMAVAASITDPDARAAAIARTRQLDAVAAGRRKREAEQAKQYAFAHVEEGGSVDDLPMEMRVGLGREYVSGLRTYEQKMAAGTPIETDLATYAMLSTMAADDPRAFGRVDIRKYAHLLSKTDAKKFADMIAKSGGGSDGTSYAGMQSSLKLARPEFYKFDKDHEQTQAAVAAYTRHVDRFVAKEGRNPNDREKLEFAREVSRDVVTSAGNVWDSKEPVAVVLQEAREALEAGGSYSVGTGEGRDVIDAVRFAQLTEIIAQRDGRTPTDDEVLLEWLRLTGDVQ
Other Proteins in cluster: phalp2_31965
Total (incl. this protein): 3; Avg length: 845.3; Avg pI: 5.13

Protein ID  Length (AA)  pI
4UrxT       710          5.04494
80XpI       742          5.59474
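
The cluster summary counts three proteins, but the table lists only the two other members. A minimal sketch of how the averages can be reproduced, assuming they are plain arithmetic means over all three members (the page does not state the method) and taking this protein's length and pI from its physico-chemical properties:

```python
from statistics import mean

# (protein ID, length in AA, pI) for all three cluster members.
# The first row is the protein described on this page; the other two
# come from the cluster table. Using an unweighted mean is an assumption.
members = [
    ("A0AAF0JQ65", 1084, 4.75),
    ("4UrxT", 710, 5.04494),
    ("80XpI", 742, 5.59474),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"Avg length: {avg_length:.1f}")  # Avg length: 845.3
print(f"Avg pI: {avg_pi:.2f}")          # Avg pI: 5.13
```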
Similar Clusters (pHMM search)
#   Cluster       Representative  # Members  Identity (%)  Alignment Length  E-value
1   phalp2_30318  4xP4a           34         28.8%         559               8.696E-62
2   phalp2_19843  5sKQN           31         28.7%         671               9.975E-60
3   phalp2_1726   2rrT3           1          26.5%         715               3.032E-55
4   phalp2_27844  fq3d            11         27.1%         692               2.368E-54
5   phalp2_31894  4JkOt           8          26.7%         644               7.967E-53
6   phalp2_5719   4Jlyi           1          24.9%         758               1.584E-39
7   phalp2_2664   6WyWF           26         25.2%         530               2.562E-36
8   phalp2_35637  1oBtr           46         23.9%         764               1.286E-33
9   phalp2_541    xkAu            8          23.8%         758               1.608E-27
10  phalp2_13955  8JaZZ           1          27.3%         446               2.126E-27

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

       Name                             Taxonomy ID  Lineage
Phage  Paracoccus phage ParMal1 [NCBI]  3032416      Autographiviridae
Host   No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
OQ376858 [NCBI]
CDS location
range 3551 -> 6805
strand +
CDS
ATGGCCCAAGGTTTGGAGCGGAGGGCACAGCCCGTGGACAACCTTCGTGGTGCAGAGGCAGTTGAGCCTGTCACCCGGAGTGTCCCTAGCGTGGGTACACCGCCTACGCCAAGTGTCCCTGTGGAGAGCTATGGGGCACAGATCGCCCAGCAAGTCGCGCAGTTCGCGTCAGGCAAGCTGGCCGAGATACAGCAGAAGCGGCAAGAGCAGAGCATGATCGACGGTCAAATTGCAGCAATGCAAGGCGACAGCTTCGAGAGTGTTGAGATGGGTGGTGACAAGTGGGCGCTGGAAGGCTACCGCGTTGTCACCGCTCAGACCATGAGTGCTGGCCTGCTTCGCGCACAGGAACAAGAGATCGCAAGCGGTGCATACGAGCAAGACCCTGACGCCTACCGTCAGCGGTTGGTCGGTCGTATCGACGCCATGACCTCAGAGATCGGTGACGAGCGCACGCGGGAACTTGCCCGTGAGAACCTGATGCGGCAGATGCCGACACTGGTGGACGCTCACATGCGTCAGAACCTCGCATTCCGCGAGAAGCAGAACTTCGACGCGCTCGCACAGAGTGTGGACGTTCTGTCTCGGGACAACAGCAGCACGGGTGCCCTTGTGGCGTTCGCGACTGGTCAGAGCGAGGCCACTGCTGGTCTGAGCATCGAGCGCCGCCGCACAGCGGTTGTGCAGGGCGTTGTGAACGCCTTCGAGAACGACAACCCTGCTGCCTATGCACACTTGGAAGCCGCTGGCTTCTTCACGACCGAGAACCTGACAGCTTCCCAGCTTCGGACTATCCGGTCTGCGCAGGCGTCCTATCACTCCCGTATGGAAGGGACATGGAACGCAGAATGGCATGCAGAGAACACCCGTATCCAAGATGCAGTCCGCTCGGGCGATCTGGACCCGCTGGTCGCTGTTGAGCAGTACGCTGCCAACAATGCCACCCACGGTCGTCGGACAAGCGCCGCCAACGCAGGGCAGGTCTATGACGCTGCTCGGGCAGGTGTCGAGTTCGCAGAAGGGACACGGGGCCTGAACATCCAAGCGGCTGGTGCCGCAGGTGACTACGACCTCCAAGCACGTCTCATGCAGGCGGCTGTCATCCATCAAGAGAGCCGGGGCAACCCGAATGCAGTCTCACCCGTTGGTGCCACAGGCATCATGCAGCTTATGCCCGGTACAGCAATGTCACCCGGCTTCGGCGTCCGCAACATCTTTCAGGTGGCGCGGGACATGGGAGTGCCCGTCATGGGCGAGACTGAGGAAGTGGCTCAAATCCTGATGCGGAATGAGGAAGTCAACAAAGCGATGGGCACTGAGTACCTGTCCGCGATGTTGGAGCGTTACAACGGTGACGTACCTCGTGCACTTGCTGCCTACAATTGGGGAGCAGGGAACGCAGACAATTGGGATGGCGACATGGCCTCACTCCCAGCGGAGACCCGTGGCTACATCCGCAACATCACTGGATCGTGGGAAGACAACATCCCTGACCCACAGGCGGATCGGATCGCAGCAGAGCGTAACCTTCAGCGGGTGCGTGAGCAAGCTGCCCTGACTGCATACGAGAACGCGACACCGCAACTGGACATCCTTGATGAACAGTATCGTCGGGGTCAAATCCCGCATACGCAGTGGATCGAACAGAGCCGCGAGGTCATGGGCCAGTGGGGCATGGAAGTTGACATGCAACGCCTGAACCATGAGGCGCAGATCAACCGTTCCGTGATCCGTGGGATCGCAGATCGGGCCGAGACCGCTGCCGCAACTGAGAACGAGATCGCCTTGGCGGCTGACCTCAGAGTTGCAGAGGCCGAGTACACGCAGGTCGTAGAGGCAGTCCGTAACGGGACGATGGGGCAACCCGAGCTTCAGACGGCCATGCAGAACTTCATGCAAACAAGACGGAACCTCCATGAACAGTACGGCGTCCCACTGGACGCTGGCACCGAGATCGGACAACAGGACGCGGACGTGCGTGCAGCCATTGAAGCTGTGCG
GGTGGGTCGGGTTGCTGCCGAGGAACGTGCAGTTCGTGAGCGGGCCGCAGCGATGGGCACAGCCGGGGCACTCCCGGTCGAGCAACAGCAGCGACTTGTTCAGGAGAACGACGCCAGACTTCGCCAGCAGTACACCGATGCTGTGGCGGCTGGGCAGATCGACCCTCAACTCGCGGAGAGCGCGTTCCAGCAGGAGCGTATGCAGTTCTTTGCCGAGAGCGGGATCGTGCAGCCAACTCAGCGCCGTGTGATCAACGCGGGACTGGACCAAGAGCCTATGATCGACGGACAGGTGAACCCTGCCTTCCAAGAGGCGGTCCAAGCCTACATGACCATGCGGGATGTGAACCCTACCGTAGCAGATCGCTATATCGACCCTGAGAACCGGGCTGTACTCGACACCATCGTGGGTCGTGTGGGCACTGGTGGGAACGTGTCGGCGGCAATCTACGGTTACGCGAACCAGATGTCTCGTGCATCGACTTCGCCATACCGGACGCCTGACGAGTTCGTTCAGGACGCGGGTGTGCAGCGTCGTATCGCTTCGGCGGTCGATAACTACATCGCATCCTCGGACATCGGTATGTTCCAAGCTGTGTTCTGGTCTGACGCGGACGTGTCTCAAGTCTGGGACCGTCGATCCAGTTTCGATCTGGATGTACACCAAGACGCACTGCGCACTGCACTTGAACAAGAAGTGGCAGCAGGCTACCAGCACAATCCCAACCTCCGGGTTGCGGATCAGGTGGCTATGGCAGCAGAGCGGGTTCAACGCCGTTATGCCTTCGCAGGCGGTGACTTGATCGACGTTGGCCAAGGCAATGACGCCCACGAACTGTTCTTCGGGAACCGTGGGCCTGAGATGTCTGGTCAGCAAGACAGCATCAACGCAGCAATCATGCACTACCTTCGGAGCGACGAGTTCCGTGAGGAGTACCCTGACATTGAGAACAGCACAGGTGGAGAAATCCTCGGCACTGTTGGCTTGTGGAACGGGTTCCAAGGTGGCTTGAACTCTGTGTTCGGCACGAACTTCGATACCGAGACTTCTGGTCTTGGTCCTCGGGGTGCATTCAGCACACTGGTGACAGGTGTGAGACCTTTCCGGGTTATCAGTAACCCAATGGCAGGTGGGCAGACGCGCATCGCCATCGAGTATAACCTACCCGGTGGTGGCTTCAGCCAACCAATCGAGATCGACCCTGCCGACATCGGTGCACGGTATCTCGATCACATCCGTCAAAACAATCGTTGA
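
The CDS location can be cross-checked against the protein length reported above. A minimal sketch, assuming the range uses 1-based inclusive coordinates (the usual GenBank convention) and that the CDS includes the terminal stop codon (the sequence above ends in TGA); the function name is hypothetical:

```python
def cds_protein_length(start: int, end: int) -> int:
    """Residues encoded by a CDS given 1-based inclusive coordinates.

    Assumes the CDS includes one terminal stop codon, which is
    subtracted from the codon count.
    """
    n_nt = end - start + 1
    if n_nt <= 0 or n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} nt is not a positive multiple of 3")
    return n_nt // 3 - 1


# Range 3551 -> 6805 on the + strand: 3255 nt = 1085 codons, including the stop.
print(cds_protein_length(3551, 6805))  # 1084
```

Under these assumptions the result matches the 1084 AA protein length listed for this entry.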

Gene Ontology

GO ID       Description                                    Category            Evidence (source)
GO:0000270  peptidoglycan metabolic process                biological process  None (UniProt)
GO:0008933  peptidoglycan lytic transglycosylase activity  molecular function  None (UniProt)
GO:0016020  membrane                                       cellular component  None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4UrxT) rather than this protein.
PDB ID 4UrxT
Method AlphaFold v2
Resolution 71.79
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
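
The confidence legend above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities, so it does not say where scores of exactly 90, 70, or 50 belong; placing them in the lower band is an assumption made here, and the function name is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Boundary values (exactly 90, 70, 50) are assigned to the lower
    band; the legend itself leaves them undefined.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Illustrative scores only; these are not values taken from the model.
for example in (95.0, 80.0, 60.0, 30.0):
    print(example, plddt_band(example))
```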