Protein

Protein accession
A0A873WEP0 [UniProt]
Representative
4UsSQ
Source
UniProt (cluster: phalp2_18275)
Protein name
Putative glycoside hydrolase
Lysin probability
82%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAESRTEETDERKPLTAEDLRDDNSSLSKFLNFARSRKGGRDELLSTLSRIEMVAKLLTSEGAESTLNAQKDGASPVEMASIEAKLDSIDRINKFRKSEAKEIIEKVYNNEEVTKEESSALVEAISSLGDDIKMLESSELISRESWHSAQINQIGNRNTGYGRRGDIASRFLMDINSTSGMKNSRSLERLKSVLRLTGMSDEGLTQLLNSDNGKKIMAKRVNSNSEEVEIAMEAVVRELEELKVINSRVETSSEMLDLDASAPELIDSLKDLGNRLDKMPQFFKDTKELHTIIEESEASGEVDRVKLNELLTRLDTDTTPAKTLYSLRKMNGQMDDMLVSQESIENSVSKGVLGSFKEKAEDFTGSELIKDGILGMIAGSVGINPVVVEGAMDMLTSAGSGALGYMAGRRGGRDATRGGRRGGRAGRGGRGLVGTSMAKMKGLTEVLNMKTLGKFLGKSLKFVPVIGTIVQGAMSIFDAFEGWDNASSITGKSEEALTTMDRVKAASASVMSGLTLGLVDVQTAFGMVDKAWNWITGWWDKITTFMSDLGGKLEEALKTLMEELWNKVTEFIPDSIKNLSDFEVPGLSDLSEKVSDKVEEVSTSVKESASEAWNEVTDASSRAYNFISNSFFGDDEDTEFPKRQVSKSTSEYLKKNLESDSSSSKVYNDRIAHLDTKSKLAQYAVVNQRLSGSDLSVFLKSISEDDLSKHKGKSVSELYDMYRSEAESVDKTFSSVSKTTDPVKLRADLTKHALKKNLDREELFKFREVLNRSDFSDIEGKSLNSIFKEVAVEAGLERFNSSSTDSVESSTDKMKTLLLDKNSSSFEREFSLMSHAAESGITGENLANFMGQMAHESGNFMDKDLTENYNGDPREYFKKYDGRKDLGNLNPGDGFKYRGRGYIQLTGKANYEKYGELLGIDLVNNPDLAADPNVASAIAIQYWKENDLNGKSVKQATRIINGGYNGLDDREVKTDKYREMMDSKSMTLASENKHIESNVNSSVKGVEAEAMKKVMDSQRGEKGPSIARTSTTPKVVQLPGNTATPVSSTPSSELMMIGAASIFS
Physico‐chemical properties
Protein length: 1064 AA
Molecular weight: 117047.0 Da
Isoelectric point: 5.14
Hydropathy: -0.54
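The record does not say how the hydropathy value is derived. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), is shown below. The function name `gravy` is hypothetical, and the input is only the first 15 residues of the sequence above, so the printed value is not expected to match the figure reported for the full protein.

```python
# Kyte-Doolittle hydropathy scale.
# Assumption: the page's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    # A KeyError here signals a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


# First 15 residues of A0A873WEP0, used only as a short illustration.
print(round(gravy("MAESRTEETDERKPL"), 2))
```

Passing the full 1064-residue sequence to the same function would give a value to compare against the hydropathy listed above, provided the assumption about the scale holds.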
Representative Protein Details
Accession
4UsSQ
Protein name
4UsSQ
Sequence length
333 AA
Molecular weight
36765.31460 Da
Isoelectric point
9.82706
Sequence
MLNNIIDGIGGAVGGLFEGIGNAAGGLASGVGSLLGGLPGSSQPSQNMQQSPPMDMGQPPNQMIDPKQFMVPRSDRFNRFNDMGIQRQPYNPNQDPMAGAQQLPRNPRQPSEIQRPIMNLGGQQRPKANPRAGLENPVFKDFLNTMPENVRDKFVNRGMTLSPQGLQKIFPNADPKIVKTLTDSSDLLKEYGIDTPQRMRHFLAQMGHESGNFKYLKELGSPGYFNKYEGRKSLGNTQPGDGARYKGRGIIQLTGRYNYKKYGDKLGIDLVNNPELASNPDVALRIAAQYWKDKGLNKLADNDDLRGITKRINGGHNGLRHRQKLYTNLGGMF
Other Proteins in cluster: phalp2_18275
Total (incl. this protein): 3; Avg length: 521.0 AA; Avg pI: 8.20

Protein ID   Length (AA)   pI
4UsSQ        333           9.82706
Q5ZGC9       166           9.64313
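The cluster summary can be reproduced from the values in this record. The sketch below assumes the averages are plain arithmetic means over all three members, including this protein (A0A873WEP0, 1064 AA, pI 5.14), which is not listed in the table.

```python
from statistics import mean

# (protein ID, length in AA, pI), copied from this record.
members = [
    ("A0A873WEP0", 1064, 5.14),   # this protein
    ("4UsSQ", 333, 9.82706),      # cluster representative
    ("Q5ZGC9", 166, 9.64313),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"Avg length: {avg_length:.1f} AA")  # Avg length: 521.0 AA
print(f"Avg pI: {avg_pi:.2f}")             # Avg pI: 8.20
```

Both results agree with the summary line, which supports the reading that the totals include this protein.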
Similar Clusters

No similar clusters were found for representative 4UsSQ.

Domains

Domains [InterPro]
Disordered region
Unannotated
Representative sequence (used for alignment): 4UsSQ (333 AA)
Member sequence: A0A873WEP0 (1064 AA)
Axis: 1 to 333 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                      Taxonomy ID   Lineage
Phage   Vibrio phage Va2 [NCBI]   2783668       No lineage information
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MW073017 [NCBI]
CDS location
range 38707 -> 41901
strand +
CDS
ATGGCTGAATCAAGAACTGAAGAAACTGATGAAAGAAAACCACTCACTGCTGAAGATTTAAGAGATGACAACTCTTCATTGTCGAAATTCTTGAACTTCGCTAGGTCTCGGAAAGGTGGTCGTGACGAACTTCTGTCCACCTTATCCAGAATTGAGATGGTAGCTAAACTCCTTACTTCTGAAGGTGCTGAGTCGACTCTCAATGCTCAAAAGGATGGTGCCAGTCCTGTGGAGATGGCATCTATCGAAGCTAAGTTAGACTCAATAGATAGAATCAATAAGTTCAGAAAGTCTGAAGCCAAAGAAATCATCGAGAAAGTTTACAACAACGAAGAGGTGACTAAAGAAGAAAGTTCTGCTCTCGTGGAGGCTATCTCTTCTCTCGGTGATGATATCAAAATGCTAGAAAGTAGCGAGCTCATCAGTCGTGAAAGCTGGCATAGTGCTCAGATAAATCAGATAGGTAACCGTAATACAGGTTATGGTCGTCGTGGTGATATAGCTTCTCGATTCTTGATGGATATTAACTCCACCTCAGGAATGAAGAATAGTAGATCACTTGAAAGATTAAAGTCTGTCTTACGATTGACTGGTATGAGTGATGAAGGTCTTACTCAGTTGTTGAACTCAGACAACGGTAAGAAAATAATGGCTAAGCGAGTGAACTCTAACTCTGAAGAAGTAGAGATTGCTATGGAAGCAGTCGTACGAGAGTTAGAAGAATTGAAAGTCATTAACTCTCGTGTAGAGACTTCATCTGAAATGCTTGACCTTGATGCATCTGCTCCAGAACTTATCGACTCTTTGAAGGACTTAGGTAATCGCTTAGATAAGATGCCTCAATTCTTCAAAGATACGAAGGAGTTACATACCATCATCGAGGAAAGTGAAGCTTCTGGTGAAGTTGATCGAGTAAAGTTGAACGAACTTCTAACTCGTTTAGATACTGACACCACTCCAGCTAAAACTTTGTACTCACTTCGAAAGATGAATGGTCAGATGGATGATATGCTAGTTTCTCAAGAATCAATTGAGAATAGTGTGTCAAAAGGTGTCTTAGGTAGCTTCAAAGAGAAGGCTGAGGACTTCACTGGCTCTGAGTTAATCAAAGATGGTATTCTTGGCATGATTGCAGGCAGCGTAGGCATCAACCCAGTTGTTGTCGAAGGTGCTATGGATATGCTAACGAGTGCAGGTAGTGGTGCGTTAGGTTACATGGCAGGTCGTCGTGGTGGTCGTGACGCAACGAGAGGTGGTAGAAGAGGTGGTAGAGCAGGTCGTGGTGGTCGAGGCTTAGTCGGTACTTCTATGGCTAAGATGAAAGGACTTACTGAAGTTCTCAATATGAAAACTCTAGGTAAGTTCCTTGGTAAGAGCTTGAAGTTCGTACCAGTAATCGGAACCATCGTACAAGGTGCTATGAGTATCTTCGATGCCTTCGAAGGTTGGGATAATGCATCCTCTATCACAGGTAAGTCAGAAGAAGCACTAACCACGATGGATAGGGTCAAGGCAGCTTCTGCAAGTGTGATGTCTGGTCTTACTCTAGGTCTTGTGGATGTTCAAACTGCCTTCGGTATGGTGGACAAAGCGTGGAACTGGATAACTGGTTGGTGGGATAAGATAACTACCTTCATGTCTGATCTAGGGGGTAAGTTAGAAGAAGCTTTGAAGACTCTGATGGAAGAACTCTGGAACAAGGTTACTGAATTCATCCCAGACTCTATCAAGAACTTATCTGACTTCGAAGTTCCTGGACTTAGTGACTTATCTGAGAAAGTATCTGACAAAGTTGAAGAAGTTAGCACGAGTGTTAAAGAATCAGCTTCAGAAGCTTGGAATGAGGTTACTGATGCATCTTCTCGTGCGTACAACTTCATCTCTAATAGTTTCTTCGGTGATGATGAAGATACTGAGTTCCCTAAGAGACAAGTATCTAAGTCTACCAGCGAGTACCTTAAGAAGAATCTAGAATCGGATAGCTCTTCTTCGAAAGTCTA
CAATGACCGAATCGCTCATCTAGATACTAAGTCCAAACTGGCTCAGTACGCAGTTGTCAATCAAAGATTGTCAGGTTCTGATCTATCTGTATTCTTGAAATCTATCAGTGAAGATGACCTCTCTAAGCATAAAGGCAAATCAGTTTCTGAGTTGTACGATATGTATAGAAGTGAAGCTGAGAGTGTTGACAAGACTTTCAGTTCTGTCTCCAAGACAACTGATCCAGTGAAGTTGCGTGCGGACTTGACGAAGCATGCTCTCAAGAAGAATCTAGATAGAGAAGAGTTATTCAAGTTCAGAGAAGTTCTGAATCGTTCTGATTTTTCTGATATTGAAGGCAAATCTCTAAATAGCATTTTCAAAGAAGTGGCAGTTGAAGCAGGTCTCGAGAGGTTCAATAGTTCTTCGACAGACTCTGTTGAGAGTTCCACTGATAAGATGAAGACGCTTCTCCTAGATAAGAACTCATCTTCATTCGAGCGTGAATTCTCTTTGATGAGTCATGCTGCTGAAAGTGGTATAACTGGTGAGAATCTTGCTAACTTTATGGGTCAAATGGCTCATGAGTCAGGCAACTTCATGGACAAGGATTTAACTGAGAACTATAACGGTGACCCTAGAGAGTACTTCAAGAAGTACGATGGTCGTAAAGACTTAGGGAACTTGAATCCAGGTGATGGTTTCAAATACCGAGGTAGAGGTTACATCCAGCTTACAGGTAAGGCGAACTACGAGAAATACGGAGAACTCCTTGGAATTGACCTAGTCAATAATCCTGATTTAGCTGCTGATCCTAACGTAGCTAGTGCAATCGCAATTCAGTACTGGAAAGAAAATGACTTGAACGGTAAGTCAGTCAAACAAGCTACCAGAATCATAAACGGTGGCTATAATGGATTGGACGATCGTGAAGTTAAGACTGACAAGTACAGAGAGATGATGGATTCTAAATCTATGACCTTAGCTAGTGAGAATAAGCATATTGAATCTAATGTGAATTCTTCTGTCAAAGGTGTAGAAGCAGAAGCAATGAAGAAAGTAATGGATAGCCAACGTGGAGAAAAAGGTCCTAGCATCGCAAGAACCTCTACGACACCTAAAGTGGTCCAGCTTCCTGGAAATACTGCTACTCCAGTCTCGTCTACTCCTTCATCGGAGTTGATGATGATAGGTGCAGCATCCATATTCTCTTAG
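The CDS location can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive, as in GenBank feature locations, and that the final codon is a stop codon (the sequence above ends in TAG). The helper name `cds_length` is hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide count of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


nt = cds_length(38707, 41901)
codons, remainder = divmod(nt, 3)
residues = codons - 1  # the last codon is the stop codon and is not translated

print(nt, codons, remainder, residues)  # 3195 1065 0 1064
```

The 1064 residues obtained this way match the protein length reported for A0A873WEP0.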

Gene Ontology

GO ID        Description                                 Category             Evidence (source)
GO:0004568   chitinase activity                          molecular function   None (UniProt)
GO:0006032   chitin catabolic process                    biological process   None (UniProt)
GO:0016998   cell wall macromolecule catabolic process   biological process   None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4UsSQ) rather than this protein.
PDB ID
4UsSQ
Method AlphaFoldv2
Resolution 68.52
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
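The confidence bands above can be written as a small lookup. This is a sketch only: the legend uses strict inequalities and does not say where a score of exactly 90, 70 or 50 belongs, so assigning those values to the lower band is an assumption. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend leaves those cases unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```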