Protein

Protein accession
U3PIV2 [UniProt]
Representative
7POyD
Source
UniProt (cluster: phalp2_40626)
Protein name
Uncharacterized protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MADGVVTIDLKFPANESGFKSDVAQVEEILKKMGSGTGNQAVANFNRKANDLKTIANEAGGQVSKSVNSAGKQAGNQMQKNFASTASEVEKAAERVAGNVSTTATTMGKHAGVEMQSNFGSNSNDVKKSASGMGQDVDRTAKGMGKGAGDEMAGSARKNIGEVKNAFGSLGDYIKGSFIGSALEQATERVADFFKDMVTQATESFDALKSYTSTMKFAGFDTSEIKKGEEELKEYSKETIYNVGEMSNTAATLAANGVKNYIDVTKALGNLVAVAGGSTQDMQSASLALTQMVGAGKMYAGDWNQFINGIPGASGKLKKALKDAGAYTGNFKDAMSKGQISAQEMIAAIEKLGNTQIAKKAATDTSKFSVAWQGAQESVQDGVLALMDSLGTSGFTGAISAAGDSAYNALASIGKWISKHKDEIATLGKKLGYIKDNLEEIGGEIGQGFIQFFKDSYKWISKVIDKTDDGNDALDDLGLALDKVAKNKGTLRAIGKGLAAITTAALGLKALKGVAGLVTTLVKPFARLISLFMSFSGGPIILAIVAIGAAFVLAYKHIKPFRDFVNAIGKKAGSAFRAVSRAVSKAFKRISKAMAPIVKEFKKAWGQLLKFLEALWKDISAVVVGALKILFVVMSPLLLVVIGLVKLAMITIKALIKGAMGFIKTIWHTTWKAIGTILKTAWNVIRDIFKYELKVITDILKLGTDILKGNWNGVWKDIKKIFSDAWNGMKSIVRDIWTGIKEYVADGVNGVIDLINGMITAINKVWNFFGGKGGINKLSHVHFATGGQLGSDGSVMAIVNDDGSPDPRELIQRKDGTLQMYQDRNAKTIINPGDKVYNSQQTKEIFNSVGVHYAKGNVGGDIWSGVKSFFGGVTDKLKNAIEWLKHPLQNTAKLIKSATDSFMSVLPDSFKNLAGSMIGKMTSMISSKFKKLIQGYKDDNEDGGGSVGNPGGAGVMRWKSYVAKALKANGIEPTGYRVSKILATIQRESGGNPRAINLWDSNAKAGIPSKGLMQTIGPTFNAYKFAGHGNIYNGYDNLLAAINYIKHRYGTSDAAFARVAASGYAKGGVIDREQMALIGEGNRTEFVVPNPSVAGPSRTYEMIGRAAAYASQTDGGNSSLMNGNALKLVERKLDALIDYNATQLEELKKPMRSYILQSDIYKGYNEQQKLNDMRGFFVR
Physico‐chemical
properties
protein length:1179 AA
molecular weight:126111,2 Da
isoelectric point:9,52
hydropathy:-0,15
Representative Protein Details
Accession
7POyD
Protein name
7POyD
Sequence length
370 AA
Molecular weight
40412,47970 Da
Isoelectric point
9,43729
Sequence
MIQRKDGTLQMYQERNAKTIINPGDKVYNSKQTKEIFDSVGVHYAKGNVGGSIWDGVKSFFGGVADKLKDAVEWLKHPLQNTAKLIKGATDSFMSILPDNFKNLAGSMIGKMTSMISDKLKKLIQGYKDDNEDGGGSVGNPGGAGVMRWKSYVAKALEANGIEPTGYRVSKILATIQRESGGNPRAINLWDSNAKAGIPSKGLMQTIDPTFNAYKLAGHGNIYNGYDNLLAAINYIKHRYGTSDAAFARVAAYGYAKGGVIDREQMALIGEGNRTEFVVPNPSVAGPARTYEMIGRAAAYASQAGGGSGSIMNENALKLVERKLDSLIDYNATQLEELKKPMRSYVLQSDIYKGYNEQRKINDMRGFFVR
Other Proteins in cluster: phalp2_40626
Total (incl. this protein): 9 Avg length: 456,2 Avg pI: 7,67

Protein ID Length (AA) pI
7POyD 370 9,43729
5HE5L 376 6,60608
6Ylll 259 7,16486
6ZK9V 441 6,64569
70jpX 357 6,83667
854Ok 263 6,98133
HvAB 501 6,40964
Q38352 360 9,39609
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_33337
6BBGM
4 39,4% 238 1.270E-56
2 phalp2_35192
6deMH
4 30,4% 338 1.450E-48
3 phalp2_13621
5tXtb
6 29,7% 299 1.088E-40
4 phalp2_39595
6vRSd
2 36,0% 244 6.822E-40
5 phalp2_10744
3mWmf
5 36,2% 254 1.346E-34
6 phalp2_11380
3UBu
3 32,0% 259 3.535E-31
7 phalp2_16319
5iojU
2 31,7% 274 2.373E-26
8 phalp2_25683
4z9jR
3 32,9% 246 2.227E-23
9 phalp2_26052
79GVR
1 27,4% 364 4.534E-21

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus phage phiJB
[NCBI]
1399941 No lineage information
Host Lactobacillus delbrueckii
[NCBI]
1584 Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > Lactobacillus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KF188409 [NCBI]
CDS location
range 29120 -> 32659
strand +
CDS
ATGGCTGATGGAGTCGTAACAATCGACTTAAAATTCCCGGCTAACGAATCGGGATTTAAATCCGATGTTGCCCAGGTCGAAGAGATCCTTAAAAAGATGGGGTCGGGCACTGGCAATCAAGCGGTGGCTAACTTCAATAGAAAAGCCAACGACCTGAAAACTATCGCCAACGAGGCGGGTGGCCAAGTCTCTAAGAGTGTAAATAGCGCTGGTAAACAAGCGGGTAACCAGATGCAAAAGAATTTTGCATCTACCGCAAGCGAGGTGGAAAAGGCCGCAGAGAGAGTGGCAGGAAACGTGTCAACTACTGCCACTACCATGGGCAAGCATGCCGGGGTAGAGATGCAGAGCAACTTTGGCTCTAACTCCAACGACGTCAAAAAGTCGGCGTCAGGTATGGGCCAAGATGTCGATCGGACTGCCAAGGGCATGGGTAAAGGCGCTGGCGATGAGATGGCCGGATCTGCACGAAAGAACATTGGCGAGGTAAAAAACGCCTTTGGCTCACTTGGGGATTACATCAAGGGTTCCTTTATCGGTAGTGCACTGGAGCAAGCAACCGAAAGGGTAGCTGATTTCTTTAAAGACATGGTCACCCAGGCGACTGAATCCTTTGACGCCTTAAAGAGCTACACGTCAACCATGAAGTTTGCCGGCTTTGACACGTCCGAAATCAAAAAGGGCGAGGAAGAACTTAAAGAATATTCCAAGGAAACCATCTACAACGTGGGCGAAATGTCCAATACCGCGGCTACTTTGGCCGCCAACGGCGTTAAAAACTATATTGACGTTACTAAGGCCCTGGGTAACCTTGTCGCCGTAGCGGGTGGTAGTACCCAGGACATGCAATCGGCATCTCTGGCCTTAACTCAAATGGTTGGGGCTGGCAAGATGTATGCCGGGGACTGGAACCAATTCATTAACGGCATTCCTGGTGCTTCTGGCAAGCTTAAGAAGGCGCTGAAAGATGCAGGGGCATATACTGGCAACTTCAAAGATGCCATGTCTAAGGGCCAAATTTCGGCCCAAGAAATGATCGCGGCAATCGAGAAACTAGGCAACACCCAGATCGCTAAAAAGGCTGCCACGGACACAAGCAAGTTCTCGGTCGCTTGGCAAGGAGCACAAGAATCAGTCCAGGACGGTGTCTTGGCACTGATGGATAGCTTGGGGACCAGCGGCTTCACCGGAGCAATTTCGGCAGCTGGTGACAGTGCCTATAATGCTCTAGCGTCGATCGGTAAATGGATTTCCAAGCACAAGGACGAAATAGCCACGCTGGGGAAGAAACTAGGCTATATCAAAGACAACCTTGAAGAAATTGGCGGCGAAATTGGCCAAGGATTTATCCAGTTCTTTAAAGACAGCTACAAGTGGATCAGCAAAGTCATTGACAAAACGGATGATGGTAATGATGCACTGGATGATCTGGGACTGGCTCTGGACAAAGTCGCCAAAAATAAGGGCACTCTCCGGGCAATCGGCAAAGGCCTGGCGGCAATTACGACGGCGGCACTGGGGCTTAAAGCACTCAAAGGCGTAGCCGGGCTGGTAACGACCTTGGTTAAACCTTTTGCAAGACTAATCAGTCTCTTTATGTCCTTTAGTGGTGGCCCGATTATTCTGGCGATTGTCGCCATTGGTGCGGCATTCGTGCTGGCTTACAAGCATATCAAGCCATTTCGAGACTTTGTCAATGCTATCGGCAAGAAGGCCGGGTCCGCCTTTAGAGCGGTGTCTAGAGCCGTTTCTAAGGCATTTAAGCGGATTTCAAAGGCTATGGCCCCGATCGTCAAGGAATTTAAAAAGGCTTGGGGCCAATTGCTCAAGTTCCTTGAAGCACTCTGGAAAGATATCTCTGCTGTGGTCGTAGGCGCGCTTAAGATCCTATTCGTAGTTATGTCGCCCCTGCTCCTCGTTGTCATCGGCCTGGTGAAATTAGCCATGATCACGATTAAAGCCCTTATAAAGGGCGCCATGGGCTTTATAAAGACCATTTGGCACACAACTTGGAAGGCGATTGGAACCATTCTAAAAACGGCTTGGAACGTTATTAGAGACATCTTTAAGTATGAACTTAAAGTCATCACTGATATCCTTAAGCTTGGAACAGACATTCTTAAGGGCAACTGGAATGGTGTTTGGAAAGACATCAAGAAAATCTTTAGCGACGCCTGGAACGGCATGAAGTCGATTGTCCGTGATATTTGGACTGGGATCAAAGAATATGTTGCTGACGGTGTCAATGGCGTAATCGACCTCATCAACGGTATGATCACAGCAATCAACAAAGTTTGGAACTTCTTTGGCGGCAAAGGGGGCATCAACAAGCTCAGTCATGTCCACTTTGCAACAGGTGGCCAACTGGGCAGCGATGGGTCGGTTATGGCTATTGTCAATGACGACGGCAGTCCTGACCCGCGGGAGTTGATCCAGCGCAAGGACGGCACCTTGCAGATGTATCAAGACCGCAATGCCAAGACTATCATCAACCCAGGCGACAAGGTCTACAATTCCCAGCAAACCAAAGAAATCTTCAACTCCGTCGGCGTGCACTATGCCAAGGGCAATGTCGGCGGGGATATCTGGAGCGGCGTTAAGTCGTTCTTTGGTGGCGTAACGGATAAGCTTAAAAACGCTATTGAGTGGCTCAAGCATCCCTTGCAGAACACTGCCAAGCTGATAAAGAGCGCTACTGATTCCTTTATGTCAGTTTTGCCGGATAGCTTTAAGAATCTGGCAGGGTCGATGATCGGCAAGATGACCAGCATGATCTCCAGCAAGTTTAAAAAACTGATCCAGGGTTATAAGGACGACAACGAAGATGGCGGTGGTAGTGTTGGCAACCCAGGCGGTGCTGGCGTTATGCGCTGGAAGTCATATGTTGCTAAGGCCCTTAAGGCTAACGGTATCGAACCGACTGGCTATCGTGTTAGCAAGATCCTGGCAACTATCCAGCGTGAGTCGGGTGGTAATCCAAGAGCAATTAACCTTTGGGACAGCAACGCAAAAGCTGGTATTCCGTCTAAAGGCTTGATGCAAACAATCGGACCAACTTTTAACGCTTACAAGTTTGCAGGTCATGGCAATATCTACAACGGATACGACAACCTATTGGCGGCCATCAACTACATTAAGCATAGATATGGGACATCTGACGCCGCTTTTGCGCGTGTTGCCGCAAGTGGCTATGCAAAAGGCGGCGTTATTGATCGGGAGCAAATGGCTCTAATCGGTGAAGGAAACCGCACAGAATTTGTGGTTCCAAATCCTTCGGTTGCAGGCCCATCACGCACCTACGAGATGATTGGGCGAGCTGCAGCTTATGCTAGTCAGACAGACGGCGGTAACAGCTCTTTGATGAACGGAAACGCCTTGAAGCTCGTAGAGCGCAAGCTTGATGCGCTGATTGATTACAACGCAACCCAACTGGAAGAACTGAAAAAGCCAATGAGATCTTACATTTTGCAAAGTGACATCTACAAGGGCTACAACGAACAGCAGAAGCTCAACGATATGCGCGGATTTTTCGTCAGATAG

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0003b01e2a_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7POyD) rather than this protein.
PDB ID
7POyD
Method AlphaFoldv2
Resolution 56.50
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50