Protein

Protein accession
O03937 [UniProt]
Representative
7mF1o
Source
UniProt (cluster: phalp2_8208)
Protein name
Minor capsid protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MADGTVTIDVLMGTKSFMSDRERVENLLKTLGADAGNQMDEAFTNNSNKVQKKARETKKKIKNEFDSPIIIKLEAKAKEAGVKDFRKILNQIPRNQLTRLKAKSERDEVIDWKKEISRIPEKKSTRLKVDKKQASDDLTALKKQSESTEHSFSHLKEIVVGTFLGGAIQAGVQGLVTGLKDAAKAGMEYNKQQDMMRMNWHNLTTEAPKDGEELLTYINHVSQHSIYAADTIDKMAQSFYHVHSSEKETKKWTDDFVALGSTLHVSNDALKESGEQFAKIVAGGKTSAEDMSVMISRFPMFGEALQKATGKSMSQLYAMSAAGKLTSKQFTEALDYLGKKYRGSTEEAMNSFQGMSMYIKSRWSMLTGNIMASSFKMSKGVAQDMRNLLSDNMMKKYADLASTAISHVTGWLVELIKYVNAHKNTIVDIIGNLGKILGIIGKTVWKTFSDIVYDIAKMFGLVGEKAQESKDPLDKIDDALKNLSKNQELIENLTKAFIAMFALKKGMEFIGMLASLRKSLIETAAVSKMVDLFGGSGVTSAGGKAVTQTVAKEAGGTAATAGSSKVLGRLFAKGGATSTAELEAASGLGGGKAMMAARGLTKAVPYMSIAASIPELFGTTQKTLGKHLGGFAGSAGGPAAGAAAGSAVMPVVGTAVGGVIGGLAGSKLGQSVGGSIQKGITKSFPKLTSKMSDLGHDMAKKFSGSFKPKPSLNDKQFSKSYTSLTKTLNKQAKIKIKTDTSGISKAQKLTDTTYGKMKKSVDKYYGHKRQMSIKDYATLVQNGSMTEKEANKLLNKAKENYNKQAKAQKDNIGKMKKDSDSYYSKLGKAESQKNKDLAAARKKDGKNHEKYLADKKKIEKDFQTKTAGDRKKYLAQLAKDENKSNDAVTKATKISSGKQLDILENLKDHKGKLSKQQMTETIKNSAQERDKTIDNADKQRDKSVSAAKKKYKETVDAADKERYENGTMSRKQYEEVVDKARQQRDDSIDAADAQKKKTVKKAEETHTKVVDEATKQAGEHKGAVDSETGDVITFWGTFISTLRGDWNDMTGGINSILHALNKNWGNIPTWKKHAAGLNGSMGEHTALVGEEGFEYMGTSNGSIMPIGVEGPEIRNIPAGASILPHGMSVEFAQMAKDLPGYKIGLPGWLTSTFSALKKGAEGAADLVSEGASGVVNKIANATGIGKLAKTLNDNTTAFGAIASGAKDSLIDNAVKYVQGFFDQFSDTSEDGAGSLAPHFGSPFKESSGYGPRAGGFHKGIDFAAPLGTPIPAQYGGTVVQAGPASGFGNWVVIKPSGASVDTIYGHMKRMKVKTGQHVKAGQIIAWVGSEGQSSGPHVHYELRAGLGGKSYNPMTYGASAGNPCGHSVNRWRPYVVRALKANGFAATDSQVAAWMKVIKRESNGDPSVINTWDRNAQLGHPSKGLVQTIQPTFDAYKFKGHNNPLNGYDDLLAGIHYMKAIYGSGPSAFARVSGPMGYDSGGRVMKKQLAWLAENNPEYVVNPERDSADSLIVEAARARAAKAPNGLVAKAMRVVGTAKAGIQRTAPSFASRGVAQAEGQVAGNQAISGDLTITVPLDSNVLAQAVYPKAKVMQQRDITIQAKKGGLH
Physico‐chemical
properties
protein length:1608 AA
molecular weight:172849,9 Da
isoelectric point:9,57
hydropathy:-0,47
Representative Protein Details
Accession
7mF1o
Protein name
7mF1o
Sequence length
746 AA
Molecular weight
79730,12120 Da
Isoelectric point
9,50040
Sequence
MRILMLLEKKYLKSLSKDESKMNDSVTKATKISSGKQLDILENLKDHKGKLSKQQMTEAIKNSAKERDQTIKNAEKQRDNRVDKANEQYKKTVAAADKERYENGTMSRKQYDEVIKNARTQRDNAIDAADTQKKHTVKKAEETHEKVVTEATKQAGEHKGAVDSETGDVKGSWNEFIDNMRGIWNGMIGGINGVLHALNKKWGNIPTWKKHAAGLNGSMGEHTALVGEEGFEYMGTSDGSITPIGVEGPEIRNIPAGASILPHGMSVEFAQMAKGLPGYKFGLPGWLTSTFSALKKGADGAVDLVSEGASGVVNKIANATGLGKLAKTFNDNTTAFGAIASGAKDSLIDNAIKYVQGFFDQFSDTSEDGAGSLAPHFGSPFKVSSGYGPRAGGFHKGIDFAAPLGTPIPAQYGGTVVQAGPASGFGNWVVIKPSGASVDTIYGHMKRMKVKTGQHVKAGQIIAWVGSEGQSSGPHVHYELRAGLGGKSYNPMTYGASAGNPSGHSVNRWRPYVVRALKANGFAATDSQVAAWMKVIKRESNGDPSVINTWDRNAQLGHPSKGLVQTIQPTFDAYKFKDHNNPLNGYDDLLAGIRYMKAIYGSGPSAFARVSGPMGYDSGGRVMKKQLAWLAENNPEYVVNPERDSADSLIVEAARARAAKAPNGLVAKAMRVVGTAKAGIQRTAPSFASRGVAQAEGQVAGNQAISGDLTITVPLDSGVLAQAVYPRAKLMQQRDITIQAKKGGLH
Other Proteins in cluster: phalp2_8208
Total (incl. this protein): 13 Avg length: 867,6 Avg pI: 9,43

Protein ID Length (AA) pI
7mF1o 746 9,50040
1k4nU 827 9,60793
1ru1d 827 10,03091
1rudd 825 10,03252
2YkbF 1002 9,81107
2llzw 603 9,45824
4hcim 747 9,83389
5oeER 627 9,54856
76ewD 853 6,90153
7Rely 754 9,81507
8HUqv 765 9,83086
uhc8 1095 8,64606
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28053
76jg0
1 39,3% 683 1.131E-175
2 phalp2_2694
7805Z
1 27,5% 618 1.453E-94
3 phalp2_3009
1FWiT
1 22,7% 755 1.660E-78
4 phalp2_14216
2YkkB
4 27,4% 715 1.387E-67
5 phalp2_2359
7DVi7
21 25,3% 670 2.477E-63
6 phalp2_1714
2kVOd
3 27,5% 560 2.111E-54
7 phalp2_7599
5FUza
2 25,6% 506 2.338E-48
8 phalp2_28873
5i1Sw
11 25,4% 476 9.569E-44
9 phalp2_10032
7dnml
5 25,4% 618 5.055E-41
10 phalp2_29846
3Llpd
4 24,1% 638 1.148E-39

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus phage phig1e
[NCBI]
52979 No lineage information
Host Lactobacillus
[NCBI]
1578 Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
X98106 [NCBI]
CDS location
range 15416 -> 20242
strand -
CDS
ATGGCAGACGGAACAGTAACAATTGATGTGTTAATGGGTACCAAGTCGTTTATGAGTGACCGTGAACGGGTCGAAAACCTACTTAAGACGCTTGGAGCCGATGCTGGTAACCAAATGGACGAAGCCTTTACTAACAACTCTAACAAGGTACAGAAGAAAGCTAGAGAAACTAAGAAGAAAATTAAGAATGAATTTGATTCCCCAATTATTATTAAGTTGGAAGCTAAGGCGAAAGAAGCTGGCGTAAAAGATTTTAGAAAGATACTCAACCAGATTCCTAGAAATCAGTTAACACGTTTGAAAGCTAAGTCGGAACGCGACGAGGTTATAGACTGGAAAAAAGAAATCAGTCGCATTCCTGAAAAGAAGTCTACACGATTAAAAGTAGATAAGAAACAAGCTTCTGATGATTTAACTGCTTTGAAGAAGCAGTCGGAATCAACCGAGCATAGTTTCTCACACCTCAAAGAGATTGTTGTGGGAACATTTCTTGGTGGCGCGATTCAGGCTGGTGTTCAAGGCCTAGTGACTGGGTTAAAAGATGCTGCTAAAGCTGGTATGGAATATAACAAGCAGCAGGATATGATGCGCATGAACTGGCATAATCTAACGACTGAAGCACCTAAGGATGGTGAGGAACTATTAACATACATCAATCATGTTTCACAGCACTCTATTTACGCTGCCGATACTATTGATAAGATGGCGCAAAGTTTTTATCATGTCCATTCAAGTGAAAAAGAGACTAAAAAGTGGACTGATGATTTTGTAGCATTGGGATCAACACTGCATGTTTCAAATGATGCGTTAAAAGAATCCGGTGAGCAATTCGCAAAAATTGTAGCTGGTGGGAAAACATCGGCTGAAGATATGTCTGTTATGATTAGTCGCTTTCCAATGTTTGGTGAAGCTTTACAAAAGGCAACAGGGAAGTCAATGAGTCAGCTTTATGCGATGTCAGCTGCTGGAAAATTGACCTCAAAACAATTTACTGAAGCGCTGGATTATTTAGGCAAAAAATATAGAGGCAGTACCGAAGAGGCAATGAATAGTTTCCAAGGTATGTCAATGTATATAAAGTCGCGATGGTCAATGCTGACTGGTAATATCATGGCTTCATCTTTCAAAATGAGTAAGGGCGTTGCCCAAGATATGAGAAATTTATTATCTGACAATATGATGAAAAAGTATGCTGATTTAGCATCTACTGCAATTTCACATGTTACTGGATGGTTGGTTGAACTCATTAAATACGTTAATGCTCATAAAAATACAATTGTCGACATTATCGGAAATCTTGGCAAAATACTAGGCATCATTGGTAAAACTGTCTGGAAAACATTTAGCGACATAGTCTATGACATTGCAAAGATGTTTGGGCTGGTGGGCGAAAAGGCACAAGAATCTAAAGATCCACTAGACAAGATTGATGATGCTTTAAAGAACTTATCCAAGAACCAAGAGTTGATCGAGAACTTGACCAAAGCATTTATTGCGATGTTTGCACTCAAAAAAGGTATGGAGTTTATTGGCATGCTGGCAAGTTTGCGTAAGTCACTTATCGAAACGGCTGCTGTGTCTAAGATGGTTGATTTGTTCGGTGGTAGTGGCGTTACTAGTGCGGGCGGTAAGGCCGTTACTCAGACGGTTGCTAAAGAAGCCGGTGGAACTGCTGCTACGGCTGGTAGTTCTAAAGTTCTTGGACGTCTGTTTGCAAAAGGTGGCGCTACTTCAACGGCAGAACTTGAAGCGGCTAGTGGCCTAGGCGGTGGCAAAGCCATGATGGCTGCTCGTGGGCTCACTAAAGCTGTTCCATATATGAGCATTGCCGCTTCAATACCAGAGCTGTTTGGCACGACTCAGAAGACACTAGGTAAGCACTTGGGTGGGTTCGCTGGTTCGGCTGGTGGGCCTGCCGCGGGTGCTGCTGCCGGCTCTGCAGTTATGCCGGTCGTTGGGACTGCTGTTGGTGGTGTAATCGGTGGATTAGCAGGTAGTAAGCTTGGCCAATCGGTGGGTGGCAGTATTCAAAAAGGCATTACCAAGAGCTTCCCTAAACTTACTAGTAAGATGTCTGATCTAGGCCATGATATGGCTAAGAAGTTCAGTGGTAGCTTCAAACCTAAGCCATCGCTAAATGATAAGCAATTTTCGAAATCATATACCTCACTGACGAAGACACTAAATAAACAGGCCAAAATAAAAATTAAGACCGACACTTCCGGCATCAGCAAGGCTCAGAAGCTCACTGATACAACGTATGGCAAGATGAAGAAGTCGGTCGACAAGTACTATGGTCACAAGCGTCAGATGTCTATCAAGGACTATGCAACGTTGGTTCAGAACGGTTCTATGACTGAAAAAGAGGCCAATAAGCTGCTAAACAAGGCCAAAGAGAACTACAACAAGCAGGCGAAAGCTCAGAAAGATAACATTGGGAAAATGAAAAAAGATTCCGATAGTTATTACTCGAAGCTTGGCAAGGCTGAATCACAAAAGAACAAAGACTTGGCTGCTGCCCGTAAGAAGGACGGCAAAAATCATGAAAAGTATTTAGCTGATAAAAAGAAAATCGAAAAGGACTTCCAAACCAAAACGGCCGGCGACCGTAAGAAGTATTTAGCTCAGCTAGCCAAGGATGAAAATAAATCGAATGATGCGGTTACAAAAGCAACTAAGATTTCATCTGGAAAGCAGCTCGATATTCTTGAAAACTTGAAAGACCACAAGGGCAAGCTGTCTAAGCAACAAATGACTGAAACAATTAAAAATTCAGCTCAAGAACGCGATAAGACCATTGATAACGCCGACAAGCAACGTGATAAGTCGGTTAGCGCGGCCAAAAAGAAGTACAAGGAAACAGTTGACGCTGCTGATAAGGAACGCTACGAGAACGGTACGATGAGCCGTAAGCAGTATGAAGAAGTTGTCGATAAAGCTAGACAACAACGCGACGACTCCATTGATGCTGCTGATGCTCAGAAGAAGAAGACCGTCAAGAAAGCGGAGGAAACGCACACTAAGGTCGTTGATGAAGCGACTAAGCAGGCTGGGGAGCATAAAGGTGCGGTTGATTCCGAAACCGGTGACGTCATTACTTTTTGGGGAACATTCATTTCCACCTTGCGTGGTGATTGGAATGATATGACGGGTGGCATTAACTCTATCTTGCATGCTTTAAATAAGAATTGGGGGAACATTCCTACTTGGAAAAAGCATGCCGCTGGTCTGAACGGTTCCATGGGCGAACATACGGCGCTCGTTGGTGAAGAAGGATTCGAATACATGGGAACGTCGAATGGTTCAATCATGCCAATTGGTGTCGAAGGACCTGAAATTCGTAACATTCCAGCGGGTGCGTCCATTTTGCCACATGGTATGTCCGTTGAGTTTGCTCAGATGGCTAAAGACTTGCCTGGGTACAAGATTGGATTGCCTGGTTGGTTAACCAGCACGTTCAGCGCTTTGAAGAAAGGTGCTGAGGGCGCTGCTGATCTTGTTAGCGAAGGTGCTAGTGGCGTGGTCAATAAGATTGCTAACGCAACTGGCATTGGTAAGCTTGCAAAGACGCTCAACGATAATACCACCGCGTTTGGCGCGATTGCGAGTGGGGCTAAGGACTCCTTGATTGATAATGCAGTCAAGTATGTACAAGGATTCTTTGATCAGTTCTCCGACACATCTGAAGATGGTGCTGGTTCATTAGCACCGCACTTTGGTTCACCGTTCAAGGAATCTTCGGGATATGGCCCACGTGCAGGTGGTTTCCACAAAGGTATCGACTTTGCGGCGCCATTAGGTACGCCGATCCCAGCTCAATATGGTGGTACTGTCGTGCAGGCAGGCCCAGCTAGTGGGTTCGGTAACTGGGTTGTTATCAAGCCGTCTGGTGCGTCCGTAGATACGATTTACGGACACATGAAACGAATGAAAGTGAAGACTGGTCAGCATGTCAAAGCCGGTCAAATTATTGCGTGGGTTGGTAGTGAAGGCCAATCAAGTGGCCCACACGTCCATTATGAGTTGCGTGCTGGTTTGGGTGGTAAGAGCTATAACCCAATGACTTATGGCGCTAGTGCGGGTAACCCGTGTGGTCATTCAGTTAATCGCTGGCGACCATATGTTGTACGTGCATTAAAGGCCAATGGGTTCGCTGCTACCGACAGTCAAGTGGCTGCTTGGATGAAGGTTATCAAACGCGAGTCAAACGGGGACCCATCGGTGATTAACACTTGGGACCGTAACGCTCAACTTGGGCACCCTTCTAAAGGGCTCGTTCAGACGATTCAGCCAACATTTGATGCGTATAAGTTCAAAGGTCACAACAATCCGCTCAACGGGTATGACGACCTGCTAGCTGGTATTCACTACATGAAGGCCATTTATGGTTCAGGTCCAAGCGCGTTTGCTCGCGTGAGTGGCCCAATGGGTTACGATTCGGGTGGCCGTGTCATGAAGAAACAGCTAGCATGGTTGGCTGAAAATAACCCAGAATACGTGGTTAACCCAGAACGCGATAGTGCTGACAGCCTGATTGTTGAGGCGGCACGGGCACGCGCTGCTAAAGCGCCTAATGGCTTAGTTGCTAAGGCTATGCGAGTAGTTGGAACTGCTAAGGCAGGTATTCAACGCACAGCGCCAAGCTTTGCATCACGGGGCGTGGCACAGGCAGAAGGCCAAGTTGCCGGTAACCAAGCAATCAGTGGCGATTTGACAATCACTGTGCCATTAGATAGCAATGTATTGGCACAGGCGGTATATCCTAAGGCCAAAGTTATGCAGCAACGTGATATTACGATTCAAGCTAAGAAGGGAGGTTTGCATTAG

Gene Ontology

Description Category Evidence (source)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000009c1f4_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7mF1o) rather than this protein.
PDB ID
7mF1o
Method AlphaFoldv2
Resolution 63.45
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50