Protein
- Protein accession
- O03937 [UniProt]
- Representative
- 7mF1o
- Source
- UniProt (cluster: phalp2_8208)
- Protein name
- Minor capsid protein
- Lysin probability
- 100%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MADGTVTIDVLMGTKSFMSDRERVENLLKTLGADAGNQMDEAFTNNSNKVQKKARETKKKIKNEFDSPIIIKLEAKAKEAGVKDFRKILNQIPRNQLTRLKAKSERDEVIDWKKEISRIPEKKSTRLKVDKKQASDDLTALKKQSESTEHSFSHLKEIVVGTFLGGAIQAGVQGLVTGLKDAAKAGMEYNKQQDMMRMNWHNLTTEAPKDGEELLTYINHVSQHSIYAADTIDKMAQSFYHVHSSEKETKKWTDDFVALGSTLHVSNDALKESGEQFAKIVAGGKTSAEDMSVMISRFPMFGEALQKATGKSMSQLYAMSAAGKLTSKQFTEALDYLGKKYRGSTEEAMNSFQGMSMYIKSRWSMLTGNIMASSFKMSKGVAQDMRNLLSDNMMKKYADLASTAISHVTGWLVELIKYVNAHKNTIVDIIGNLGKILGIIGKTVWKTFSDIVYDIAKMFGLVGEKAQESKDPLDKIDDALKNLSKNQELIENLTKAFIAMFALKKGMEFIGMLASLRKSLIETAAVSKMVDLFGGSGVTSAGGKAVTQTVAKEAGGTAATAGSSKVLGRLFAKGGATSTAELEAASGLGGGKAMMAARGLTKAVPYMSIAASIPELFGTTQKTLGKHLGGFAGSAGGPAAGAAAGSAVMPVVGTAVGGVIGGLAGSKLGQSVGGSIQKGITKSFPKLTSKMSDLGHDMAKKFSGSFKPKPSLNDKQFSKSYTSLTKTLNKQAKIKIKTDTSGISKAQKLTDTTYGKMKKSVDKYYGHKRQMSIKDYATLVQNGSMTEKEANKLLNKAKENYNKQAKAQKDNIGKMKKDSDSYYSKLGKAESQKNKDLAAARKKDGKNHEKYLADKKKIEKDFQTKTAGDRKKYLAQLAKDENKSNDAVTKATKISSGKQLDILENLKDHKGKLSKQQMTETIKNSAQERDKTIDNADKQRDKSVSAAKKKYKETVDAADKERYENGTMSRKQYEEVVDKARQQRDDSIDAADAQKKKTVKKAEETHTKVVDEATKQAGEHKGAVDSETGDVITFWGTFISTLRGDWNDMTGGINSILHALNKNWGNIPTWKKHAAGLNGSMGEHTALVGEEGFEYMGTSNGSIMPIGVEGPEIRNIPAGASILPHGMSVEFAQMAKDLPGYKIGLPGWLTSTFSALKKGAEGAADLVSEGASGVVNKIANATGIGKLAKTLNDNTTAFGAIASGAKDSLIDNAVKYVQGFFDQFSDTSEDGAGSLAPHFGSPFKESSGYGPRAGGFHKGIDFAAPLGTPIPAQYGGTVVQAGPASGFGNWVVIKPSGASVDTIYGHMKRMKVKTGQHVKAGQIIAWVGSEGQSSGPHVHYELRAGLGGKSYNPMTYGASAGNPCGHSVNRWRPYVVRALKANGFAATDSQVAAWMKVIKRESNGDPSVINTWDRNAQLGHPSKGLVQTIQPTFDAYKFKGHNNPLNGYDDLLAGIHYMKAIYGSGPSAFARVSGPMGYDSGGRVMKKQLAWLAENNPEYVVNPERDSADSLIVEAARARAAKAPNGLVAKAMRVVGTAKAGIQRTAPSFASRGVAQAEGQVAGNQAISGDLTITVPLDSNVLAQAVYPKAKVMQQRDITIQAKKGGLH
- Physico-chemical properties
- Protein length: 1608 AA
- Molecular weight: 172849,9 Da
- Isoelectric point: 9,57
- Hydropathy: -0,47
Representative Protein Details
- Accession
- 7mF1o
- Protein name
- 7mF1o
- Sequence length
- 746 AA
- Molecular weight
- 79730,12120 Da
- Isoelectric point
- 9,50040
- Sequence
-
MRILMLLEKKYLKSLSKDESKMNDSVTKATKISSGKQLDILENLKDHKGKLSKQQMTEAIKNSAKERDQTIKNAEKQRDNRVDKANEQYKKTVAAADKERYENGTMSRKQYDEVIKNARTQRDNAIDAADTQKKHTVKKAEETHEKVVTEATKQAGEHKGAVDSETGDVKGSWNEFIDNMRGIWNGMIGGINGVLHALNKKWGNIPTWKKHAAGLNGSMGEHTALVGEEGFEYMGTSDGSITPIGVEGPEIRNIPAGASILPHGMSVEFAQMAKGLPGYKFGLPGWLTSTFSALKKGADGAVDLVSEGASGVVNKIANATGLGKLAKTFNDNTTAFGAIASGAKDSLIDNAIKYVQGFFDQFSDTSEDGAGSLAPHFGSPFKVSSGYGPRAGGFHKGIDFAAPLGTPIPAQYGGTVVQAGPASGFGNWVVIKPSGASVDTIYGHMKRMKVKTGQHVKAGQIIAWVGSEGQSSGPHVHYELRAGLGGKSYNPMTYGASAGNPSGHSVNRWRPYVVRALKANGFAATDSQVAAWMKVIKRESNGDPSVINTWDRNAQLGHPSKGLVQTIQPTFDAYKFKDHNNPLNGYDDLLAGIRYMKAIYGSGPSAFARVSGPMGYDSGGRVMKKQLAWLAENNPEYVVNPERDSADSLIVEAARARAAKAPNGLVAKAMRVVGTAKAGIQRTAPSFASRGVAQAEGQVAGNQAISGDLTITVPLDSGVLAQAVYPRAKLMQQRDITIQAKKGGLH
Other Proteins in cluster: phalp2_8208
Total (incl. this protein): 13; Avg length: 867,6 AA; Avg pI: 9,43

| Protein ID | Length (AA) | pI |
|---|---|---|
| 7mF1o | 746 | 9,50040 |
| 1k4nU | 827 | 9,60793 |
| 1ru1d | 827 | 10,03091 |
| 1rudd | 825 | 10,03252 |
| 2YkbF | 1002 | 9,81107 |
| 2llzw | 603 | 9,45824 |
| 4hcim | 747 | 9,83389 |
| 5oeER | 627 | 9,54856 |
| 76ewD | 853 | 6,90153 |
| 7Rely | 754 | 9,81507 |
| 8HUqv | 765 | 9,83086 |
| uhc8 | 1095 | 8,64606 |
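
The summary figures can be reproduced from the table once the query protein itself (1608 AA, pI 9,57, taken from the physico-chemical properties above) is added to the 12 listed members. The following is a minimal sketch of that arithmetic; the variable names are illustrative, and decimal commas are written as points because Python requires it.

```python
# Lengths (AA) and pI values of the 12 cluster members listed in the table.
lengths = [746, 827, 827, 825, 1002, 603, 747, 627, 853, 754, 765, 1095]
pis = [9.50040, 9.60793, 10.03091, 10.03252, 9.81107, 9.45824,
       9.83389, 9.54856, 6.90153, 9.81507, 9.83086, 8.64606]

# Add the query protein (O03937): "Total (incl. this protein)" counts it too.
lengths.append(1608)
pis.append(9.57)  # only two decimals are reported for the query protein

avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)

print(f"{len(lengths)} proteins, avg length {avg_length:.1f}, avg pI {avg_pi:.2f}")
# 13 proteins, avg length 867.6, avg pI 9.43
```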
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_28053 (76jg0) | 1 | 39,3% | 683 | 1.131E-175 |
| 2 | phalp2_2694 (7805Z) | 1 | 27,5% | 618 | 1.453E-94 |
| 3 | phalp2_3009 (1FWiT) | 1 | 22,7% | 755 | 1.660E-78 |
| 4 | phalp2_14216 (2YkkB) | 4 | 27,4% | 715 | 1.387E-67 |
| 5 | phalp2_2359 (7DVi7) | 21 | 25,3% | 670 | 2.477E-63 |
| 6 | phalp2_1714 (2kVOd) | 3 | 27,5% | 560 | 2.111E-54 |
| 7 | phalp2_7599 (5FUza) | 2 | 25,6% | 506 | 2.338E-48 |
| 8 | phalp2_28873 (5i1Sw) | 11 | 25,4% | 476 | 9.569E-44 |
| 9 | phalp2_10032 (7dnml) | 5 | 25,4% | 618 | 5.055E-41 |
| 10 | phalp2_29846 (3Llpd) | 4 | 24,1% | 638 | 1.148E-39 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Lactobacillus phage phig1e [NCBI] | 52979 | No lineage information |
| Host | Lactobacillus [NCBI] | 1578 | Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > |
Coding sequence (CDS)
CDS Source ID
X98106 [NCBI]
CDS location
range 15416 -> 20242
strand -
CDS
ATGGCAGACGGAACAGTAACAATTGATGTGTTAATGGGTACCAAGTCGTTTATGAGTGACCGTGAACGGGTCGAAAACCTACTTAAGACGCTTGGAGCCGATGCTGGTAACCAAATGGACGAAGCCTTTACTAACAACTCTAACAAGGTACAGAAGAAAGCTAGAGAAACTAAGAAGAAAATTAAGAATGAATTTGATTCCCCAATTATTATTAAGTTGGAAGCTAAGGCGAAAGAAGCTGGCGTAAAAGATTTTAGAAAGATACTCAACCAGATTCCTAGAAATCAGTTAACACGTTTGAAAGCTAAGTCGGAACGCGACGAGGTTATAGACTGGAAAAAAGAAATCAGTCGCATTCCTGAAAAGAAGTCTACACGATTAAAAGTAGATAAGAAACAAGCTTCTGATGATTTAACTGCTTTGAAGAAGCAGTCGGAATCAACCGAGCATAGTTTCTCACACCTCAAAGAGATTGTTGTGGGAACATTTCTTGGTGGCGCGATTCAGGCTGGTGTTCAAGGCCTAGTGACTGGGTTAAAAGATGCTGCTAAAGCTGGTATGGAATATAACAAGCAGCAGGATATGATGCGCATGAACTGGCATAATCTAACGACTGAAGCACCTAAGGATGGTGAGGAACTATTAACATACATCAATCATGTTTCACAGCACTCTATTTACGCTGCCGATACTATTGATAAGATGGCGCAAAGTTTTTATCATGTCCATTCAAGTGAAAAAGAGACTAAAAAGTGGACTGATGATTTTGTAGCATTGGGATCAACACTGCATGTTTCAAATGATGCGTTAAAAGAATCCGGTGAGCAATTCGCAAAAATTGTAGCTGGTGGGAAAACATCGGCTGAAGATATGTCTGTTATGATTAGTCGCTTTCCAATGTTTGGTGAAGCTTTACAAAAGGCAACAGGGAAGTCAATGAGTCAGCTTTATGCGATGTCAGCTGCTGGAAAATTGACCTCAAAACAATTTACTGAAGCGCTGGATTATTTAGGCAAAAAATATAGAGGCAGTACCGAAGAGGCAATGAATAGTTTCCAAGGTATGTCAATGTATATAAAGTCGCGATGGTCAATGCTGACTGGTAATATCATGGCTTCATCTTTCAAAATGAGTAAGGGCGTTGCCCAAGATATGAGAAATTTATTATCTGACAATATGATGAAAAAGTATGCTGATTTAGCATCTACTGCAATTTCACATGTTACTGGATGGTTGGTTGAACTCATTAAATACGTTAATGCTCATAAAAATACAATTGTCGACATTATCGGAAATCTTGGCAAAATACTAGGCATCATTGGTAAAACTGTCTGGAAAACATTTAGCGACATAGTCTATGACATTGCAAAGATGTTTGGGCTGGTGGGCGAAAAGGCACAAGAATCTAAAGATCCACTAGACAAGATTGATGATGCTTTAAAGAACTTATCCAAGAACCAAGAGTTGATCGAGAACTTGACCAAAGCATTTATTGCGATGTTTGCACTCAAAAAAGGTATGGAGTTTATTGGCATGCTGGCAAGTTTGCGTAAGTCACTTATCGAAACGGCTGCTGTGTCTAAGATGGTTGATTTGTTCGGTGGTAGTGGCGTTACTAGTGCGGGCGGTAAGGCCGTTACTCAGACGGTTGCTAAAGAAGCCGGTGGAACTGCTGCTACGGCTGGTAGTTCTAAAGTTCTTGGACGTCTGTTTGCAAAAGGTGGCGCTACTTCAACGGCAGAACTTGAAGCGGCTAGTGGCCTAGGCGGTGGCAAAGCCATGATGGCTGCTCGTGGGCTCACTAAAGCTGTTCCATATATGAGCATTGCCGCTTCAATACCAGAGCTGTTTGGCACGACTCAGAAGACACTAGGTAAGCACTTGGGTGGGTTCGCTGGTTCGGCTGGTGGGCCTGCCGCGGGTGCTGCTGCCGGCTCTGCAGTTATGCCGGTCGTTGGGACTGCTGTTGGTGGTGTAATCGGTGGATTAGCAGGTAGTAA
GCTTGGCCAATCGGTGGGTGGCAGTATTCAAAAAGGCATTACCAAGAGCTTCCCTAAACTTACTAGTAAGATGTCTGATCTAGGCCATGATATGGCTAAGAAGTTCAGTGGTAGCTTCAAACCTAAGCCATCGCTAAATGATAAGCAATTTTCGAAATCATATACCTCACTGACGAAGACACTAAATAAACAGGCCAAAATAAAAATTAAGACCGACACTTCCGGCATCAGCAAGGCTCAGAAGCTCACTGATACAACGTATGGCAAGATGAAGAAGTCGGTCGACAAGTACTATGGTCACAAGCGTCAGATGTCTATCAAGGACTATGCAACGTTGGTTCAGAACGGTTCTATGACTGAAAAAGAGGCCAATAAGCTGCTAAACAAGGCCAAAGAGAACTACAACAAGCAGGCGAAAGCTCAGAAAGATAACATTGGGAAAATGAAAAAAGATTCCGATAGTTATTACTCGAAGCTTGGCAAGGCTGAATCACAAAAGAACAAAGACTTGGCTGCTGCCCGTAAGAAGGACGGCAAAAATCATGAAAAGTATTTAGCTGATAAAAAGAAAATCGAAAAGGACTTCCAAACCAAAACGGCCGGCGACCGTAAGAAGTATTTAGCTCAGCTAGCCAAGGATGAAAATAAATCGAATGATGCGGTTACAAAAGCAACTAAGATTTCATCTGGAAAGCAGCTCGATATTCTTGAAAACTTGAAAGACCACAAGGGCAAGCTGTCTAAGCAACAAATGACTGAAACAATTAAAAATTCAGCTCAAGAACGCGATAAGACCATTGATAACGCCGACAAGCAACGTGATAAGTCGGTTAGCGCGGCCAAAAAGAAGTACAAGGAAACAGTTGACGCTGCTGATAAGGAACGCTACGAGAACGGTACGATGAGCCGTAAGCAGTATGAAGAAGTTGTCGATAAAGCTAGACAACAACGCGACGACTCCATTGATGCTGCTGATGCTCAGAAGAAGAAGACCGTCAAGAAAGCGGAGGAAACGCACACTAAGGTCGTTGATGAAGCGACTAAGCAGGCTGGGGAGCATAAAGGTGCGGTTGATTCCGAAACCGGTGACGTCATTACTTTTTGGGGAACATTCATTTCCACCTTGCGTGGTGATTGGAATGATATGACGGGTGGCATTAACTCTATCTTGCATGCTTTAAATAAGAATTGGGGGAACATTCCTACTTGGAAAAAGCATGCCGCTGGTCTGAACGGTTCCATGGGCGAACATACGGCGCTCGTTGGTGAAGAAGGATTCGAATACATGGGAACGTCGAATGGTTCAATCATGCCAATTGGTGTCGAAGGACCTGAAATTCGTAACATTCCAGCGGGTGCGTCCATTTTGCCACATGGTATGTCCGTTGAGTTTGCTCAGATGGCTAAAGACTTGCCTGGGTACAAGATTGGATTGCCTGGTTGGTTAACCAGCACGTTCAGCGCTTTGAAGAAAGGTGCTGAGGGCGCTGCTGATCTTGTTAGCGAAGGTGCTAGTGGCGTGGTCAATAAGATTGCTAACGCAACTGGCATTGGTAAGCTTGCAAAGACGCTCAACGATAATACCACCGCGTTTGGCGCGATTGCGAGTGGGGCTAAGGACTCCTTGATTGATAATGCAGTCAAGTATGTACAAGGATTCTTTGATCAGTTCTCCGACACATCTGAAGATGGTGCTGGTTCATTAGCACCGCACTTTGGTTCACCGTTCAAGGAATCTTCGGGATATGGCCCACGTGCAGGTGGTTTCCACAAAGGTATCGACTTTGCGGCGCCATTAGGTACGCCGATCCCAGCTCAATATGGTGGTACTGTCGTGCAGGCAGGCCCAGCTAGTGGGTTCGGTAACTGGGTTGTTATCAAGCCGTCTGGTGCGTCCGTAGATACGATTTACGGACACATGAAACGAATGAAAGTGAAGACTGGTCAGCATGTCAAAGCCGGTCAAATTATTGCGTGGGTTGGTAGTGAAGGCCAATCAA
GTGGCCCACACGTCCATTATGAGTTGCGTGCTGGTTTGGGTGGTAAGAGCTATAACCCAATGACTTATGGCGCTAGTGCGGGTAACCCGTGTGGTCATTCAGTTAATCGCTGGCGACCATATGTTGTACGTGCATTAAAGGCCAATGGGTTCGCTGCTACCGACAGTCAAGTGGCTGCTTGGATGAAGGTTATCAAACGCGAGTCAAACGGGGACCCATCGGTGATTAACACTTGGGACCGTAACGCTCAACTTGGGCACCCTTCTAAAGGGCTCGTTCAGACGATTCAGCCAACATTTGATGCGTATAAGTTCAAAGGTCACAACAATCCGCTCAACGGGTATGACGACCTGCTAGCTGGTATTCACTACATGAAGGCCATTTATGGTTCAGGTCCAAGCGCGTTTGCTCGCGTGAGTGGCCCAATGGGTTACGATTCGGGTGGCCGTGTCATGAAGAAACAGCTAGCATGGTTGGCTGAAAATAACCCAGAATACGTGGTTAACCCAGAACGCGATAGTGCTGACAGCCTGATTGTTGAGGCGGCACGGGCACGCGCTGCTAAAGCGCCTAATGGCTTAGTTGCTAAGGCTATGCGAGTAGTTGGAACTGCTAAGGCAGGTATTCAACGCACAGCGCCAAGCTTTGCATCACGGGGCGTGGCACAGGCAGAAGGCCAAGTTGCCGGTAACCAAGCAATCAGTGGCGATTTGACAATCACTGTGCCATTAGATAGCAATGTATTGGCACAGGCGGTATATCCTAAGGCCAAAGTTATGCAGCAACGTGATATTACGATTCAAGCTAAGAAGGGAGGTTTGCATTAG
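
The CDS coordinates can be cross-checked against the reported protein length of 1608 AA. The sketch below assumes the range is 1-based and inclusive and that the last codon is a stop codon (the sequence above ends in TAG); the function name is hypothetical and not part of PhaLP.

```python
def protein_length_from_cds(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Return the encoded protein length for a 1-based, inclusive CDS range."""
    n_nt = end - start + 1  # number of nucleotides in the range
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    codons = n_nt // 3
    # The stop codon is part of the CDS but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


print(protein_length_from_cds(15416, 20242))  # 1608
```

The range spans 4827 nucleotides, or 1609 codons, which gives 1608 residues once the stop codon is excluded.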
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0004222 | metalloendopeptidase activity | molecular function | None (UniProt) |
| GO:0031640 | killing of cells of another organism | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000009c1f4_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
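
For readers who want to apply these bands to per-residue pLDDT values, the following is a minimal sketch. The legend uses strict inequalities and does not say where values of exactly 90, 70, or 50 belong; assigning them to the lower band is an assumption made here, and the function name is illustrative.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```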
The structures below correspond to the cluster representative (7mF1o) rather than this protein.