Protein
- Protein accession
- Q5ULJ3 [UniProt]
- Representative
- 8MDGi
- Source
- UniProt (cluster: phalp2_15152)
- Protein name
- Orf121
- Lysin probability
- 77%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MVVNQGDTLSELASKYGTTSDKIASDNNLSNPNLIFVGDKLEVGSVKEENTQPTKVTTSTEANNTVASSYSQPEKASSAEQPQASAKDTQSSMSSSNNTDTDTNVSTSNGTLSDSEAQAVASQMASRTGQSASYWKGIMWRESNNQVHVANPSSSARGLFQLLYGGTGSVQEQINNAVSLYEKQGTQAWALTN
- Physico‐chemical
properties -
protein length: 193 AA molecular weight: 20430,7 Da isoelectric point: 4,64 hydropathy: -0,70
Representative Protein Details
- Accession
- 8MDGi
- Protein name
- 8MDGi
- Sequence length
- 133 AA
- Molecular weight
- 13893,71670 Da
- Isoelectric point
- 4,94280
- Sequence
-
MSQPVQSSSSSASQATQSSQSSVQATQSVSSTSQSSYTGSSVTTSNGTLADSEKQAVLSQMQSRTGVPASTWDAIITRESNWQVDAANPSSTARGLFQQLYGGTGSVQSQIDNAVKLYHNAGDSMSPWALTNY
Other Proteins in cluster: phalp2_15152
| Total (incl. this protein): 7 | Avg length: 151,0 | Avg pI: 5,79 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8MDGi | 133 | 4,94280 |
| 7zOtp | 133 | 6,52792 |
| 8LlSX | 141 | 7,98449 |
| A0A3S7UP64 | 133 | 4,94280 |
| A0A2K9VCL2 | 133 | 6,52792 |
| A0A2H4PB86 | 191 | 4,94337 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_34143
2kZPk
|
47 | 29,2% | 106 | 5.020E-07 |
| 2 |
phalp2_26606
8tBUU
|
34 | 29,8% | 114 | 1.274E-06 |
Domains
Domains [InterPro]
1
133 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Salchichonvirus LP65 [NCBI] |
298338 | Herelleviridae > Salchichonvirus > |
| Host |
Lactobacillus plantarum [NCBI] |
1590 | Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > Lactobacillus > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
AY682195
[NCBI]
CDS location
range 101790 -> 102371
strand +
strand +
CDS
GTGGTAGTCAATCAAGGAGATACCTTATCAGAACTAGCTAGTAAGTATGGTACAACCTCAGATAAGATTGCAAGTGATAACAACCTAAGCAATCCTAACCTAATCTTTGTTGGCGATAAACTAGAAGTAGGCTCTGTTAAAGAAGAAAACACGCAACCAACTAAGGTAACTACGTCTACCGAAGCTAATAATACGGTGGCTAGTTCGTATAGTCAACCGGAAAAAGCTAGCTCCGCTGAACAGCCACAAGCATCTGCAAAAGATACTCAGAGCTCTATGAGTTCATCTAATAATACTGACACTGATACTAACGTAAGTACTTCTAACGGAACATTGTCAGATAGTGAAGCACAAGCAGTAGCTAGCCAAATGGCGTCACGTACTGGTCAAAGTGCTTCATACTGGAAGGGGATCATGTGGCGGGAATCAAATAACCAAGTACACGTTGCTAACCCGTCGTCAAGTGCTAGAGGATTGTTCCAATTACTCTATGGTGGCACAGGTAGCGTTCAGGAACAAATTAACAATGCTGTAAGTTTGTATGAAAAGCAAGGTACGCAAGCGTGGGCTTTGACAAACTAA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000045459d_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(8MDGi)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50