Protein
- Protein accession
- A0A291LA05 [UniProt]
- Representative
- 2g1Nw
- Source
- UniProt (cluster: phalp2_6881)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MRTLRHLLVMAVAMIASCASSAVYAQNVRTFVPPAAHKLLPELRTVQENIWPTAPMPSFMAAQIEQESCISLTHSKCWNTRSELKTSREYGFGLGQITIAYRADGSVRFNKFQELRQEYASLRGWAWENRYDAKYQLTAIVEMDKGIYGRVSDAATSTDRLAFTLSAYNGGESGLRQDRLLCKNTDGCDRSRWFGHVEHTSLKSKAKWQGYGASPFHINREYVDNVINVRRPKYEPFFEGRS
- Physico‐chemical
properties -
protein length: 242 AA molecular weight: 27591,0 Da isoelectric point: 9,30 hydropathy: -0,48
Representative Protein Details
- Accession
- 2g1Nw
- Protein name
- 2g1Nw
- Sequence length
- 317 AA
- Molecular weight
- 35837,71430 Da
- Isoelectric point
- 8,14366
- Sequence
-
MKIRYGVLLAAALMTVGCSKKVDQEVKQSEQLAEQVGANSDVQGEVLEAVKEQPVLQEIGINVPDVSMPPEPIPTPKPELPAPTPTPIATEQVHPPEEVKQPAKPLPKIPANAEKLMPDVIAAIDEVWPDMPMRSYFPAQIEQESCITLTHSKCWNPRAELKTSREYGFGLGQLTKAWRADGSLRFDAWAEVKTQHPSLRGWDWEDRYNPRLQIMAIVVKNKVNWGSIKWETADLDNKMAFLATFYNGGNPIKDRNLCLNTAGCDPTRWWGNVEKYSIKSKTPLKEYGNRSLFQISREYPVKVLRERRPKYVPYTGT
Other Proteins in cluster: phalp2_6881
| Total (incl. this protein): 30 | Avg length: 249,5 | Avg pI: 9,55 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 2g1Nw | 317 | 8,14366 |
| 7HL2g | 321 | 8,13818 |
| A0A2D0W9F2 | 246 | 9,19933 |
| A0A2D0W997 | 245 | 9,47152 |
| A9J597 | 241 | 9,70418 |
| I6WMZ3 | 241 | 9,85556 |
| A0A2D0W917 | 246 | 9,05608 |
| A0A2D0W9X7 | 246 | 9,05608 |
| A0A2K8HPC2 | 241 | 9,74293 |
| A0A0N9ERA0 | 241 | 9,91738 |
| A0A0U1VU02 | 241 | 9,74293 |
| A0A1B0Z008 | 241 | 9,74293 |
| A0A2D0W9N2 | 246 | 9,05608 |
| A0A411BAT0 | 241 | 9,81210 |
| A0A411BCU2 | 241 | 9,70457 |
| A0A6G9LKB0 | 241 | 9,85556 |
| A0AA96EUZ4 | 235 | 9,72952 |
| A0AA96TEQ2 | 241 | 9,74293 |
| A0AAU8KTL9 | 241 | 9,70470 |
| A0AAX4F912 | 225 | 9,29887 |
| A0AAX4M1Q2 | 241 | 9,60793 |
| A0AAX4QEA6 | 273 | 10,30019 |
| A0AAX4QFE1 | 273 | 10,22464 |
| A0AAX4RD23 | 241 | 9,59104 |
| A0AAX4RE17 | 241 | 9,70457 |
| A0AAX4RE52 | 273 | 10,07965 |
| A0AAX4RE79 | 241 | 9,60793 |
| A0AAX4RES5 | 241 | 9,91654 |
| A0AAX4RF16 | 241 | 9,60793 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_5482
3dIDs
|
497 | 54,5% | 218 | 8.417E-92 |
| 2 |
phalp2_18118
4dLEG
|
5 | 45,4% | 209 | 3.277E-89 |
| 3 |
phalp2_8655
459Vq
|
3 | 39,1% | 202 | 2.951E-52 |
| 4 |
phalp2_39610
6EkfA
|
1 | 28,4% | 232 | 1.249E-19 |
| 5 |
phalp2_6836
8s2Bn
|
9 | 27,1% | 232 | 1.952E-17 |
| 6 |
phalp2_414
7u6ty
|
1517 | 28,7% | 216 | 3.528E-17 |
| 7 |
phalp2_6350
7rL94
|
1 | 21,7% | 193 | 5.832E-09 |
Domains
Domains [InterPro]
1
317 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bordetella phage vB_BbrM_PHB04 [NCBI] |
2029657 | Phabquatrovirus > Phabquatrovirus PHB04 |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MF663786
[NCBI]
CDS location
range 89306 -> 90034
strand -
strand -
CDS
ATGCGCACGCTTCGGCACCTGCTCGTCATGGCGGTCGCAATGATCGCCTCCTGCGCGAGCTCGGCGGTGTACGCTCAGAACGTCCGCACGTTCGTCCCGCCGGCCGCGCACAAGCTGCTGCCGGAGCTGCGTACCGTTCAAGAGAACATCTGGCCGACGGCGCCCATGCCGTCATTCATGGCGGCGCAGATCGAGCAGGAATCCTGCATTTCGCTCACGCACTCGAAATGCTGGAACACGCGTTCCGAGCTCAAGACGAGCCGCGAGTACGGGTTCGGCCTGGGCCAGATCACCATTGCCTACCGAGCGGACGGATCCGTCCGCTTCAACAAGTTCCAGGAGCTCCGCCAGGAGTACGCCAGTCTGCGGGGCTGGGCCTGGGAGAATCGCTACGACGCCAAGTATCAGCTCACGGCCATCGTCGAGATGGACAAGGGCATCTACGGCCGCGTGAGCGACGCGGCGACCTCGACCGACCGTTTGGCGTTCACACTGTCGGCCTACAACGGCGGCGAGTCCGGCCTGCGCCAGGATCGCCTGCTGTGCAAGAACACCGACGGCTGCGACCGCAGTCGCTGGTTCGGCCACGTCGAGCACACCAGCCTCAAATCCAAGGCCAAATGGCAGGGCTACGGAGCCAGCCCATTCCACATCAACCGCGAGTACGTCGACAACGTGATCAACGTTCGGCGCCCCAAGTACGAGCCATTTTTCGAGGGGCGTTCGTGA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000be49fca_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(2g1Nw)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50