Protein

Protein accession
A0A2D0W917 [UniProt]
Representative
2g1Nw
Source
UniProt (cluster: phalp2_6881)
Protein name
Putative endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGKSLIHLLLALCAWAFGSLSPALAQDVRTFVPSGAEVYAPMLVEKQRAIWPAAPEPWTLAGLVEQESCISLTHSRCWNPRAELRTSREYGFGFGQITVAYNAXGTVRFNKFEELRAAHESLRAWSWANRYDPGYQLTAVVEMNLDLWRRVAAAPGATVTDQWAFVLASYNGGLGSVLQDRRLCSNTRGCDPARWFGHVENTSLKSRTPQPGYGGRSWFDINRSHVSNVINLRRSKYEPFWEATWP
Physico‐chemical
properties
protein length:246 AA
molecular weight: Da
isoelectric point:9,06
hydropathy:
Representative Protein Details
Accession
2g1Nw
Protein name
2g1Nw
Sequence length
317 AA
Molecular weight
35837,71430 Da
Isoelectric point
8,14366
Sequence
MKIRYGVLLAAALMTVGCSKKVDQEVKQSEQLAEQVGANSDVQGEVLEAVKEQPVLQEIGINVPDVSMPPEPIPTPKPELPAPTPTPIATEQVHPPEEVKQPAKPLPKIPANAEKLMPDVIAAIDEVWPDMPMRSYFPAQIEQESCITLTHSKCWNPRAELKTSREYGFGLGQLTKAWRADGSLRFDAWAEVKTQHPSLRGWDWEDRYNPRLQIMAIVVKNKVNWGSIKWETADLDNKMAFLATFYNGGNPIKDRNLCLNTAGCDPTRWWGNVEKYSIKSKTPLKEYGNRSLFQISREYPVKVLRERRPKYVPYTGT
Other Proteins in cluster: phalp2_6881
Total (incl. this protein): 30 Avg length: 249,5 Avg pI: 9,55

Protein ID Length (AA) pI
2g1Nw 317 8,14366
7HL2g 321 8,13818
A0A2D0W9F2 246 9,19933
A0A2D0W997 245 9,47152
A9J597 241 9,70418
I6WMZ3 241 9,85556
A0A2D0W9X7 246 9,05608
A0A2K8HPC2 241 9,74293
A0A0N9ERA0 241 9,91738
A0A291LA05 242 9,29752
A0A0U1VU02 241 9,74293
A0A1B0Z008 241 9,74293
A0A2D0W9N2 246 9,05608
A0A411BAT0 241 9,81210
A0A411BCU2 241 9,70457
A0A6G9LKB0 241 9,85556
A0AA96EUZ4 235 9,72952
A0AA96TEQ2 241 9,74293
A0AAU8KTL9 241 9,70470
A0AAX4F912 225 9,29887
A0AAX4M1Q2 241 9,60793
A0AAX4QEA6 273 10,30019
A0AAX4QFE1 273 10,22464
A0AAX4RD23 241 9,59104
A0AAX4RE17 241 9,70457
A0AAX4RE52 273 10,07965
A0AAX4RE79 241 9,60793
A0AAX4RES5 241 9,91654
A0AAX4RF16 241 9,60793
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5482
3dIDs
497 54,5% 218 8.417E-92
2 phalp2_18118
4dLEG
5 45,4% 209 3.277E-89
3 phalp2_8655
459Vq
3 39,1% 202 2.951E-52
4 phalp2_39610
6EkfA
1 28,4% 232 1.249E-19
5 phalp2_6836
8s2Bn
9 27,1% 232 1.952E-17
6 phalp2_414
7u6ty
1517 28,7% 216 3.528E-17
7 phalp2_6350
7rL94
1 21,7% 193 5.832E-09

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 2g1Nw (317 AA)
Member sequence: A0A2D0W917 (246 AA)
1 317 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Bordetella phage LK3
[NCBI]
1926943 Mesyanzhinovviridae > Vojvodinavirus > Bordetella virus CN1
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KX961385 [NCBI]
CDS location
range 9608 -> 10348
strand +
CDS
ATGGGCAAAAGCCTGATCCACCTCCTCCTCGCCCTTTGCGCCTGGGCCTTCGGCTCCCTGAGTCCGGCCCTCGCTCAGGACGTTCGCACCTTCGTCCCGTCCGGGGCGGAGGTGTACGCGCCTATGCTGGTCGAAAAGCAGCGCGCCATTTGGCCCGCCGCGCCGGAGCCCTGGACCCTCGCGGGTCTGGTGGAACAGGAATCGTGTATCAGCCTCACCCACTCGCGGTGCTGGAACCCGCGCGCGGAGCTTCGGACCTCGCGTGAGTACGGCTTCGGCTTCGGGCAGATCACGGTCGCCTACAACGCCMACGGCACGGTCCGGTTCAACAAGTTCGAGGAGCTGCGAGCCGCCCACGAATCGCTGCGCGCCTGGTCTTGGGCGAACCGCTATGACCCGGGCTACCAACTCACCGCGGTGGTGGAAATGAACCTGGACCTTTGGCGACGGGTTGCCGCGGCCCCGGGCGCGACGGTCACGGATCAGTGGGCCTTCGTCCTCGCCAGCTACAACGGGGGCTTGGGCTCCGTCCTCCAGGACCGCCGCCTTTGCTCCAACACCCGCGGATGCGATCCGGCCCGCTGGTTCGGTCATGTCGAGAACACCAGCCTGAAGTCCCGGACTCCGCAGCCCGGGTACGGCGGGCGGTCCTGGTTCGACATCAACCGCAGCCACGTGAGCAACGTGATCAACCTCCGCCGGTCCAAATATGAACCCTTCTGGGAGGCAACATGGCCCTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000c3aaf7f_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2g1Nw) rather than this protein.
PDB ID
2g1Nw
Method AlphaFoldv2
Resolution 78.37
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50