Protein

Protein accession
B4UTR7 [UniProt]
Representative
7tJZH
Source
UniProt (cluster: phalp2_9312)
Protein name
p029
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDRAKFFAAVRSPLFAGKMSERQVQGVDAILDEAERRGTPLAHLAYMLATAFLETAKTMQPIAEYGKGAGRKYGVKGKYGQVPYGRGYVQLTWDSNYERADKELGLKGALLRDFNLAMRQDIAAKIMFVGMTEGWFTGKKLGDYIGGGRVDYVGARRYNQRHG
Physico‐chemical
properties
protein length:163 AA
molecular weight:18169,7 Da
isoelectric point:9,75
hydropathy:-0,44
Representative Protein Details
Accession
7tJZH
Protein name
7tJZH
Sequence length
105 AA
Molecular weight
11502,21040 Da
Isoelectric point
10,11214
Sequence
MDRAKFFAAVRTPLFAGKMSEKQVQGVGAILDEAERRGTPLAHLAYMLATAFLETAKTMQPIAEYGKGTGRKYGVKGKYGQVPYGRGYVQLTWTRTTSAPIRSLA
Other Proteins in cluster: phalp2_9312
Total (incl. this protein): 13 Avg length: 147,4 Avg pI: 9,15

Protein ID Length (AA) pI
7tJZH 105 10,11214
10CnM 129 9,39493
1c1em 254 9,20011
2eQ94 115 9,45153
4mun5 97 8,48792
4ucO3 99 9,05615
6FhTv 116 5,63942
83Iff 179 9,55726
QAFH 113 9,66015
ny7H 117 8,80008
A0A9X9JU61 216 9,72559
A0A9X9P276 213 10,05354
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21772
3Zpin
1 48,0% 102 1.033E-28
2 phalp2_14463
4HLl0
13 49,3% 81 2.670E-28
3 phalp2_12536
16dsz
27 38,2% 123 2.447E-27
4 phalp2_13642
5CmBQ
1 42,0% 100 6.676E-24
5 phalp2_40157
8frnL
3 33,3% 120 1.152E-22
6 phalp2_792
26XkJ
2 35,9% 64 2.545E-13
7 phalp2_3338
39QbB
3 43,2% 67 4.382E-12
8 phalp2_4388
2l7Fc
3 42,1% 64 9.434E-10

Domains

Domains [InterPro]
Unannotated
Disordered region
Representative sequence (used for alignment): 7tJZH (105 AA)
Member sequence: B4UTR7 (163 AA)
1 105 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhizobium phage 16-3 (Bacteriophage 16-3)
[NCBI]
10704 No lineage information
Host Sinorhizobium meliloti
[NCBI]
382 Proteobacteria > Alphaproteobacteria > Rhizobiales > Rhizobiaceae > Sinorhizobium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
DQ500118 [NCBI]
CDS location
range 19471 -> 19962
strand +
CDS
ATGGATCGCGCGAAATTCTTCGCGGCGGTGCGCTCGCCCCTGTTCGCCGGCAAGATGTCAGAGCGGCAGGTGCAGGGCGTAGACGCCATTCTAGACGAGGCAGAGCGGCGCGGCACACCGTTGGCGCATCTGGCCTACATGCTCGCCACGGCATTTCTCGAGACGGCCAAAACGATGCAGCCGATTGCCGAATACGGCAAGGGCGCTGGCCGCAAGTACGGCGTCAAAGGCAAGTACGGGCAGGTTCCCTATGGGCGCGGCTATGTCCAGCTAACGTGGGACTCGAACTACGAGCGCGCCGATAAGGAGCTTGGTCTGAAGGGCGCGCTGCTGCGCGACTTCAATCTCGCCATGCGCCAGGACATCGCGGCCAAAATTATGTTCGTCGGCATGACCGAGGGCTGGTTCACCGGCAAGAAGTTGGGCGACTACATCGGCGGCGGGCGTGTGGATTATGTCGGCGCACGCCGCTATAATCAACGGCACGGATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00017ba5c1_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7tJZH) rather than this protein.
PDB ID
7tJZH
Method AlphaFoldv2
Resolution 69.90
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50