Protein

Protein accession
G4W937 [UniProt]
Representative
7aZE9
Source
UniProt (cluster: phalp2_16534)
Protein name
Lytic transglycosylase
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTLPVALTVWALACGPAAPDYPVVGNRWSYDRMIVDAADQALLSRITARAVAHVETTFDHTKVSRENARGLFQILEKDQAYLVARYKVVGFQWDRPRDSATLGCLYMADLIKRFESKKHGIMAYNAGPGRMSKALRAQKETGQLELPEETAGYWPKVLSGRW
Physico‐chemical
properties
protein length:162 AA
molecular weight:18186,7 Da
isoelectric point:9,30
hydropathy:-0,27
Representative Protein Details
Accession
7aZE9
Protein name
7aZE9
Sequence length
162 AA
Molecular weight
18186,72450 Da
Isoelectric point
9,29790
Sequence
MTLPVALTVWALACGPAAPDYPVVGNRWSYDRMIVDAADQALLSRITARAVAHVETTFDHTKVSRENARGLFQILEKDQAYLVARYKVVGFQWDRPRDSATLGCLYMADLIKRFESKKHGIMAYNAGPGRMSKALRAQKETGQLELPEETAGYWPKVLSGRW
Other Proteins in cluster: phalp2_16534
Total (incl. this protein): 2 Avg length: 162,0 Avg pI: 9,30

Protein ID Length (AA) pI
7aZE9 162 9,29790
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14786
6WPO0
6 37,0% 116 1.011E-12
2 phalp2_17454
7Km5M
1 32,0% 153 1.247E-09
3 phalp2_20931
7dsYE
2 32,6% 153 7.909E-09
4 phalp2_4421
2GvBa
2 27,7% 137 4.086E-05
5 phalp2_6299
6TEAE
47 28,4% 130 6.173E-04
6 phalp2_21840
4nlmw
65 31,0% 129 8.337E-04

Domains

Domains [InterPro]
Disordered region
SLT
Representative sequence (used for alignment): 7aZE9 (162 AA)
Member sequence: G4W937 (162 AA)
1 162 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Tetrasphaera phage TJE1
[NCBI]
981335 Tijeunavirus > Tijeunavirus TJE1
Host Tetrasphaera jenkinsii
[NCBI]
330834 Actinobacteria > Actinobacteria > Micrococcales > Intrasporangiaceae > Tetrasphaera >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HQ225832 [NCBI]
CDS location
range 2552 -> 3040
strand +
CDS
ATGACTCTTCCCGTGGCGCTGACCGTCTGGGCCCTGGCATGTGGGCCCGCTGCCCCCGACTATCCCGTGGTGGGGAACCGATGGTCCTATGACCGGATGATCGTCGACGCGGCGGACCAAGCTCTGCTTTCCAGGATCACGGCTCGGGCGGTCGCCCACGTTGAAACCACGTTCGACCACACCAAGGTTAGCCGAGAGAACGCTCGAGGGCTTTTCCAGATCCTCGAGAAAGATCAGGCCTACCTGGTCGCCCGGTACAAGGTGGTCGGATTCCAGTGGGATCGGCCCCGGGACTCTGCAACCCTGGGATGCCTGTACATGGCCGACCTGATCAAGAGATTCGAGTCCAAGAAACACGGGATCATGGCCTACAACGCTGGACCCGGGCGCATGTCAAAGGCCTTGAGAGCCCAGAAAGAAACGGGCCAGCTCGAGCTTCCCGAGGAGACGGCCGGATACTGGCCCAAGGTTCTTTCCGGTCGGTGGTAG

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00022bd3a3_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7aZE9) rather than this protein.
PDB ID
7aZE9
Method AlphaFoldv2
Resolution 87.70
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50