Protein
- Protein accession
- A0A6J5Q6G1 [UniProt]
- Representative
- 1CwQg
- Source
- UniProt (cluster: phalp2_700)
- Protein name
- Transglycosylase SLT domain 1
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MRPQIRYLLQLLGFGLVFTISLQTLTAAPATPIVPVERPLLETIDAKAVARELLTAKQFKCLDRLFVKESQWNPKSDNPKSTAVGIGQLLDGTYKNLGMKHSEAEVPQLVAALAYISRHYGSGGPCAAWAHFKAKHWY
- Physico‐chemical
properties -
protein length: 138 AA molecular weight: 15290,6 Da isoelectric point: 9,48 hydropathy: -0,05
Representative Protein Details
- Accession
- 1CwQg
- Protein name
- 1CwQg
- Sequence length
- 71 AA
- Molecular weight
- 7807,63460 Da
- Isoelectric point
- 10,00764
- Sequence
-
GKESAWQDKDNPRSSASGVGQLLKGTYKNLGMRKGESRVSQTVAALSYIGRKYGSGGPCAAWNHHKKHNWY
Other Proteins in cluster: phalp2_700
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_5586
47BtV
|
1 | 72,0% | 43 | 7.150E-21 |
| 2 |
phalp2_8731
8mbTL
|
30 | 39,4% | 71 | 6.740E-12 |
| 3 |
phalp2_25853
5nNHC
|
19 | 37,5% | 72 | 6.740E-12 |
| 4 |
phalp2_13710
6xCR8
|
339 | 39,1% | 74 | 1.515E-09 |
| 5 |
phalp2_22403
x9gn
|
10 | 35,2% | 71 | 3.942E-09 |
| 6 |
phalp2_23290
5bWSM
|
1 | 34,4% | 61 | 6.951E-08 |
| 7 |
phalp2_25867
5vKf9
|
6 | 36,0% | 61 | 1.316E-07 |
| 8 |
phalp2_23380
5Kh9K
|
1 | 31,9% | 72 | 1.690E-06 |
| 9 |
phalp2_11782
870N1
|
4 | 35,2% | 68 | 1.579E-05 |
Domains
Domains [InterPro]
1
71 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
uncultured Caudovirales phage [NCBI] |
2100421 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
LR796960
[NCBI]
CDS location
range 10994 -> 11410
strand -
strand -
CDS
ATGCGCCCACAGATCAGATACCTGCTTCAACTTCTCGGATTCGGCCTAGTGTTTACAATCTCTCTGCAGACCTTGACAGCCGCACCGGCGACACCGATTGTGCCTGTCGAGCGACCTCTGCTGGAGACGATCGACGCGAAGGCCGTGGCTCGGGAGTTGCTTACCGCCAAACAGTTCAAGTGCCTAGACCGTCTTTTCGTCAAGGAGAGCCAGTGGAATCCGAAATCGGACAATCCCAAGAGCACTGCTGTGGGCATCGGCCAGCTTCTTGATGGCACTTACAAAAATCTCGGCATGAAACACTCTGAGGCCGAAGTCCCGCAGTTGGTTGCGGCTTTGGCTTACATCTCGCGGCATTACGGCTCGGGCGGCCCCTGTGCCGCATGGGCGCACTTCAAGGCAAAACACTGGTACTAG
CDS Source ID
CDS Source
LR797313
[NCBI]
CDS location
range 24421 -> 24837
strand -
strand -
CDS
ATGCGCCCACAGATCAGATACCTGCTTCAACTTCTCGGATTCGGCCTAGTGTTTACAATCTCTCTGCAGACCTTGACAGCCGCACCGGCGACACCGATTGTGCCTGTCGAGCGACCTCTGCTGGAGACGATCGACGCGAAGGCCGTGGCTCGGGAGTTGCTTACCGCCAAACAGTTCAAGTGCCTAGACCGTCTTTTCGTCAAGGAGAGCCAGTGGAATCCGAAATCGGACAATCCCAAGAGCACTGCTGTGGGCATCGGCCAGCTTCTTGATGGCACTTACAAAAATCTCGGCATGAAACACTCTGAGGCCGAAGTCCCGCAGTTGGTTGCGGCTTTGGCTTACATCTCGCGGCATTACGGCTCGGGCGGCCCCTGTGCCGCATGGGCGCACTTCAAGGCAAAACACTGGTACTAG
CDS Source ID
CDS Source
LR797410
[NCBI]
CDS location
range 10137 -> 10553
strand -
strand -
CDS
ATGCGCCCACAGATCAGATACCTGCTTCAACTTCTCGGATTCGGCCTAGTGTTTACAATCTCTCTGCAGACCTTGACAGCCGCACCGGCGACACCGATTGTGCCTGTCGAGCGACCTCTGCTGGAGACGATCGACGCGAAGGCCGTGGCTCGGGAGTTGCTTACCGCCAAACAGTTCAAGTGCCTAGACCGTCTTTTCGTCAAGGAGAGCCAGTGGAATCCGAAATCGGACAATCCCAAGAGCACTGCTGTGGGCATCGGCCAGCTTCTTGATGGCACTTACAAAAATCTCGGCATGAAACACTCTGAGGCCGAAGTCCCGCAGTTGGTTGCGGCTTTGGCTTACATCTCGCGGCATTACGGCTCGGGCGGCCCCTGTGCCGCATGGGCGCACTTCAAGGCAAAACACTGGTACTAG
CDS Source ID
CDS Source
LR798392
[NCBI]
CDS location
range 17421 -> 17837
strand -
strand -
CDS
ATGCGCCCACAGATCAGATACCTGCTTCAACTTCTCGGATTCGGCCTAGTGTTTACAATCTCTCTGCAGACCTTGACAGCCGCACCGGCGACACCGATTGTGCCTGTCGAGCGACCTCTGCTGGAGACGATCGACGCGAAGGCCGTGGCTCGGGAGTTGCTTACCGCCAAACAGTTCAAGTGCCTAGACCGTCTTTTCGTCAAGGAGAGCCAGTGGAATCCGAAATCGGACAATCCCAAGAGCACTGCTGTGGGCATCGGCCAGCTTCTTGATGGCACTTACAAAAATCTCGGCATGAAACACTCTGAGGCCGAAGTCCCGCAGTTGGTTGCGGCTTTGGCTTACATCTCGCGGCATTACGGCTCGGGCGGCCCCTGTGCCGCATGGGCGCACTTCAAGGCAAAACACTGGTACTAG
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(1CwQg)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50