Protein

Protein accession: A0A223W052 [UniProt]
Representative: 7csIv
Source: UniProt (cluster: phalp2_15037)
Protein name: TtsA-like Glycoside hydrolase family 108 domain-containing protein
Lysin probability: 100%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence:
MTDRFEICHAITAKWEGGWSDHPADPGGKTMYGITEKRWHEYQDKLKVKRTPVRNVTKAQALAFYRSEFWLACGADKLFPGVDLAVHDGSVNSGVSRGRKWLLASAGSNDHSETVKKICRARLSFMQSLAIWKTFGNGWGRRVADIEARGVAMALAAMGLSPSQVSGKIKTEAAKSAQQASSAKKAATTSATAASAPAAAPVVEPSTVTDATTVWILVAIVAAGAVATVIFIAKKRAADARVEAYNEVAA
Physico-chemical properties
Protein length: 250 AA
Molecular weight: 26620.1 Da
Isoelectric point: 9.68
Hydropathy: -0.10
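The page does not state how the hydropathy value is computed. A common choice is the GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), and the sketch below assumes that definition. The function name `gravy` is hypothetical, and the result for this protein may differ from the listed value if the database uses another scale.

```python
# Kyte-Doolittle hydropathy index per residue (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the page's "hydropathy" is a GRAVY-style average; the
    source does not confirm which scale it uses.
    """
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X, B, Z).
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # Short illustrative peptide taken from the start of the protein sequence.
    print(round(gravy("MTDRFEICHAITAKWEGGW"), 2))
```

Negative values indicate an overall hydrophilic sequence, positive values an overall hydrophobic one.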
Representative Protein Details
Accession: 7csIv
Protein name: 7csIv
Sequence length: 86 AA
Molecular weight: 9739.94010 Da
Isoelectric point: 8.93069
Sequence:
MSTAKFRRCHDVTKAWEGGWSDHPADPGGKTMYGLTEAVFHAWLRQQRKPVRPVRQITAAEAEQIYFEQYWVPSGGPTISPPTMPP
Other Proteins in cluster: phalp2_15037
Total (incl. this protein): 3; Avg length: 134.7 AA; Avg pI: 9.62

Protein ID  Length (AA)  pI
7csIv       86           8.93069
3aHYl       68           10.24121
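The cluster averages can be reproduced from the three members (this protein plus the two listed above). The snippet below is a minimal check using only values shown on this page; the variable names are illustrative.

```python
from statistics import mean

# (protein ID, length in AA, isoelectric point), as listed on this page.
members = [
    ("A0A223W052", 250, 9.68),      # this protein
    ("7csIv", 86, 8.93069),         # cluster representative
    ("3aHYl", 68, 10.24121),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

# Rounded the same way as the summary line (one and two decimals).
print(f"Avg length: {avg_length:.1f} AA, Avg pI: {avg_pi:.2f}")
```

Both rounded values agree with the summary line for cluster phalp2_15037.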
Similar Clusters (pHMM search)
#  Cluster       Representative  # Members  Identity (%)  Alignment Length  E-value
1  phalp2_16609  esP7            11         33.8          68                2.959E-17
2  phalp2_11557  12S8B           2          37.9          58                7.627E-13
3  phalp2_15722  3iCcu           63         36.1          72                1.227E-10
4  phalp2_21282  1uMBM           118        33.3          66                1.686E-10
5  phalp2_18063  3IzUl           9          25.3          75                7.044E-08
6  phalp2_33765  15NtY           387        28.4          88                2.510E-07
7  phalp2_19635  4kzEN           2          26.7          86                5.547E-05
8  phalp2_3878   6ORZl           656        25.0          84                5.547E-05
9  phalp2_15477  82FB3           2          33.3          54                3.724E-04

Domains

Domains [InterPro]
Disordered region
Unannotated
Representative sequence (used for alignment): 7csIv (86 AA)
Member sequence: A0A223W052 (250 AA)
Axis: positions 1 to 86 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                                 Taxonomy ID  Lineage
Phage  Agrobacterium phage Atu_ph08 [NCBI]  2024265      Roslyckyvirus > Roslyckyvirus ph08
Host   No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MF403009 [NCBI]
CDS location: range 40486 -> 41238, strand +
CDS
ATGACTGACCGATTCGAGATTTGCCACGCCATCACGGCTAAGTGGGAAGGTGGATGGAGTGACCATCCCGCGGACCCCGGCGGCAAAACCATGTATGGCATCACCGAAAAGCGCTGGCACGAATATCAGGACAAGCTGAAGGTCAAGCGGACGCCGGTGCGCAACGTCACCAAGGCGCAAGCCCTCGCGTTCTACCGCAGCGAATTCTGGCTCGCCTGCGGAGCTGACAAGCTATTCCCCGGTGTTGATCTGGCTGTACACGACGGGTCGGTAAACTCCGGTGTTTCTCGTGGTCGCAAATGGCTGCTTGCATCCGCCGGCAGTAACGATCACAGCGAGACGGTGAAGAAAATCTGCCGCGCTCGCCTTTCCTTCATGCAGTCACTCGCGATCTGGAAAACGTTCGGCAATGGCTGGGGGCGTCGTGTTGCTGATATCGAAGCGCGTGGCGTTGCCATGGCGCTCGCAGCGATGGGGCTTTCTCCTTCGCAGGTCAGCGGGAAGATCAAGACAGAAGCGGCCAAATCGGCCCAGCAGGCCAGCTCGGCAAAGAAAGCGGCCACCACAAGCGCCACCGCGGCGTCAGCGCCAGCCGCCGCGCCCGTTGTCGAGCCTTCCACCGTGACGGACGCAACAACCGTCTGGATCCTCGTCGCCATTGTGGCGGCCGGTGCCGTTGCCACCGTTATCTTTATCGCCAAGAAGCGCGCCGCCGATGCCCGCGTTGAGGCCTACAACGAGGTGGCAGCATGA
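The CDS coordinates and the protein length can be cross-checked: an inclusive range of 40486 to 41238 spans 753 nt, which is 251 codons (250 residues plus the stop codon). The sketch below shows that arithmetic and a small translation helper. It assumes the standard genetic code; the helper names `cds_length` and `translate` are hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    cds = cds.strip().upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    nt = cds_length(40486, 41238)
    print(nt, nt // 3 - 1)                    # nucleotides, residues without stop
    print(translate("ATGACTGACCGATTCGAG"))    # first six codons of the CDS above
```

The first six codons translate to MTDRFE, matching the start of the protein sequence listed for A0A223W052. Passing the full CDS string to `translate` would allow a complete comparison.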

Gene Ontology

GO ID       Description  Category            Evidence (source)
GO:0016020  membrane     cellular component  None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: upi000bb9f1a4_model
Method: AlphaFold3 (non-commercial)
Resolution: -
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
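The confidence bands above can be expressed as a small lookup. The sketch below is illustrative: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities on both sides, assigning the exact boundary values (90, 70, 50) to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band; the legend itself leaves these cases unspecified.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 10.0):
        print(score, plddt_band(score))
```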

The structures below correspond to the cluster representative (7csIv) rather than this protein.
PDB ID: 7csIv
Method: AlphaFold v2
Resolution: 61.77
Chain position: -