Protein

Protein accession
A0A6J7W7A2 [UniProt]
Representative
3ACsn
Source
UniProt (cluster: phalp2_32911)
Protein name
TtsA-like Glycoside hydrolase family 108 domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTDFDEAFTRLIGNEGGYTNNPADPGGETKFGISKRAYPDVEIASLTMDQAKALYQRDFWNPLADAHPAIKFQVFDFAVNAGMQTAIRKLQSAIGVADDGHWGPTSAGTLAQMDVNDVLMLFAAERLNFYTSLTKWDAFGKGWTRRIANDLKLAASDN
Physico-chemical properties
protein length: 158 AA
molecular weight: 17371.3 Da
isoelectric point: 4.91
hydropathy: -0.31
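The page does not say how the hydropathy value is derived. A common choice is the GRAVY score (mean Kyte-Doolittle hydropathy per residue), so the sketch below assumes that method; the helper name `gravy` is hypothetical and not part of any PhaLP API.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed method)."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # Raises KeyError on non-standard residue codes such as X or B.
    return sum(KD[aa] for aa in seq) / len(seq)
```

Under this definition a negative score means the sequence is hydrophilic on average, which agrees with the sign of the value listed above.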
Representative Protein Details
Accession
3ACsn
Protein name
3ACsn
Sequence length
116 AA
Molecular weight
13121.77320 Da
Isoelectric point
8.66057
Sequence
MTIDFDTAFTRLLGHEGGYTNNPADPGGETNWGICKRSYPLLDIKNLTREQAAGIYYKDFWQPLADAHPAIRFQVFDFAAFRRPLENCSRLSKSLTMAIGGQSARRLWPPGMSMTY
Other Proteins in cluster: phalp2_32911
Total (incl. this protein): 4; Avg length: 147.3 AA; Avg pI: 6.01

Protein ID    Length (AA)   pI
3ACsn         116           8.66057
A0A6J5KJS1    158           5.62265
A0A6J7WV29    157           4.84293
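The summary line counts four members while the table lists only the three others. Adding this protein's own values (158 AA, pI 4.91, taken from its physico-chemical properties) appears to reproduce the reported averages. A quick check, with `cluster_averages` as a hypothetical helper:

```python
# (protein ID, length in AA, pI) for all four cluster members.
# The first three rows come from the table; the last row is this protein,
# whose pI is only given to two decimals on the page.
members = [
    ("3ACsn", 116, 8.66057),
    ("A0A6J5KJS1", 158, 5.62265),
    ("A0A6J7WV29", 157, 4.84293),
    ("A0A6J7W7A2", 158, 4.91),
]


def cluster_averages(rows):
    """Return (mean length, mean pI) over the given member rows."""
    n = len(rows)
    avg_len = sum(length for _, length, _ in rows) / n
    avg_pi = sum(pi for _, _, pi in rows) / n
    return avg_len, avg_pi


avg_length, avg_pi = cluster_averages(members)
print(f"{avg_length:.2f} AA, pI {avg_pi:.2f}")
```

The mean length comes out at 147.25 AA, which the page shows rounded to one decimal.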
Similar Clusters (pHMM search)
#    Cluster                 # Members   Identity (%)   Alignment Length   E-value
1    phalp2_25028 (ZsUq)     3460        50.4           101                3.486E-29
2    phalp2_38550 (1LK0Y)    12          59.4           79                 1.693E-28
3    phalp2_33765 (15NtY)    387         49.4           89                 2.327E-21
4    phalp2_37851 (4LyyS)    5845        41.0           112                5.484E-20
5    phalp2_29127 (77Hso)    2           49.3           75                 5.006E-19
6    phalp2_23908 (1Le4q)    257         40.5           111                1.075E-16
7    phalp2_6321 (72lJ7)     149         39.0           110                5.209E-16
8    phalp2_12946 (360Ik)    1926        38.3           107                1.113E-13
9    phalp2_25836 (5fx9S)    68          34.8           112                1.726E-11
10   phalp2_31261 (8tSYR)    473         37.8           111                1.142E-10

Domains

Domains [InterPro]
GH108
Disordered region
Representative sequence (used for alignment): 3ACsn (116 AA)
Member sequence: A0A6J7W7A2 (158 AA)
[Domain architecture figure, axis 1 to 116 AA (representative). Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis. Legend: EAD, CBD, Linker, Disordered, Unannotated.]
Pfam accessions: PF05838

Taxonomy

        Name                                   Taxonomy ID   Lineage
Phage   uncultured Caudovirales phage [NCBI]   2100421       No lineage information
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
LR798196 [NCBI]
CDS location
range 34289 -> 34765
strand +
CDS
ATGACTGACTTTGACGAAGCGTTCACCCGGCTGATCGGAAACGAAGGCGGCTACACCAACAACCCGGCAGACCCGGGTGGTGAGACAAAGTTCGGAATTTCTAAGCGTGCCTATCCTGATGTGGAAATCGCATCACTCACAATGGATCAGGCCAAGGCGTTGTACCAACGCGACTTCTGGAACCCGCTGGCCGATGCGCACCCGGCCATCAAGTTTCAGGTGTTCGACTTTGCAGTGAACGCCGGCATGCAGACTGCGATCCGCAAGTTGCAGTCGGCCATTGGCGTTGCCGACGATGGCCACTGGGGGCCGACCAGCGCAGGCACGCTGGCGCAGATGGACGTGAACGATGTGCTGATGCTGTTCGCGGCTGAACGCCTGAACTTCTACACCAGCCTGACCAAGTGGGATGCGTTCGGCAAAGGCTGGACACGGCGCATCGCCAACGATTTGAAACTCGCTGCCTCGGACAACTGA
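The CDS range and the protein length can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and the standard genetic code; `protein_length_from_range` and `translate` are hypothetical helper names, not functions from an existing library.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def protein_length_from_range(start: int, end: int) -> int:
    """Residue count for an inclusive CDS range that ends in a stop codon."""
    n_nt = end - start + 1
    if n_nt % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return n_nt // 3 - 1  # drop the stop codon


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(protein_length_from_range(34289, 34765))
print(translate("ATGACTGACTTTGACGAAGCG"))  # first seven codons of the CDS
```

The range 34289 to 34765 spans 477 nt, that is 159 codons: 158 residues plus the stop codon, which matches the listed protein length of 158 AA.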

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3ACsn) rather than this protein.
PDB ID
3ACsn
Method AlphaFoldv2
Resolution 71.71
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
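The confidence bands can be expressed as a small lookup. For an AlphaFold model, the "Resolution" value of 71.71 is probably a mean pLDDT rather than a crystallographic resolution; the page does not confirm this, so treat it as an assumption. The legend uses strict inequalities and leaves the exact boundary values (90, 70, 50) unassigned; this sketch puts them in the lower band, which is also an assumption. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band from the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumes the listed 71.71 is a mean pLDDT for the representative model.
print(plddt_band(71.71))
```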