Protein

Protein accession
A0A6J5KFS8 [UniProt]
Representative
2l7Fc
Source
UniProt (cluster: phalp2_4388)
Protein name
Glycoside hydrolase, family 19, catalytic
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDKSAFYASVRAGILGPTLEPGEFTGCEAILTAMAGLPVAHCAYALATAYHETARTMQPVREAYWLSEAWRKANLRYWPWYGRGYVQLTWEVNYQRADDEAAAAGLIDKGDLMADPDLAMRLNLAAFIMRRGMTEGWFTGRTLAKCLPDRLGTVAQFTAARRIINGTDKASSIAAYAEQFQNALIAGGWA
Physico‐chemical
properties
protein length:190 AA
molecular weight:20922,7 Da
isoelectric point:6,73
hydropathy:-0,09
Representative Protein Details
Accession
2l7Fc
Protein name
2l7Fc
Sequence length
80 AA
Molecular weight
9293,45610 Da
Isoelectric point
5,71962
Sequence
MATDMNLGETGLIVAECQKRGLLRNQAAYVLATAYWETAHTMEPVREAFWLSDEWRKANLRYYPWYGRGFVQLTWECTGG
Other Proteins in cluster: phalp2_4388
Total (incl. this protein): 3 Avg length: 121,7 Avg pI: 7,21

Protein ID Length (AA) pI
2l7Fc 80 5,71962
4ViX1 95 9,17103
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9312
7tJZH
13 42,1% 64 4.291E-09

Domains

Domains [InterPro]
Unannotated
Disordered region
Representative sequence (used for alignment): 2l7Fc (80 AA)
Member sequence: A0A6J5KFS8 (190 AA)
1 80 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
LR796135 [NCBI]
CDS location
range 14931 -> 15503
strand +
CDS
ATGGACAAGTCGGCATTTTATGCCAGCGTTCGTGCGGGCATACTTGGGCCTACTCTGGAGCCGGGCGAGTTCACTGGCTGCGAAGCCATCCTGACGGCCATGGCGGGGCTCCCTGTGGCGCACTGCGCCTATGCGCTGGCAACTGCCTACCATGAGACAGCGCGGACCATGCAGCCTGTGCGGGAGGCCTACTGGCTCTCCGAGGCATGGCGCAAGGCAAACCTGCGCTACTGGCCGTGGTACGGACGCGGCTACGTGCAGCTTACGTGGGAAGTAAATTACCAGCGCGCCGATGATGAGGCCGCTGCGGCCGGGTTGATCGACAAGGGTGACCTCATGGCTGACCCTGACCTTGCCATGCGTCTGAACCTTGCCGCGTTCATCATGCGCCGGGGGATGACCGAGGGCTGGTTCACTGGGAGGACATTGGCGAAGTGCCTGCCTGATCGGCTTGGGACGGTGGCACAGTTCACGGCTGCTCGGAGGATCATCAACGGAACCGATAAGGCATCGTCGATCGCGGCTTATGCCGAGCAGTTTCAGAATGCACTTATTGCAGGAGGTTGGGCATGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2l7Fc) rather than this protein.
PDB ID
2l7Fc
Method AlphaFoldv2
Resolution 58.27
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50