Protein

Protein accession: A0A8S5Q8B1 [UniProt]
Representative: 6qvgL
Source: UniProt (cluster: phalp2_36066)
Protein name: Protease
Lysin probability: 100%
PhaLP type: endolysin (probability: 96%, predicted by ML model)
Protein sequence
MLRRSMAQSTRSTSASGSTMPSSLSDGGRGVKDLIKTLTAWDGAVRGDARHKQIVDAYNSYLPHPRGYTLTVKDDYCAATVSAAAILCGLTDVIPVECSCGEQMRWYQARGQWVEADNHVPRVGEQVFYCWTDGADYAKTDNTGAPNHTGIVTAVRGNTFDIFEGNMGTAHKCGTRRMAVNGRYIRGFGRPAYPTERAAPGVVELGDKGDAVGKLQEFLRACGYTLDVDQSFGPATQRAWGEYLAAWIRESTK
Physico-chemical properties
protein length: 253 AA
molecular weight: 27546.6 Da
isoelectric point: 8.13
hydropathy: -0.42
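The record does not say how the hydropathy value is computed. A common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that convention; the `gravy` function name is hypothetical and the scale values are the standard Kyte-Doolittle ones, not something taken from this record.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy (assumed definition of 'hydropathy')."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


# Protein sequence of A0A8S5Q8B1 as listed in this record.
SEQUENCE = "MLRRSMAQSTRSTSASGSTMPSSLSDGGRGVKDLIKTLTAWDGAVRGDARHKQIVDAYNSYLPHPRGYTLTVKDDYCAATVSAAAILCGLTDVIPVECSCGEQMRWYQARGQWVEADNHVPRVGEQVFYCWTDGADYAKTDNTGAPNHTGIVTAVRGNTFDIFEGNMGTAHKCGTRRMAVNGRYIRGFGRPAYPTERAAPGVVELGDKGDAVGKLQEFLRACGYTLDVDQSFGPATQRAWGEYLAAWIRESTK"

# Prints the sequence length and the GRAVY score rounded to two decimals.
print(len(SEQUENCE), round(gravy(SEQUENCE), 2))
```

A negative score indicates a sequence that is hydrophilic on average, which matches the sign of the value listed above.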
Representative Protein Details
Accession: 6qvgL
Protein name: 6qvgL
Sequence length: 99 AA
Molecular weight: 11192.59920 Da
Isoelectric point: 4.97565
Sequence
MKRFLETLTAWEGAVRGDAVHKQIVDAYNSYLPHPRGYKLTYTDDYCAAMVSAAAILCGLTEVIPIECSCGEQMKWYQARGQWIEDDAHVPTVGEQVFY
Other Proteins in cluster: phalp2_36066
Total (incl. this protein): 4; Avg length: 142.8 AA; Avg pI: 5.88

Protein ID Length (AA) pI
6qvgL 99 4.97565
41gah 101 5.18351
5QMTO 118 5.21943
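The cluster averages can be checked against the four members, taking the three proteins listed here plus this protein (253 AA, pI 8.13). This is a minimal arithmetic sketch; it assumes the database averages are plain means, and it uses the rounded pI of this protein as given in the record.

```python
from statistics import mean

# (length in AA, isoelectric point) for each cluster member, from this record.
members = {
    "A0A8S5Q8B1": (253, 8.13),
    "6qvgL": (99, 4.97565),
    "41gah": (101, 5.18351),
    "5QMTO": (118, 5.21943),
}

avg_length = mean(length for length, _ in members.values())
avg_pi = mean(pi for _, pi in members.values())

print(f"{avg_length:.2f} {avg_pi:.2f}")  # 142.75 5.88
```

The mean length of 142.75 AA rounds to the 142.8 reported for the cluster, and the mean pI rounds to 5.88.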
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37240 (3uYls) 10 49.3% 77 2.342E-26
2 phalp2_10363 (1FVFV) 9 42.1% 95 4.048E-25
3 phalp2_13343 (3ZYpv) 1 35.3% 82 1.318E-23
4 phalp2_528 (nLg4) 1 40.8% 93 3.128E-22
5 phalp2_20125 (24JiN) 1 40.2% 67 1.618E-18
6 phalp2_7376 (4z8rc) 8 34.7% 95 9.938E-17

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 6qvgL (99 AA)
Member sequence: A0A8S5Q8B1 (253 AA)
[Domain diagram: axis from 1 to 99 AA on the representative sequence. Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis. Legend categories: EAD, CBD, Linker, Disordered, Unannotated.]

Taxonomy

Phage: Siphoviridae sp. cty3u30 [NCBI]; Taxonomy ID: 2825744; Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

CDS Source ID
CDS Source: BK015598 [NCBI]
CDS location: range 21114 -> 21875, strand +
CDS
ATGTTGCGGCGATCAATGGCGCAGTCAACAAGATCAACGTCGGCCAGCGGCAGTACTATGCCATCAAGTTTATCTGACGGAGGGCGCGGCGTGAAGGATCTCATTAAGACGCTGACAGCATGGGACGGCGCGGTGCGCGGAGACGCCAGGCACAAACAGATCGTCGATGCCTACAACAGTTATCTCCCGCATCCGCGCGGCTACACGCTGACGGTCAAGGACGACTACTGCGCCGCCACGGTAAGCGCGGCCGCGATCCTCTGCGGTTTGACGGATGTGATCCCGGTCGAGTGCAGCTGCGGCGAGCAGATGCGCTGGTACCAGGCGCGTGGACAGTGGGTCGAGGCGGACAATCACGTGCCGCGGGTCGGAGAGCAGGTATTTTACTGCTGGACCGACGGTGCGGATTACGCCAAGACCGACAATACCGGCGCGCCCAACCATACCGGTATTGTGACTGCGGTGCGGGGCAATACGTTCGACATTTTTGAGGGCAATATGGGGACCGCACATAAGTGCGGCACCCGCAGGATGGCCGTCAACGGCAGGTATATTCGCGGCTTCGGCCGTCCGGCATATCCGACCGAGCGGGCCGCGCCGGGTGTGGTCGAATTGGGCGACAAGGGCGATGCCGTCGGCAAGCTGCAGGAGTTTCTCCGCGCCTGCGGGTATACGCTCGACGTGGACCAGTCTTTCGGCCCGGCAACGCAGCGCGCCTGGGGCGAGTATCTGGCGGCGTGGATCAGAGAGAGCACAAAATAA
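The CDS coordinates and the protein length can be cross-checked: the range 21114 -> 21875 spans 762 nucleotides, i.e. 254 codons, which is 253 residues plus a stop codon. The sketch below also translates the first and last codons of the CDS. It assumes the standard genetic code, which the record does not state, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_nt = cds_length(21114, 21875)
print(n_nt, n_nt // 3 - 1)                     # 762 253
print(translate("ATGTTGCGGCGATCAATGGCGCAG"))   # MLRRSMAQ (start of the CDS)
print(translate("AGAGAGAGCACAAAATAA"))         # RESTK (end of the CDS, then stop)
```

Both fragments agree with the N-terminal and C-terminal residues of the protein sequence listed in this record.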

Gene Ontology

Description Category Evidence (source)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6qvgL) rather than this protein.
PDB ID: 6qvgL
Method: AlphaFoldv2
Resolution: 96.72
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
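The confidence bands above can be expressed as a small lookup. This is a sketch only: the `plddt_band` name is hypothetical, and because the legend uses strict inequalities, the handling of scores exactly at 90, 70 or 50 (assigned here to the lower band) is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Boundary values (exactly 90, 70, 50) fall into the lower band by assumption.
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```

If the 96.72 listed under Resolution is in fact a mean pLDDT for the AlphaFold model (the record does not say so), it would fall in the Very high band.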