Protein

Protein accession
A0A6B9WNQ5 [UniProt]
Representative
7sgqF
Source
UniProt (cluster: phalp2_26079)
Protein name
Minor tail protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MLTKKVKSDIAAHVAACMPEEACGLVVMVGRKQVFVPCMNVFEDPTGARSRKDAFTISDMAWMDAEDIGDVVRVVHSHPGQRELTPSLGDVNGCNGSGIVWTITNEYGDFIEIDPEDPPLVGRRFVLGITDCYGLVMDWHKKQGVNLPDFRVPYNWWEMGESLYMDNWYDAGFRECAENTPGAMVIMQISAPVPNHAGIFLPGNQLLHHIYGSLSSVVPFRAGFFRDNVVKWVRHKDLPGEITEWQ
Physico‐chemical
properties
protein length:246 AA
molecular weight:27571,2 Da
isoelectric point:5,11
hydropathy:-0,15
Representative Protein Details
Accession
7sgqF
Protein name
7sgqF
Sequence length
100 AA
Molecular weight
12032,39720 Da
Isoelectric point
4,31882
Sequence
MQDLSQLDWWLVCNGELHIFPKIQPLIGREFIHGTTDCYSIYKDFYYLAGLDMDEFKRQDYWWEKGENLYLENIEGQGFERLSEDAELQVGDVILMQSWR
Other Proteins in cluster: phalp2_26079
Total (incl. this protein): 62 Avg length: 243,5 Avg pI: 5,20

Protein ID Length (AA) pI
7sgqF 100 4,31882
C9EHB0 246 5,30100
G9JXH6 246 5,12178
A0A0M7QBK7 246 5,21176
K4PAQ4 246 5,21176
A0A1B2AP02 246 5,21176
A0A0A7DVB8 246 5,32481
A0A076YQM8 246 5,32265
W0LM02 239 5,07563
A0A1B2APZ9 246 5,21176
A0A343S156 246 5,11178
A0A3G8F5Q1 246 5,11178
A0A482N3U4 246 5,11178
A0A482N4B2 246 5,02357
A0A482N4U1 246 5,21904
A0A4V1E2M6 246 5,19454
A0A653FW65 246 5,21176
A0A6B9WWB4 246 5,21176
A0A6B9X3G2 246 5,21176
A0A6B9XGK5 246 5,33107
A0A6C0R0D8 246 5,21176
A0A6G5YHE0 245 4,96292
A0A6M3YRM6 246 5,21176
A0A7D7FN11 246 5,10354
A0A7H0XF12 246 5,45617
A0A7H0XF25 246 5,45617
A0A7H0XFG1 246 5,45617
A0A7L8ZKG3 246 5,45617
A0A8E5NQT7 246 5,21176
C0LP50 246 5,11178
K7P800 246 5,11178
A0A976R6E2 246 5,31623
A0A976SPR6 246 5,21176
A0A9E7LUV7 246 5,21176
A0A9E7MJF2 246 5,03397
A0A9E7MKU3 246 5,02357
A0A9E7RZL4 246 5,21176
A0AA48U0U9 246 5,11178
A0AAD1Q7C2 246 5,21176
A0AAD1Q8K3 246 5,21176
A0AAE7S309 246 5,33107
A0AAE7VY11 246 5,21176
A0AAE7VZI1 246 5,32481
A0AAE7XQF7 246 5,21904
A0AAE7XQX1 246 5,21904
A0AAE8B0T3 246 5,45617
A0AAE8B6H7 246 5,21176
A0AAE8B7H9 246 5,20221
A0AAE8YPW1 246 5,32481
A0AAE8YU49 246 5,21176
A0AAE8YV83 246 5,21176
A0AAE8YWU6 246 5,21176
A0AAE9CD94 246 5,21176
A0AAE9WZ21 246 5,21176
A0AAF0FFU8 246 5,24569
A0AAX4M4M4 246 5,21176
A0AAX4NW42 246 5,21176
A0AAX4QWU9 246 5,21176
A0AB39JCZ5 246 5,21176
A0AB74UEX9 246 5,20301
A0AB74UKL4 246 5,21176
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_33148
7CcD2
99 44,7% 96 1.265E-22
2 phalp2_23201
4Trrv
17 39,4% 104 2.755E-20
3 phalp2_18066
3JA4t
92 33,6% 95 1.199E-14
4 phalp2_3626
4QVie
1 32,3% 71 1.597E-10

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage rolling
[NCBI]
2696442 Dhillonvirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN850575 [NCBI]
CDS location
range 36800 -> 37540
strand +
CDS
ATGTTAACGAAAAAGGTTAAGAGCGATATCGCCGCACACGTTGCGGCATGTATGCCTGAAGAAGCGTGTGGCCTGGTCGTCATGGTGGGCCGCAAACAAGTATTCGTCCCATGCATGAACGTGTTCGAAGACCCGACCGGCGCGCGCTCGCGTAAGGACGCGTTTACAATTAGTGATATGGCATGGATGGACGCCGAGGATATAGGCGACGTCGTGCGCGTAGTCCACTCACATCCAGGCCAGCGCGAGCTAACCCCGTCATTGGGCGACGTTAACGGATGCAACGGCAGCGGCATAGTGTGGACCATTACCAACGAGTACGGCGACTTTATCGAGATCGACCCTGAAGACCCACCACTGGTAGGTCGCCGATTTGTTCTCGGAATTACGGATTGCTATGGCCTCGTCATGGACTGGCACAAAAAGCAGGGCGTGAATCTGCCTGACTTCCGCGTGCCGTACAACTGGTGGGAGATGGGCGAAAGCCTGTACATGGATAACTGGTATGATGCGGGATTCAGGGAGTGCGCGGAGAATACGCCGGGGGCAATGGTCATCATGCAGATTAGCGCACCAGTGCCAAACCATGCTGGGATCTTCCTTCCGGGCAACCAACTACTACACCATATCTACGGCAGCCTGTCGAGCGTGGTCCCCTTCCGGGCAGGATTTTTCCGCGACAATGTGGTTAAATGGGTACGTCATAAAGACTTACCTGGGGAAATCACAGAATGGCAATGA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008234 cysteine-type peptidase activity molecular function None (UniProt)
GO:0008235 metalloexopeptidase activity molecular function None (UniProt)
GO:0008270 zinc ion binding molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7sgqF) rather than this protein.
PDB ID
7sgqF
Method AlphaFoldv2
Resolution 90.55
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50