Protein

Protein accession
A0AAE8Z0H4 [UniProt]
Representative
3KGn
Source
UniProt (cluster: phalp2_11378)
Protein name
Baseplate hub subunit and tail
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MLDNMRWFYGTVEDVNDPDQNGRVAVRIYGVHTDDTVLLPTDKLPWAKVLMPASNASSAGLGWSPTGIIPGTEVMGFALDEAYQNLRITWTWPGANPTDGADTNPLALGQVVQSVERQAYNAVTDVPVKQEDEPQPDPQPPVGGYDPEKWMTIARGELGVKEYAGKFNNNPRIIEYHKTTSLGASEDEVSWCAAFVGWVMLQAGYTSTRSALARSYLTWGSALSAPQYGAIVVFRRGNNPTFGHVAFVQKFDANYVWCIGGNQSDSVKVSRFSRSSVLGYRWPGPANTATAAPAQQNGKWSEPIPDRTPKEQPTPAPTGRVQDIDNTGETMVPAAGGSKYPYNNVMASRAGHIMEVDDTPGGERLHWMHMSGSYKQMLPNGDVVNKSVKDHYDLTMFDKRYYVGGDHNLTVKGTEVQRKTGEVYHLHSNNYSNVVAGTALMKFSNLAEIQAQNILRIICETLEVGGTLKVPKILATEIIADKLSVAQTIDGNIKYAEGAGRASSLSGATPATPSGPGEIDIKAELKDNGGNFGTE
Physico‐chemical
properties
protein length:535 AA
molecular weight:58091,4 Da
isoelectric point:5,49
hydropathy:-0,43
Representative Protein Details
Accession
3KGn
Protein name
3KGn
Sequence length
289 AA
Molecular weight
N/A Da
Isoelectric point
8,75869
Sequence
MEQPAWLAHAWREFGVREIAGAASNDRILQFFRDIGHNEINSDEIAWCAAFVGACLERSGYTSTRSLLARSYLDWGTXLQAAKLGAIAVCSRGNDPGKGHVGFVVGADATRVFLLGGNQQNSVSVQPYERARLLGLRWPGEKTIGAQTKNEVFERALTHVLEMEGGWSNDPYDPGGPTNRGITLAVYAAYRGIELTDFNKEALLRELKKLTVADVRPIYYKRYWVPSRAADLPPPLALMHFDAAVNHGVGNAARMLQRSLGVTVDGIIGPQTLAAANSQRVEVLSRHRP
Other Proteins in cluster: phalp2_11378
Total (incl. this protein): 35 Avg length: 521,1 Avg pI: 5,70

Protein ID Length (AA) pI
3KGn 289 8,75869
A0A075E0Z2 536 5,35460
A0A0A0YXE4 617 5,94862
A0A0N6YR19 536 5,35460
A0A140XB17 536 5,35460
A0A140XBI3 536 5,35460
A0A248H480 536 5,35460
A0A248H5A9 536 5,35460
A0A2H4PHI6 536 5,35801
A0A2P1JU72 617 5,94862
A0A2Z6C8A6 536 5,50261
A0A3G2K900 536 5,35801
A0A3G2K9H6 536 5,35801
A0A513QBV1 536 5,34186
A0A6G6XTW9 536 5,34186
A0A7L4YDR6 536 5,35801
A0A7L4YEQ2 536 5,35460
A0A7L4YF54 536 5,35460
A0A7L4YG19 536 5,42923
A0A7L4YGX8 536 5,35460
A0A7M3T3L2 536 5,35801
A0A7M3T4D9 536 5,35460
A0A8E7FLE2 486 5,91537
A0A8X8M467 532 5,77072
A0A8X8RIQ1 532 5,77072
C8XUI0 536 5,34186
I0J2T5 536 5,35801
K4I332 535 5,49164
A0A873WJ61 160 9,26754
A0A9E7MJ97 536 5,67312
A0A9E7NKK6 536 5,67312
A0AAE7MU77 536 5,66988
A0AAE8Z632 536 5,67312
A0AAX4G8U9 536 5,67312
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5261
8698a
186 61,4% 309 1.125E-118
2 phalp2_2910
IsDr
1 38,9% 285 4.170E-50
3 phalp2_4482
3hajG
2 33,1% 317 2.897E-42

Domains

Domains [InterPro]
Unannotated
GH108
Representative sequence (used for alignment): 3KGn (289 AA)
Member sequence: A0AAE8Z0H4 (535 AA)
1 289 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05838|PF09374

Taxonomy

  Name Taxonomy ID Lineage
Phage Shigella phage vB_SboS_Gloob
[NCBI]
2902746 Ackermannviridae > Agtrevirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
OL615011 [NCBI]
CDS location
range 47534 -> 49141
strand +
CDS
ATGTTAGACAATATGAGATGGTTTTATGGCACCGTTGAAGATGTCAACGATCCAGACCAAAACGGGCGCGTCGCCGTCCGCATCTATGGCGTCCACACTGATGACACCGTCCTCCTGCCTACGGATAAACTCCCATGGGCGAAGGTACTGATGCCTGCTTCAAACGCCTCCTCTGCTGGTTTAGGTTGGTCTCCAACGGGTATCATTCCCGGCACTGAGGTTATGGGGTTCGCCCTTGACGAAGCATACCAGAACCTGCGTATCACTTGGACTTGGCCTGGGGCAAATCCCACTGATGGGGCAGACACGAATCCACTGGCTCTCGGACAGGTCGTTCAATCGGTAGAACGTCAGGCGTACAACGCCGTCACTGACGTTCCTGTGAAACAAGAAGACGAACCACAACCAGATCCACAACCACCTGTTGGTGGGTATGATCCTGAAAAATGGATGACCATCGCTCGTGGTGAGCTGGGTGTCAAGGAGTACGCCGGAAAGTTCAACAACAACCCGCGCATTATAGAATATCACAAGACAACCTCCCTTGGTGCTTCTGAAGATGAAGTGTCGTGGTGTGCGGCGTTCGTGGGCTGGGTTATGTTGCAGGCTGGATACACGTCTACGCGCTCCGCTCTTGCCCGATCTTACCTTACATGGGGTAGTGCACTGTCAGCACCACAATATGGTGCTATCGTAGTCTTCCGGCGCGGTAACAACCCTACATTTGGACACGTGGCGTTTGTACAGAAGTTCGATGCAAACTATGTCTGGTGTATCGGTGGCAACCAATCCGATTCAGTGAAGGTCAGCCGCTTCAGCCGTTCGTCAGTCCTGGGCTACCGCTGGCCTGGTCCTGCTAACACAGCCACAGCCGCGCCTGCGCAGCAGAATGGTAAATGGTCAGAACCTATCCCAGATCGTACTCCAAAGGAACAACCGACTCCGGCTCCTACAGGGCGCGTCCAGGATATTGATAACACGGGCGAAACTATGGTTCCGGCAGCGGGTGGCTCCAAGTACCCGTACAACAACGTCATGGCGTCCCGAGCAGGTCACATAATGGAGGTGGACGACACTCCTGGTGGTGAGCGTCTTCATTGGATGCACATGTCCGGCTCCTACAAACAGATGCTCCCGAACGGCGACGTCGTCAACAAGTCAGTGAAGGATCATTATGACCTGACCATGTTTGACAAGCGCTATTATGTCGGCGGGGACCACAACCTGACGGTCAAAGGAACTGAAGTTCAGCGCAAGACTGGTGAGGTCTATCACCTTCACTCCAACAACTACTCAAATGTGGTTGCTGGTACAGCGCTTATGAAGTTCAGCAACCTAGCAGAGATCCAAGCCCAGAATATCCTGCGCATCATCTGTGAGACTCTGGAAGTTGGTGGAACTCTCAAGGTGCCGAAGATCCTGGCCACTGAGATCATCGCCGACAAACTGTCGGTCGCTCAGACGATTGATGGCAACATCAAATATGCCGAGGGTGCAGGTCGTGCTTCTTCTCTGTCTGGTGCTACCCCTGCAACCCCATCGGGTCCAGGCGAGATAGATATTAAGGCAGAATTAAAAGACAATGGTGGTAATTTCGGCACCGAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3KGn) rather than this protein.
PDB ID
3KGn
Method AlphaFoldv2
Resolution 87.75
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50