Protein

Protein accession
A0A3G8F848 [UniProt]
Representative
Hz7N
Source
UniProt (cluster: phalp2_559)
Protein name
Tail length tape-measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAKIQATMSTEIALDTLQAANSIKRLTQLVNSSTNAWKAQESQMRSAGDYLGAAQAKYEGLSNTIQNQQQKIEKLKQEQSQLKGNTVEVAEQYLKYQQQIDQATTRLAALENQQRQAKQSMSYYQSGLADLQRSYRLSNDLSESYVKRLQAEGRESEALQAKLNASKNAVNNLNKQYEAQVKLLKEVAQSGDGDAYIKQKIRINETASAIAKAKREQIELANELRKSNPTFFDKLKAGIHGTNSKIDELGRNVNRSNSILGTFRERLSFGAIAGMASSAIQSISSSVMGLSGEVMATSDAIEKFESTMNFAGKTKKETEEASKYFKTYADKTVYDLQDVANTGAQLASNGIEKYKEITIASGNLNAVAGGNKETFKSLGMVLTQTAGAGKLTTENWNQLADAIPGASGKLQEALSKNGAFVGDFREALENGEISSEEFLTAIEQLGNSKSAEKAAKSTKTFEGAFGSLKSTVVSGMKDMIDAVGKEKITKSITGFGDSVQKIFDYLKDHKKELSSIGKSIFEISKIFGMAVWNTAKGIIVEISDSISAMNGHSKKSKDHLKNIASALKEVSKHKEAIQTIGKLFVGYFASKAVLNTSKSLFGTITDGISNVKKAGSKVNGALNWVMGVRGEDAVKNKLGGIKKIGRGTKSAFKWTASVATKTAKLALTGLLNTAKFVGNGIKLAFNFAKANPLILIATAVIGISTALYELYKHNKEFKKFVDGIFSAAKKAFDKIFKVTKEIFGKVINFFKKDWKQVLLFIANPVAGAFALIYKYNKKFKKFVDDLAKNAKKAFDNIVKWFKDIPKNLSKTWENIKDGAKSGMKNLGSAITGKLSDIGKEWKKGWKNSKDYLSDRWDDMKGNTKESIKRLGSSIKDKHDEIHDRWSKTWNKSKNFLSDRWDDMNAETKKKFGNDLKGLLFDNLDKIKNKFQDTWSGIKDGFGDMWDVMKRLAGDGINAVIAIPNTGIDGINGLIHDFGGPKNAIGKIPKVKFADGTGLFSSYRNPITRPTLATLNDGNDSPETNNQEMVILPNGKSFLPQGRNVEYLLPAGSEVINASELAMLMGVERGAYAKGTGFWSKVWDTTTNVAGSVWNGMKNGVDKFKKMIEFIGSAIKDPVGTLAKKFSPNADKLGAMFTPLGNALYKNPVGEAKNWWKELWSMANASMDEGTVAIGAKGDDYRFKDKAKDAGVDPWGYYYRECVSFIASRLANLGVNPSLFSHLGNGNQWVSARVPHLSRPKPGVVSVYTGGPVSSNHVDFVTAVHGDTYDGEDYNYNGDGKYHQFTGRHVKNAATFLDFGVRDFGSSGDSGKALKDRNNPLQTLIKRQVGGMFDWIKKTLGPLLSPAGGGEDHPQGTGVARWRDTVVRALEANGIEANNFRVSKILATIQKESGGNPNAQNNWDINARMGDPSIGLMQTIGRTFNAYKHQGHNNIRNGYDNLLAAINYIKHRYGTSDAAFNYVATHGYANGGLVRKNGVYELAEGDMPEYVIPTDIAKRGRAWRLLSEAVAHFAGDAPQNNHDDSSSQQRVSMLESKLDVVIGLLSQLVTNGSKPIEIQNIIDGRSVSNGLAPFMTKATNEYERRQALLGGSII
Physico‐chemical
properties
protein length:1593 AA
molecular weight:174348,6 Da
isoelectric point:9,50
hydropathy:-0,46
Representative Protein Details
Accession
Hz7N
Protein name
Hz7N
Sequence length
155 AA
Molecular weight
16865,70410 Da
Isoelectric point
5,58303
Sequence
DNLLAAINYIKHRYGTSDAAFNRVAAYGYANGGLVHKSGVYELAEGDMPEYVIPTDIAKRGRAWQLLTEAVARFAGDAPQGNHDNTSDRERVSVLEDKLDVMIGLLSQLVTNGSNPIEIRNVIDGRSVSNGLAPFMTKATNDYERRQALLGGSII
Other Proteins in cluster: phalp2_559
Total (incl. this protein): 14 Avg length: 977,8 Avg pI: 8,28

Protein ID Length (AA) pI
Hz7N 155 5,58303
1odid 164 8,11478
3pWod 193 6,57385
6ZUa3 129 5,32271
6u4iS 211 9,21706
p6bw 128 5,09797
A0A2P0VJB7 1593 9,47410
A0A2I6QQQ0 1593 9,50440
A0A3G8FCA6 1559 9,46030
A0A3S5H0N0 1592 9,52632
A0A3S7W7X2 1593 9,54198
A0AAD0LNJ8 1593 9,49576
A0AAD0LP08 1593 9,49576
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4084
1oo4E
2 32,5% 120 3.412E-19
2 phalp2_3961
7qIYz
4 30,4% 125 3.617E-14
3 phalp2_7800
7tCXe
1 30,1% 116 1.029E-09
4 phalp2_24567
4VE6H
4 36,0% 97 3.071E-08

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage CHPC928
[NCBI]
2365049 Aliceevansviridae > Moineauvirus > Moineauvirus CHPC928
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MH937472 [NCBI]
CDS location
range 8857 -> 13638
strand +
CDS
ATGGCGAAAATACAAGCTACGATGTCTACTGAAATAGCCTTAGACACGCTTCAGGCTGCTAACTCGATTAAACGATTAACTCAGTTAGTCAATAGTTCTACTAACGCTTGGAAGGCTCAAGAGAGTCAAATGCGTAGCGCTGGTGACTATTTAGGTGCAGCTCAAGCAAAATACGAAGGCTTGAGTAACACCATCCAGAACCAACAACAAAAAATTGAGAAGCTGAAACAAGAGCAGTCTCAACTTAAAGGGAATACTGTTGAAGTCGCTGAACAGTACCTCAAATACCAACAACAGATTGACCAAGCTACTACACGCTTAGCTGCGTTGGAAAATCAACAGCGTCAAGCTAAACAATCGATGTCCTATTATCAATCTGGACTCGCCGATTTACAAAGAAGCTATCGCTTAAGCAATGACTTGTCAGAAAGTTATGTTAAGAGATTACAAGCCGAAGGTCGTGAATCAGAAGCTTTACAAGCTAAGTTAAACGCTTCGAAAAATGCTGTTAATAACTTAAATAAGCAGTATGAAGCGCAAGTGAAACTTTTAAAAGAAGTCGCTCAATCAGGTGACGGTGATGCTTATATAAAGCAGAAAATACGTATTAACGAGACAGCAAGTGCGATAGCCAAAGCTAAGCGTGAACAAATTGAGTTAGCTAATGAGTTAAGGAAATCAAACCCTACTTTTTTTGATAAGTTAAAAGCAGGCATCCATGGTACTAATTCTAAAATCGATGAATTAGGAAGAAATGTAAATCGTTCGAATTCTATTTTAGGTACGTTTAGAGAGAGATTATCTTTTGGCGCAATAGCAGGGATGGCATCTAGTGCCATCCAATCCATTTCAAGTTCTGTCATGGGGTTGTCAGGAGAGGTTATGGCAACGTCCGATGCGATCGAAAAGTTTGAAAGTACAATGAACTTTGCTGGCAAAACTAAGAAAGAAACGGAAGAAGCTAGTAAATACTTTAAAACATACGCTGATAAAACAGTTTATGATTTGCAAGATGTCGCAAATACTGGTGCTCAATTAGCCTCAAACGGCATCGAAAAATATAAAGAGATTACTATTGCAAGTGGTAATTTAAATGCAGTAGCGGGTGGGAACAAAGAAACATTTAAGTCTCTTGGCATGGTGCTCACTCAAACAGCTGGGGCAGGGAAACTGACAACAGAAAACTGGAACCAGTTAGCTGATGCAATTCCCGGTGCATCTGGTAAGTTACAAGAAGCTCTTTCTAAAAACGGTGCATTCGTTGGCGACTTCCGTGAAGCTCTGGAAAACGGAGAAATCAGTTCTGAAGAGTTCCTGACAGCTATTGAGCAACTCGGAAACAGCAAGAGCGCTGAGAAGGCTGCTAAATCAACAAAAACATTTGAAGGAGCTTTTGGTAGTTTAAAATCAACAGTAGTCAGTGGCATGAAGGATATGATTGACGCTGTTGGAAAAGAAAAAATCACTAAAAGCATAACTGGTTTTGGTGATTCTGTACAAAAAATTTTTGACTATTTAAAAGATCACAAAAAAGAACTTTCATCTATCGGGAAAAGCATTTTTGAAATATCTAAGATTTTCGGTATGGCTGTTTGGAACACTGCAAAAGGAATTATTGTAGAGATTTCAGATAGCATTAGCGCTATGAATGGACATTCCAAAAAATCAAAAGACCATCTTAAAAATATTGCTAGCGCTTTAAAAGAAGTTTCAAAGCACAAAGAGGCTATCCAGACCATCGGTAAATTATTCGTTGGTTACTTTGCATCAAAAGCAGTTTTAAATACCTCAAAATCACTTTTTGGAACGATAACAGATGGTATTTCAAACGTCAAAAAAGCTGGTAGTAAGGTCAATGGCGCTTTAAATTGGGTTATGGGCGTTCGTGGAGAAGACGCAGTAAAGAATAAACTTGGTGGCATTAAGAAGATTGGTAGAGGAACTAAATCAGCTTTTAAATGGACTGCTTCTGTAGCAACTAAAACTGCTAAATTAGCTTTAACGGGATTGCTAAACACTGCTAAATTTGTAGGTAACGGTATTAAACTTGCATTTAATTTTGCTAAAGCAAATCCACTGATTTTAATTGCTACAGCTGTAATCGGTATATCTACTGCTCTCTACGAACTTTATAAACACAACAAAGAATTCAAAAAATTTGTTGACGGAATATTTTCTGCTGCTAAAAAAGCCTTTGATAAAATCTTCAAAGTAACCAAAGAAATCTTTGGCAAGGTTATTAATTTCTTTAAAAAAGACTGGAAACAAGTCCTTTTATTTATTGCAAATCCTGTTGCTGGAGCTTTCGCTTTAATTTACAAGTATAATAAGAAATTCAAGAAATTTGTTGATGATTTAGCAAAGAACGCAAAAAAAGCATTTGACAACATTGTCAAATGGTTTAAGGATATTCCTAAAAATCTTAGCAAGACTTGGGAAAACATCAAAGACGGCGCTAAAAGCGGCATGAAAAATCTTGGTTCTGCTATCACCGGTAAACTTTCTGACATCGGTAAAGAGTGGAAGAAAGGCTGGAAGAATTCCAAAGACTATCTATCAGACCGCTGGGATGATATGAAAGGCAATACTAAGGAAAGTATTAAACGTCTTGGGTCTTCTATCAAAGATAAGCATGATGAAATACACGACAGATGGTCTAAGACTTGGAATAAGTCAAAAAATTTCCTATCTGATCGCTGGGATGATATGAATGCCGAAACTAAAAAGAAATTTGGCAATGATTTGAAAGGGTTGCTTTTTGATAATCTGGATAAAATCAAAAACAAATTCCAAGATACATGGAGTGGTATTAAAGACGGCTTCGGCGACATGTGGGATGTAATGAAACGTCTAGCTGGCGATGGTATTAATGCCGTGATTGCAATCCCTAACACTGGTATCGACGGCATCAACGGCTTAATCCACGACTTCGGTGGTCCGAAGAACGCAATCGGTAAAATCCCTAAAGTTAAATTTGCGGATGGTACAGGTCTATTCAGCTCATACCGAAACCCAATCACTAGACCAACACTTGCTACACTAAACGATGGTAATGATAGCCCTGAGACTAACAACCAAGAGATGGTAATATTGCCAAACGGTAAATCATTCTTGCCACAAGGTCGCAATGTTGAATACCTCTTGCCAGCTGGTTCGGAAGTTATCAATGCCAGTGAATTGGCTATGCTCATGGGTGTTGAACGTGGAGCTTATGCTAAAGGTACTGGTTTTTGGTCTAAAGTCTGGGATACAACTACCAATGTAGCTGGCTCAGTTTGGAATGGGATGAAAAACGGTGTCGACAAATTCAAAAAAATGATTGAATTTATCGGAAGTGCTATTAAAGACCCTGTTGGTACACTAGCTAAAAAATTTAGTCCTAATGCTGATAAATTGGGCGCTATGTTTACCCCGCTCGGAAATGCGTTGTATAAGAACCCTGTCGGAGAAGCTAAAAATTGGTGGAAAGAACTCTGGTCAATGGCTAATGCTTCAATGGACGAAGGCACTGTAGCGATAGGCGCTAAAGGCGACGACTATCGCTTCAAAGATAAAGCGAAAGACGCTGGAGTAGACCCATGGGGATACTACTATCGTGAGTGTGTATCGTTCATTGCCAGCCGTTTGGCAAATCTTGGTGTTAACCCTAGCTTGTTTAGTCACCTAGGTAATGGTAACCAATGGGTATCTGCTAGAGTGCCACACTTAAGTAGACCAAAACCTGGTGTAGTATCTGTCTACACTGGTGGACCAGTATCAAGCAACCACGTTGACTTTGTAACAGCAGTACACGGTGACACTTACGACGGTGAAGATTATAACTATAATGGTGATGGTAAATATCATCAATTTACTGGTCGTCATGTCAAAAATGCTGCTACATTCCTTGATTTCGGTGTCCGAGATTTTGGAAGTAGTGGTGATAGTGGAAAAGCACTTAAAGATCGCAACAACCCACTTCAAACTTTGATTAAACGTCAAGTTGGTGGTATGTTCGATTGGATTAAGAAAACGCTTGGTCCGTTGCTCAGCCCAGCAGGCGGCGGTGAAGACCATCCGCAAGGGACTGGGGTTGCTCGTTGGCGTGATACGGTAGTTAGAGCGCTTGAAGCCAACGGCATTGAAGCTAACAACTTCCGTGTGTCTAAGATTTTAGCGACTATACAGAAGGAATCTGGTGGTAACCCTAACGCACAAAATAACTGGGATATTAATGCAAGAATGGGCGACCCATCAATTGGATTGATGCAAACTATTGGTCGTACATTTAATGCATACAAGCACCAAGGACACAACAATATCCGTAATGGATATGATAACTTGCTTGCTGCAATCAACTATATCAAGCATCGCTATGGAACATCTGATGCAGCCTTTAACTACGTTGCAACTCATGGTTATGCAAATGGTGGTTTAGTACGCAAGAACGGTGTTTATGAGCTCGCTGAGGGTGATATGCCAGAGTATGTTATTCCAACCGATATTGCCAAACGTGGCAGAGCGTGGCGATTACTTTCCGAAGCGGTGGCACATTTTGCTGGAGATGCACCACAAAACAACCATGATGATTCATCAAGTCAACAACGTGTTTCTATGCTTGAAAGTAAGCTAGACGTCGTGATTGGTTTGCTTAGCCAATTGGTAACTAATGGCTCTAAGCCAATCGAGATTCAAAACATCATTGATGGTAGAAGCGTTTCAAACGGATTAGCACCATTTATGACAAAAGCAACTAACGAATATGAGCGCAGGCAAGCGCTGTTAGGAGGTAGCATTATTTGA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (Hz7N) rather than this protein.
PDB ID
Hz7N
Method AlphaFoldv2
Resolution 64.04
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50