Protein

Protein accession
A0A3G8FCA6 [UniProt]
Representative
Hz7N
Source
UniProt (cluster: phalp2_559)
Protein name
Tail length tape-measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAKIQATMSTEIALDTLQAANSIKRLTQLVNSSTNAWKAQESQMRSAGDYLGAAQAKYEGLSNTIQNQQQKIEKLKQEQSQLKGNTVEVAEQYLKYQQQIDQATTRLAALENQQRQAKQSMSYYQSGLADLQRSYRLSNDLSESYVKRLQAEGRESEALQAKLNASKNAVNNLNKQYEAQVKMLKEVAQSGGGDAYVKQKIRINETASAIAKAKREQIELANELRKSNPTFFDKLKAGIHGTNSKIDELGRNVNRSNSILGTFRERLSFGAIAGMASSAIQSISSSVMGLSGEVMATSDAIEKFESTMNFAGKTKKETEEASKYFKTYADKTVYDLQDVANTGAQLASNGIEKYKEITIASGNLNAVAGGNKETFKSLGMVLTQTAGAGKLTTENWNQLADAIPGASGKLQEALSKNGAFVGDFREALENGEISSEEFLTAIEQLGNSKSAEKAAKSTKTFEGAFGSLKSTVVSGMKDMIDAVGKEKITKSITGFGDSVQKIFDYLKDHKKELSSIGKSIFEISKIFGMAVWNTAKGIVVEISDSISAMNGHSKKSKDHLKNIASALKEVSKHKEAIQTIGKIFVGYFASKAVLNTSKSLFGTITDGISNVKKAGSKVNGALNWVMGVRGEDAVNNKLGGIKKIGRGTKSAFKWTASVATKTAKLALTGLLNTAKFVGNGIKLAFNFAKANPLILIATAVIGISTALYELYKHNKEFKKFVDGIFSAAKKAFDKIFKVTKEIFGKVINFFKKDWKQVLLFIANPIAGAFALIYKHNKKFKKFVDGTVDHVKDMAKGIAKHMSNLKKDWSDKWDNVKKFASKTWEGVKDNATEAMTALGKGIDKHHKGINNNWFDGWENSKKFLSKKWDEIGALTQDKFGVKITKLITDALTNIAKFFKDTWDNVKNGFGEMWDGMKRLAGNGINAVIALPNAGIDGINKLISDFGGSKNAISKIPQVKFAGGTGLFSSYRNPITRPTLATLNDGNDSPETNNQEMVILPNGKSFLPQGRNVDYLLPAGSEVINASELAMLMGVERGAYAKGTGFWSRIWDTTTNVAGSVWDTMKAGVDKFMKMIEFVGDAVKDPVGTLTKKFSPKADKLAGVFNPLGNALFKKPVDEAKEWWKELWSMASSSMDEGTVAMGAKGDDYRFKDKAKDAGADPWGYYFRECVSFVASRLANLGVNPSLFSHLGNGNQWVSARVPHLSRPKPGTVAVYTGGPVSSNHADFVTAVHGDTYDGEEYNYGGTGQYHQYTGRHIANAATFLDFGVRDSGSGEDGKPLKDKNKPRQTLIKRQVGGMFDWIKKTLGPLLNPSGGGEDHPQGNGVARWRDTVVRALEANGIEPNNFRVSKILATIQRESNGDPNAQNNWDINALRGDPSVGLMQTIGRTFNAYKHPGHNNIRNGYDNLLAAINYIKHRYGTSDAAFNRVAAYGYANGGLVHKNGVYELAEGDMPEYVIPTDIAKRGRAWRLLSEAVARFAGDAPQNNHDDSSSQQRVSMLESKLDVVIDLLSQLVTNGSKPIEIQNIIDGRSVSNGLAPFITKATNEYERRQALLGGQII
Physico‐chemical
properties
protein length:1559 AA
molecular weight:170137,9 Da
isoelectric point:9,46
hydropathy:-0,43
Representative Protein Details
Accession
Hz7N
Protein name
Hz7N
Sequence length
155 AA
Molecular weight
16865,70410 Da
Isoelectric point
5,58303
Sequence
DNLLAAINYIKHRYGTSDAAFNRVAAYGYANGGLVHKSGVYELAEGDMPEYVIPTDIAKRGRAWQLLTEAVARFAGDAPQGNHDNTSDRERVSVLEDKLDVMIGLLSQLVTNGSNPIEIRNVIDGRSVSNGLAPFMTKATNDYERRQALLGGSII
Other Proteins in cluster: phalp2_559
Total (incl. this protein): 14 Avg length: 977,8 Avg pI: 8,28

Protein ID Length (AA) pI
Hz7N 155 5,58303
1odid 164 8,11478
3pWod 193 6,57385
6ZUa3 129 5,32271
6u4iS 211 9,21706
p6bw 128 5,09797
A0A2P0VJB7 1593 9,47410
A0A2I6QQQ0 1593 9,50440
A0A3G8F848 1593 9,50440
A0A3S5H0N0 1592 9,52632
A0A3S7W7X2 1593 9,54198
A0AAD0LNJ8 1593 9,49576
A0AAD0LP08 1593 9,49576
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4084
1oo4E
2 32,5% 120 3.412E-19
2 phalp2_3961
7qIYz
4 30,4% 125 3.617E-14
3 phalp2_7800
7tCXe
1 30,1% 116 1.029E-09
4 phalp2_24567
4VE6H
4 36,0% 97 3.071E-08

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage CHPC1041
[NCBI]
2365015 Aliceevansviridae > Moineauvirus > Moineauvirus CHPC1041
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MH937493 [NCBI]
CDS location
range 8531 -> 13210
strand +
CDS
ATGGCGAAAATACAAGCTACGATGTCTACTGAAATAGCCTTAGACACGCTTCAGGCTGCTAACTCGATTAAACGATTAACTCAGTTAGTCAATAGTTCTACTAACGCTTGGAAGGCTCAAGAGAGTCAAATGCGTAGCGCTGGTGACTATTTAGGTGCAGCTCAAGCAAAATACGAAGGCTTGAGTAACACCATCCAGAACCAACAACAAAAAATTGAGAAGCTGAAACAAGAGCAGTCTCAACTTAAAGGGAATACTGTTGAAGTCGCTGAACAGTACCTCAAATACCAACAACAGATTGACCAAGCTACTACACGCTTAGCTGCGTTGGAAAATCAACAGCGTCAAGCTAAACAATCGATGTCCTATTATCAATCTGGACTCGCCGATTTACAAAGAAGCTATCGCTTAAGCAATGACTTGTCAGAAAGTTATGTTAAGAGATTACAAGCCGAAGGTCGTGAATCAGAAGCTTTACAAGCTAAGTTAAACGCTTCGAAAAATGCTGTTAATAACTTAAATAAGCAGTATGAAGCGCAAGTAAAAATGCTTAAAGAGGTTGCTCAATCTGGCGGAGGTGATGCTTATGTAAAGCAGAAAATACGTATTAACGAGACAGCAAGTGCGATAGCCAAAGCTAAGCGTGAACAAATTGAGTTAGCTAATGAGTTAAGGAAATCAAACCCTACTTTTTTTGATAAGTTAAAAGCAGGCATCCATGGTACTAATTCTAAAATCGATGAATTAGGAAGAAATGTAAATCGTTCGAATTCTATTTTAGGTACGTTTAGAGAGAGATTATCTTTTGGCGCAATAGCAGGGATGGCATCTAGTGCCATCCAATCCATTTCAAGTTCTGTCATGGGGTTGTCAGGAGAGGTTATGGCAACGTCCGATGCGATCGAAAAGTTTGAAAGTACAATGAACTTTGCTGGCAAAACTAAGAAAGAAACGGAAGAAGCTAGTAAATACTTTAAAACATACGCTGATAAAACAGTTTATGATTTGCAAGATGTCGCAAATACTGGTGCTCAATTAGCCTCAAACGGCATCGAAAAATATAAAGAGATTACTATTGCAAGTGGTAATTTAAATGCAGTAGCGGGTGGGAACAAAGAAACATTTAAGTCTCTTGGCATGGTGCTCACTCAAACAGCTGGGGCAGGGAAACTGACAACAGAAAACTGGAATCAGTTAGCTGATGCAATTCCCGGTGCATCTGGTAAGTTACAAGAAGCTCTTTCTAAAAACGGTGCATTCGTTGGCGACTTCCGTGAAGCTCTGGAAAACGGAGAAATCAGTTCTGAAGAGTTCCTGACAGCTATTGAGCAACTCGGAAACAGCAAGAGCGCTGAGAAGGCTGCTAAATCAACAAAAACATTTGAAGGAGCTTTTGGTAGTTTAAAATCAACAGTAGTCAGTGGCATGAAGGATATGATTGACGCTGTTGGAAAAGAAAAAATCACTAAAAGCATAACTGGTTTTGGTGATTCTGTACAAAAAATTTTTGACTATTTAAAAGATCACAAAAAAGAACTTTCATCTATCGGGAAAAGCATTTTTGAAATATCTAAGATTTTCGGTATGGCTGTTTGGAACACTGCAAAAGGAATTGTTGTAGAGATTTCAGATAGCATTAGCGCTATGAATGGACATTCCAAAAAATCAAAAGACCATCTTAAAAATATTGCTAGCGCTTTAAAAGAAGTTTCAAAGCACAAAGAGGCTATCCAGACCATCGGTAAAATATTCGTTGGTTACTTTGCATCAAAAGCAGTTTTAAATACCTCAAAATCACTTTTTGGAACGATAACAGATGGTATTTCAAACGTCAAAAAAGCTGGTAGTAAGGTCAATGGCGCTTTAAATTGGGTTATGGGCGTTCGTGGAGAAGACGCAGTAAATAATAAACTTGGTGGCATTAAGAAGATTGGTAGAGGAACTAAATCAGCTTTTAAATGGACTGCTTCTGTAGCAACTAAAACTGCTAAATTAGCTTTAACGGGATTGCTAAACACTGCTAAATTTGTAGGTAACGGTATTAAACTTGCATTTAATTTTGCTAAAGCAAATCCACTGATTTTAATTGCTACAGCTGTAATCGGTATATCTACTGCTCTCTACGAACTTTATAAACACAACAAAGAATTCAAAAAATTTGTTGACGGAATATTTTCTGCTGCTAAAAAAGCCTTTGATAAAATCTTCAAAGTAACCAAAGAAATCTTTGGCAAGGTTATTAATTTCTTTAAAAAGGACTGGAAACAAGTCCTTTTATTTATTGCCAATCCGATTGCTGGAGCTTTCGCTTTAATTTACAAGCATAACAAGAAATTTAAGAAATTCGTTGATGGCACTGTGGACCATGTCAAAGATATGGCCAAAGGTATTGCAAAACACATGAGTAACCTTAAGAAAGATTGGTCTGATAAGTGGGACAACGTTAAGAAATTCGCATCTAAAACATGGGAAGGTGTCAAGGATAACGCTACTGAAGCCATGACTGCTCTTGGTAAGGGTATCGACAAGCACCACAAAGGTATCAATAATAATTGGTTTGATGGATGGGAAAACTCCAAGAAATTTCTATCGAAAAAATGGGATGAAATCGGAGCGTTAACGCAAGATAAATTCGGTGTTAAAATTACCAAACTAATCACCGACGCTTTAACCAACATCGCTAAATTCTTCAAGGATACATGGGATAATGTCAAAAATGGTTTTGGCGAGATGTGGGATGGCATGAAACGCCTTGCTGGCAATGGTATCAATGCCGTGATTGCTCTCCCAAATGCCGGTATAGACGGCATTAACAAACTGATTTCTGATTTTGGTGGTAGCAAGAACGCTATCTCTAAAATCCCACAAGTTAAATTTGCAGGTGGTACAGGTCTATTCAGCTCATACCGAAACCCAATCACTAGACCAACACTTGCTACACTAAACGATGGGAATGATAGCCCAGAAACCAACAACCAAGAAATGGTAATTCTTCCTAATGGTAAGTCATTCTTGCCACAAGGTCGAAACGTTGATTATCTCTTGCCAGCTGGTTCAGAGGTAATTAATGCTAGCGAGTTAGCTATGCTTATGGGAGTAGAGCGTGGTGCATACGCTAAGGGTACAGGTTTCTGGTCAAGAATCTGGGATACAACTACAAACGTAGCGGGATCAGTTTGGGATACCATGAAGGCAGGCGTTGACAAATTCATGAAAATGATTGAATTTGTCGGTGACGCCGTAAAAGACCCAGTTGGAACTTTAACCAAAAAATTTAGCCCTAAAGCTGACAAGTTGGCGGGCGTTTTCAATCCGCTCGGTAATGCACTTTTCAAAAAACCTGTTGACGAAGCAAAAGAATGGTGGAAAGAACTCTGGTCTATGGCTAGCTCCTCAATGGATGAGGGGACTGTGGCAATGGGTGCTAAAGGCGACGACTATCGCTTCAAAGATAAAGCGAAAGACGCTGGAGCAGACCCATGGGGGTACTACTTCCGTGAGTGTGTATCATTCGTTGCCAGCCGTTTGGCAAACCTTGGTGTTAATCCTAGCTTGTTTAGCCACCTTGGTAACGGTAATCAATGGGTATCTGCTAGAGTGCCACACTTAAGTAGACCTAAACCAGGAACGGTAGCAGTCTACACTGGTGGACCAGTATCAAGCAACCACGCTGACTTTGTAACAGCCGTTCATGGTGATACCTACGACGGTGAAGAATATAACTACGGTGGTACCGGTCAGTATCACCAATACACTGGCCGTCACATTGCTAACGCTGCTACATTCCTAGACTTTGGAGTGCGAGACAGTGGAAGCGGTGAGGATGGAAAACCGCTTAAGGATAAAAACAAACCACGACAGACTTTAATCAAGCGTCAAGTTGGCGGTATGTTTGACTGGATTAAGAAAACGCTTGGTCCGTTGCTCAACCCATCGGGCGGCGGTGAAGACCATCCACAAGGAAATGGGGTTGCTCGTTGGCGTGATACGGTAGTTAGAGCGCTTGAAGCTAACGGTATAGAACCAAACAACTTCCGTGTTTCTAAGATTTTAGCTACCATCCAGAGAGAATCTAATGGTGACCCTAACGCTCAAAACAACTGGGATATTAACGCTCTAAGAGGTGACCCGTCAGTCGGTTTGATGCAAACAATTGGTCGAACCTTCAACGCATACAAGCACCCTGGTCACAATAATATCCGCAATGGATATGATAACTTGCTTGCTGCAATTAACTACATCAAACATCGCTATGGAACGTCAGACGCAGCCTTTAACCGTGTGGCTGCTTATGGTTACGCAAACGGTGGCCTAGTCCACAAGAACGGCGTTTATGAGCTGGCTGAGGGTGATATGCCAGAGTATGTTATTCCAACCGATATTGCAAAACGTGGCAGAGCGTGGCGATTACTTTCCGAAGCAGTGGCTCGTTTTGCTGGGGATGCCCCACAAAACAACCATGATGATTCATCAAGTCAACAACGTGTTTCTATGCTTGAAAGTAAGCTAGACGTCGTGATTGATTTGCTTAGCCAATTGGTAACTAATGGCTCTAAGCCAATTGAGATCCAAAATATCATTGATGGAAGAAGCGTATCAAACGGTCTAGCACCATTTATTACAAAAGCCACAAACGAATATGAGCGCAGGCAAGCGCTGTTAGGAGGTCAAATTATTTGA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (Hz7N) rather than this protein.
PDB ID
Hz7N
Method AlphaFoldv2
Resolution 64.04
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50