Protein

Protein accession
H9C1A7 [UniProt]
Representative
7hU0P
Source
UniProt (cluster: phalp2_385)
Protein name
Putative tail-fiber/lysozyme protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAENSGTIDSLLVSLGLETDAKSFQKGADAIKGVTDGMMQLAAAAGVGLGFKALTQGVASSYSELKRLSDITGFTIQQIRGLEFAMRRIGSANPVASGQKLAQLIPDIVRRFGQGQLNQDAYYSDKFKPSELVKISTTQGNDAATEYFLNAYGSMNATERAYMRSGVGISENDDIARLGEYGGKFFRESMNMSNAMTPEIDPKLAKVAQDFNDEMAKLARNFENLAYSMGSQLLPIVNKFLELVNGFIHENPEIASAMIGAAGVAGTVGALGFGKRILGLGGGGGSAAGGSGGMSWLSRLLVNPITAGAVSAFTPGNFFTSSEDARMMESPDAFLRKKWERENPGVPYNGQYGGVTPYQAYLKQKAGNIGDALNNPNARSYLDAISRAEGTSGYMNSGYHTMFGGGQIASLADHPRQLKDFQQTDGTWNKTSAAGRYQFTQKSWDEAAAALGLNDFSPQSQDMAALWLIQRAGQLDNVLNGDFMTATNNLGGVWASLPSSPYAQPKRSQAEMESYYTPDYVQRRSQAPYNSSVNRSESSKPVSITLNNTQNISGLGLNEQQVQDTVASALTTAGENLERSFNNNRW
Physico‐chemical
properties
protein length:586 AA
molecular weight:62949,5 Da
isoelectric point:5,56
hydropathy:-0,38
Representative Protein Details
Accession
7hU0P
Protein name
7hU0P
Sequence length
238 AA
Molecular weight
25957,83970 Da
Isoelectric point
4,73471
Sequence
MSNPDAIGRQNWAKNNPGVPYPSDSSDLNNLVDDPNVRQYLEVLSKAEGTASYANSGYNTMFGGDQFYDSSDHPRQLKDFTQTDGTKNKTSAAGRYQFTSSSWDDAAKALNLTDFSPRSQDLAALFLIQRAGQLENVTNGNFADATNGLGGVWASLPSSNYAQPKCSWEEIQGYSDRQTTPMQSVAASATRGDVRLEQHNIINVGTVGGDSESIRDGVLQATTQLAQQARDMMHTEHY
Other Proteins in cluster: phalp2_385
Total (incl. this protein): 4 Avg length: 325,8 Avg pI: 7,05

Protein ID Length (AA) pI
7hU0P 238 4,73471
1gOki 263 9,07929
83cVf 216 8,81355
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24710
6nKSK
5 59,4% 143 1.197E-56
2 phalp2_29455
1worz
7 46,2% 188 1.306E-48
3 phalp2_15310
1iZB6
260 42,9% 149 6.150E-45
4 phalp2_10639
2wVtc
30 35,7% 179 7.369E-26
5 phalp2_16969
8cfPJ
14 35,5% 194 1.004E-25
6 phalp2_27887
fSOH
15 37,8% 164 2.603E-23
7 phalp2_11746
416Qp
39 37,5% 152 5.660E-22
8 phalp2_27370
4DRTE
6 32,7% 174 1.392E-17
9 phalp2_8965
3zk7o
3 30,8% 178 6.822E-14
10 phalp2_39909
1jiVQ
7 27,5% 189 4.053E-10

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Pectobacterium phage ZF40 (Bacteriophage ZF40)
[NCBI]
1127516 No lineage information
Host Pectobacterium carotovorum
[NCBI]
554 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Pectobacterium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JQ177065 [NCBI]
CDS location
range 36100 -> 37860
strand +
CDS
ATGGCTGAAAATTCAGGAACAATCGATTCCCTGCTGGTTTCGCTGGGGCTGGAGACTGACGCTAAATCATTCCAGAAAGGCGCTGATGCTATCAAAGGCGTCACCGATGGGATGATGCAGCTTGCCGCCGCTGCTGGTGTAGGTTTGGGTTTTAAGGCGTTAACTCAAGGTGTCGCATCGTCATATAGCGAGCTAAAGCGTCTCTCTGATATTACAGGGTTTACGATCCAGCAGATTCGAGGGCTTGAGTTTGCCATGCGCCGCATTGGTTCAGCCAATCCAGTAGCATCGGGGCAGAAATTGGCGCAACTCATCCCCGATATTGTTCGCCGGTTCGGCCAGGGGCAACTAAATCAAGATGCGTATTATAGTGATAAATTTAAGCCATCAGAATTAGTCAAAATTTCAACAACTCAGGGTAATGATGCAGCTACCGAATATTTCCTGAATGCCTATGGGTCGATGAATGCGACAGAAAGAGCCTACATGCGATCGGGTGTTGGTATTTCAGAAAACGATGACATAGCCCGACTAGGTGAATATGGCGGTAAGTTCTTTCGCGAAAGTATGAATATGTCAAATGCCATGACTCCAGAAATAGATCCTAAGTTGGCTAAAGTTGCTCAGGATTTTAATGATGAAATGGCTAAGCTGGCGCGTAATTTTGAAAACCTAGCTTATTCAATGGGATCGCAATTACTCCCTATCGTTAATAAATTTTTAGAACTGGTAAACGGGTTTATTCATGAAAACCCTGAAATAGCCAGCGCAATGATCGGCGCTGCTGGTGTTGCAGGAACGGTTGGCGCGTTAGGATTTGGTAAAAGAATTTTGGGGTTAGGTGGCGGTGGTGGCAGCGCAGCCGGAGGATCAGGGGGGATGTCATGGTTAAGTCGATTGCTCGTTAACCCTATAACAGCGGGGGCTGTTTCGGCATTCACACCTGGTAACTTTTTCACGTCGTCAGAAGATGCTCGGATGATGGAAAGCCCCGATGCATTTCTGCGTAAAAAATGGGAGCGTGAAAATCCCGGCGTGCCATATAACGGGCAGTACGGCGGCGTGACACCTTATCAGGCATACCTGAAGCAAAAGGCCGGCAATATTGGCGATGCGCTAAATAACCCAAACGCCCGATCATATCTTGATGCGATATCCCGCGCAGAGGGTACAAGCGGGTATATGAATTCTGGTTATCACACCATGTTTGGCGGCGGGCAGATTGCCAGCCTGGCCGATCACCCGCGCCAGTTAAAAGACTTCCAGCAGACAGACGGGACGTGGAATAAAACATCGGCGGCAGGTCGTTATCAGTTCACCCAAAAATCATGGGATGAGGCGGCTGCTGCGCTGGGGCTAAACGACTTCTCTCCGCAGAGCCAAGACATGGCCGCATTATGGCTTATTCAGCGGGCGGGGCAGTTGGATAACGTATTGAACGGCGATTTCATGACGGCAACGAATAATCTCGGCGGCGTGTGGGCGTCACTGCCATCTTCGCCTTATGCGCAGCCGAAACGCAGCCAGGCTGAAATGGAGAGTTACTACACGCCTGACTACGTTCAACGCCGCAGTCAGGCTCCGTATAACTCATCTGTCAATCGTTCCGAATCATCAAAACCTGTCAGCATTACTCTAAATAACACTCAAAACATTTCCGGATTAGGTCTTAATGAGCAGCAAGTACAGGACACGGTGGCGAGCGCATTAACTACGGCCGGAGAAAACTTAGAGCGCTCATTTAACAATAATCGGTGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0098015 virus tail cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0002536b63_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7hU0P) rather than this protein.
PDB ID
7hU0P
Method AlphaFoldv2
Resolution 75.49
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50