Protein

Protein accession
A8YQJ9 [UniProt]
Representative
7zt3x
Source
UniProt (cluster: phalp2_427)
Protein name
Putative tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAQQINATMSTKIALDLLSASESVKSLTAVVRSSQNAWKAQEAEMKSAGDAVGAAQAKYDGLGKSIEAQQSKIDALKAKQAELKGNTADVAQQFLKYQQQIDGATKQLASMQAQQDRAKQAMDYQKSGLAGLKQEYTAAARANQAYVTRLEAEGKQQEANKAKMEGYKSSIANLNEQLAKQSAELDKIASASGKDSDAWRTQKTRVDETATSLAKAKSSMTGLQAEMDKANPSVFNRVKEAISGTNKQAEKTPGLLRKIVEGGLITNAITNGWQALKGKIEEATAAGMQYEKQQDQMNAIWLTLTGNAEKGQAMVDMTNKLAVKFGQDTDLVNELNQQFYHVFDNQPKTEALTSAFLTMGDAIGLSSDRIQQVGLDFTHTLSGSIVQLGDFNQLTDAFPMMADAMLEYEKKVQHNSQLTMTDLRAQMSAGKISAQDATNVIEELGQKYSKASENLMQTIPGMTRVIKSRIPALIGDIEKPFLTAQNPVLGAISRWSQDKNTDKEFGKLGQAMSKGLSTITTAFAKAFDLSAGPKAMDKMMDKLSDAVTKVSQSIASHAGQIKTFFTTISTTGSAMAKISWVALTSGLKVLNPLLKSLGDFADKHPKLFGDLAAGFIALNLAGKALPIAAIAGFTKTIGGMYSGLKNLANSKFADMIKTNLSKLNSSALGKGMATITIAYDAISDIKDLTKAFSKGGTVGQKFSAVGESAGTLIGGGIGAFFGGPLGAAVGATIGKTAGKWAGDAAKKFTDGWNAKKKPANSWLGGLGWDARQMTNNVVKWWDGINKSTDAAQKKQHKQQEAANKRARKEWNDFWKGVGKGWNGFLKTVNGWGKSLSTAWSKVWNPISKTMSSIWKNIVKIAKAGLDILKKVIVYPAAFIAGLFILAWRKVEKPFKDVWNGLVKFVKPPLNTISKNISSTTKGVQNAWNKTWGAISKFFSNTWNSIVKIVSSSTNWIVKNVTNFLNAVKKVWGSIWKAISNFFKDIWNDIVKIYNNVSNTLSKGISVTLKFIQNVWNTAWGAISGFFGNIWNGMVKFFTPIIHGMSNTIGSVIKSIKNVWTDVWGGVGSFFSGIWDGIKKAAESGINFVVRVIRTGLSAVNGVLGFFGVKKVGLPSYVHFAQGGEVGKDGTQLAMVNDDGSEHYKELIHKKRTNQWIYAEKRNAILPLETGDRVYNGRESKAIANMYGIPGFAQGGIIGSVWDGVKDASSWVVDKAEDVGKWIGDKFEAIVDWIAHPVKHVTDLISSSIKGIVSSSPVKAFGDLGVGIFKHAYNGIGNWIKKELKKIEDSMANPGGSGVQRWKPYVIQALKANGFDASAYQVAAWMRVIQRESNGNPRAINLWDSNAKAGIPSMGLVQTIGPTFNAFKFPGHNDVYNGYDDLLAGIHYMKAIYGSGSSAFARVSGPEGYANGGLIKRPIHALVGEDGPETILPLTKTSRAWQLLGQAVTNINHNLGNGSVAESESSSTDDLGKKLDNIADLLTKLSFVLQVGDDQFYPKVAPKVKQYNDRTDRFNAYWKGGTV
Physico-chemical properties
protein length: 1522 AA
molecular weight: 164719,2 Da
isoelectric point: 9,61
hydropathy: -0,25
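The hydropathy figure is presumably a GRAVY score, that is, the mean Kyte-Doolittle hydropathy over all residues; the page does not name the scale, so this is an assumption. A minimal sketch of that calculation, with `gravy` as an illustrative helper name and a short N-terminal fragment of the sequence above as input:

```python
# Kyte-Doolittle hydropathy index per residue (assumed scale; the page does not state it).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def gravy(sequence: str) -> float:
    """Mean hydropathy of a protein sequence (hypothetical helper)."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)

# First 20 residues of the protein sequence above.
fragment = "MAQQINATMSTKIALDLLSA"
print(f"GRAVY of fragment: {gravy(fragment):.2f}")
```

Applied to the full 1522-residue sequence, a negative value such as the reported -0,25 would indicate a mildly hydrophilic protein overall.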
Representative Protein Details
Accession
7zt3x
Protein name
7zt3x
Sequence length
587 AA
Molecular weight
64227,23220 Da
Isoelectric point
9,40550
Sequence
KFFSNTWNSIVKIVSSSTNWIVKNVTNFLNAVQKVWGSIWKAISNFFKDIWNDIVKIYNNVSNTLSKGISVTLKFIQNVWNTAWGAISGFFGNIWNGMVKFFTPIIHGMSNTIGSVINTIKHVWKDVWGDVGSFFGGIWNGIKKAAESGINFVVRVIRTGLSAVNGVLGFFGVKKVGLPSYVHFAQGGEVGKDGTQLAMVNDDGSEHYKELIHKKRTNQWIYAEKRNAILPLETGDRVYNGRESKAIANMYGIPGFAQGGIIGSVWDGVKDASSWVVDKAEDVGKWIGDKFEAIVDWIAHPVKHVTDLISSSIKGIVSSSPVKAFGDLGVGIFKHAYNGIGDWIKKELKKIEDSMANPGGSGVQRWKPYVIQALKANGFDASAYQVAAWMRVIQRESNGNPRAINLWDSNAKAGIPSMGLVQTIGPTFNAFKFPGHNDVYNGYDDLLAGIHYMKSIYGSGSSAFARVSGPEGYANGGLITRPIHALVGEDGPETILPLTKTSRAWQLLGQAVTNINHNLGNGSVAESESSSTDDLGKKLDNIADLLTKLSFVLQVGDDQFYPKVAPKVKQYNNRTDRFNAYWKGGAV
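The representative is a shorter homolog rather than an identical copy of this protein: its N-terminus matches an internal stretch of the 1522 AA sequence, with occasional substitutions. A minimal sketch that compares the first 40 residues of the representative with the matching stretch of this protein, both copied from the sequences on this page; `ungapped_identity` is an illustrative helper, not part of any PhaLP tooling:

```python
def ungapped_identity(a: str, b: str):
    """Percent identity and 1-based mismatches for two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    mismatches = [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]
    identity = 100.0 * (len(a) - len(mismatches)) / len(a)
    return identity, mismatches

# Internal stretch of A8YQJ9 (this protein).
query = "KFFSNTWNSIVKIVSSSTNWIVKNVTNFLNAVKKVWGSIW"
# First 40 residues of the representative 7zt3x.
rep = "KFFSNTWNSIVKIVSSSTNWIVKNVTNFLNAVQKVWGSIW"

identity, mismatches = ungapped_identity(query, rep)
print(identity, mismatches)  # 97.5 [(33, 'K', 'Q')]
```

This window is only an illustration; a figure for the whole pair would need a proper alignment over the full sequences.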
Other Proteins in cluster: phalp2_427
Total (incl. this protein): 4    Avg length: 938,8    Avg pI: 9,22

Protein ID   Length (AA)   pI
7zt3x        587           9,40550
Q7Y4B2       215           8,09486
Q3L0S4       1431          9,77078
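The summary line can be reproduced from the member table once this protein (1522 AA, pI 9,61, from the properties above) is added to the three listed entries. A small sketch, assuming the averages are plain arithmetic means and that values use a decimal comma; `parse_decimal` is an illustrative helper:

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '9,40550' to a float."""
    return float(text.replace(",", "."))

# (protein ID, length in AA, pI as printed on the page)
members = [
    ("A8YQJ9", 1522, "9,61"),      # this protein
    ("7zt3x", 587, "9,40550"),
    ("Q7Y4B2", 215, "8,09486"),
    ("Q3L0S4", 1431, "9,77078"),
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

print(f"Avg length: {avg_length:.2f}")  # 938.75, reported as 938,8
print(f"Avg pI: {avg_pi:.2f}")          # 9.22
```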
Similar Clusters (pHMM search)
#    Cluster                # Members   Identity (%)   Alignment Length   E-value
1    phalp2_18933 / 28wdb   2           55,4%          539                2.157E-165
2    phalp2_28873 / 5i1Sw   11          34,0%          438                3.775E-49
3    phalp2_30529 / 5tUfk   11          26,6%          577                8.426E-46
4    phalp2_6598 / 1jOdc    3           33,9%          380                1.271E-43
5    phalp2_35192 / 6deMH   4           32,0%          430                1.271E-43
6    phalp2_39661 / 6T6Py   9           30,1%          578                1.164E-38
7    phalp2_27884 / 8L8Ka   36          29,7%          534                5.191E-36
8    phalp2_36879 / 7rXBb   1           29,2%          588                5.251E-35
9    phalp2_33299 / 69UrK   2           27,5%          458                9.401E-34
10   phalp2_32810 / 2G5Ou   3           29,5%          544                5.286E-33
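For downstream filtering it can help to turn the hit list into typed records. A minimal sketch, assuming each hit's statistics are available as one whitespace-separated string (members, identity, alignment length, E-value); the `Hit` class, the `parse_hit` function and the 30% cutoff are hypothetical, and only the first three hits are included:

```python
from dataclasses import dataclass

@dataclass
class Hit:
    cluster: str
    members: int
    identity: float      # percent
    aln_length: int
    evalue: float

def parse_hit(cluster: str, stats: str) -> Hit:
    """Parse a stats string such as '2 55,4% 539 2.157E-165'."""
    members, identity, aln_length, evalue = stats.split()
    return Hit(
        cluster=cluster,
        members=int(members),
        identity=float(identity.rstrip("%").replace(",", ".")),
        aln_length=int(aln_length),
        evalue=float(evalue),
    )

hits = [
    parse_hit("phalp2_18933", "2 55,4% 539 2.157E-165"),
    parse_hit("phalp2_28873", "11 34,0% 438 3.775E-49"),
    parse_hit("phalp2_30529", "11 26,6% 577 8.426E-46"),
]

# Keep hits at or above an arbitrary, illustrative identity cutoff of 30%.
strong = [h.cluster for h in hits if h.identity >= 30.0]
print(strong)  # ['phalp2_18933', 'phalp2_28873']
```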

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

        Name                               Taxonomy ID   Lineage
Phage   Lactobacillus phage Lc-Nu [NCBI]   146269        Lacnuvirus > Lacnuvirus LcNu
Host    Lactobacillus rhamnosus [NCBI]     47715         Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > Lactobacillus >

Coding sequence (CDS)

CDS Source ID
CDS Source
AY131267 [NCBI]
CDS location
range 8501 -> 13069
strand +
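These coordinates are consistent with the reported protein length, assuming they are 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the final codon is a stop codon:

```python
start, end = 8501, 13069          # CDS range from the record

cds_length = end - start + 1      # inclusive coordinates -> 4569 nt
codons, remainder = divmod(cds_length, 3)
protein_length = codons - 1       # drop the terminal stop codon

print(cds_length, codons, remainder, protein_length)  # 4569 1523 0 1522
```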
CDS
GTGGCACAACAAATTAACGCAACAATGAGCACCAAGATTGCCCTTGATCTATTGTCGGCAAGTGAATCCGTCAAGTCATTAACAGCAGTTGTTCGTTCAAGTCAAAATGCTTGGAAGGCCCAAGAAGCCGAAATGAAATCTGCTGGCGATGCAGTTGGCGCTGCTCAAGCTAAATATGATGGCTTGGGTAAGTCTATTGAAGCACAGCAGTCTAAGATTGACGCTCTCAAAGCCAAACAAGCTGAGTTGAAGGGTAATACTGCCGATGTTGCTCAACAGTTTTTAAAGTATCAGCAACAAATCGATGGTGCCACTAAGCAGCTTGCCAGTATGCAAGCTCAGCAAGACCGTGCTAAACAGGCCATGGACTATCAAAAGTCTGGGTTGGCAGGCTTAAAGCAAGAGTACACAGCAGCTGCACGTGCCAACCAAGCTTATGTGACTCGCTTAGAGGCTGAGGGAAAACAACAAGAAGCCAACAAGGCCAAAATGGAAGGCTATAAGTCATCCATTGCCAATCTGAATGAGCAGCTGGCTAAACAGTCTGCTGAGTTGGACAAGATTGCCAGTGCTAGTGGCAAGGATTCAGACGCATGGCGTACACAGAAGACGCGTGTTGATGAAACGGCTACCAGTTTAGCAAAAGCCAAGTCTTCTATGACCGGTTTGCAAGCTGAAATGGACAAGGCTAATCCGTCCGTTTTCAACAGAGTTAAGGAAGCTATATCGGGAACAAATAAGCAAGCCGAAAAGACACCGGGTCTGCTTCGCAAAATTGTTGAGGGCGGCTTAATCACCAATGCCATTACAAACGGATGGCAAGCGCTAAAAGGAAAAATTGAAGAAGCAACTGCTGCTGGTATGCAATACGAAAAGCAGCAAGATCAAATGAATGCCATTTGGCTAACCTTAACTGGTAATGCTGAAAAAGGGCAAGCAATGGTTGACATGACTAACAAACTTGCCGTCAAGTTTGGTCAAGATACTGATTTGGTAAACGAACTAAACCAGCAGTTTTATCATGTCTTTGATAACCAACCCAAAACCGAAGCTTTAACTTCCGCCTTTTTGACAATGGGCGATGCGATTGGATTATCCAGTGACCGTATTCAACAGGTCGGCCTTGACTTCACACATACACTATCTGGATCTATTGTCCAGCTTGGCGACTTCAATCAGTTGACTGATGCGTTTCCTATGATGGCCGATGCAATGCTGGAATATGAGAAGAAAGTTCAACATAATTCTCAGCTAACTATGACTGATCTTCGTGCACAGATGAGTGCAGGCAAGATTAGTGCTCAAGACGCCACTAACGTTATCGAAGAATTAGGGCAGAAATATAGCAAAGCGTCAGAAAACTTGATGCAAACTATCCCCGGTATGACTCGGGTTATCAAGTCTCGCATCCCAGCGCTAATTGGCGACATAGAAAAGCCTTTTTTAACGGCTCAAAACCCAGTGCTTGGGGCTATCAGTCGATGGTCACAGGACAAAAATACCGATAAAGAGTTTGGCAAATTAGGTCAAGCAATGAGCAAAGGCCTAAGCACAATCACAACTGCGTTTGCAAAAGCGTTTGATCTATCCGCTGGTCCTAAAGCAATGGACAAAATGATGGACAAGCTTTCGGATGCTGTTACAAAAGTTAGTCAATCAATTGCTAGCCACGCTGGTCAAATCAAGACATTCTTCACTACGATTTCAACCACCGGATCAGCTATGGCTAAAATATCATGGGTTGCGCTTACGTCAGGTCTAAAGGTGCTCAACCCATTGTTAAAGTCTTTGGGAGATTTTGCGGATAAGCACCCTAAACTTTTTGGCGATTTAGCAGCCGGATTTATTGCTCTGAATTTGGCCGGTAAAGCGTTGCCGATTGCCGCAATTGCTGGATTCACTAAAACCATCGGAGGTATGTATTCTGGCCTTAAAAATTTAGCCAACTCAAAATTCGCTGACATGATAAAAACGAATTTAAGTAAGCTTAATAGCAGTGC
ACTTGGCAAAGGCATGGCGACTATCACAATTGCCTATGATGCCATTTCAGATATTAAGGATTTAACAAAGGCCTTTTCTAAAGGTGGCACTGTGGGCCAAAAGTTTTCTGCTGTTGGTGAATCTGCCGGGACACTGATTGGCGGCGGTATCGGTGCTTTCTTTGGTGGCCCATTAGGCGCAGCGGTTGGTGCAACAATTGGCAAAACAGCTGGTAAATGGGCTGGTGATGCCGCCAAGAAGTTTACTGATGGCTGGAATGCTAAGAAAAAGCCAGCTAATAGTTGGCTAGGTGGTCTTGGCTGGGATGCTCGTCAAATGACTAACAATGTAGTCAAATGGTGGGATGGCATTAACAAGTCCACTGATGCAGCTCAAAAAAAGCAGCATAAACAGCAAGAAGCAGCTAATAAGCGGGCACGAAAAGAATGGAATGATTTTTGGAAGGGTGTCGGTAAAGGTTGGAATGGTTTTCTAAAGACCGTTAATGGTTGGGGAAAGTCACTTTCAACAGCGTGGTCTAAAGTTTGGAATCCAATTAGCAAAACAATGTCTTCTATCTGGAAGAATATTGTGAAAATTGCAAAAGCAGGACTAGATATTCTTAAAAAGGTAATTGTATATCCTGCGGCATTTATTGCTGGGCTTTTCATACTTGCATGGAGAAAAGTAGAGAAGCCCTTTAAGGATGTCTGGAATGGCTTAGTAAAGTTTGTTAAGCCCCCATTAAACACAATCAGCAAAAACATTAGCAGCACAACAAAAGGAGTTCAAAACGCATGGAATAAAACTTGGGGAGCTATTTCTAAGTTCTTTTCCAATACGTGGAATTCAATCGTCAAGATTGTATCTTCTTCTACCAATTGGATAGTCAAGAATGTCACGAATTTCTTAAATGCAGTTAAAAAAGTATGGGGTAGTATTTGGAAAGCAATTTCTAATTTCTTCAAAGATATTTGGAATGACATCGTTAAGATTTATAACAATGTATCAAACACCCTTTCAAAGGGCATCAGCGTAACTTTAAAGTTTATTCAAAATGTCTGGAATACAGCATGGGGTGCCATCTCTGGTTTCTTTGGAAACATTTGGAACGGCATGGTTAAATTCTTCACGCCAATCATTCATGGCATGTCTAACACAATTGGCAGTGTCATTAAGAGTATTAAGAATGTTTGGACGGATGTATGGGGTGGCGTTGGTAGCTTCTTCAGCGGCATCTGGGATGGTATTAAAAAGGCAGCGGAAAGCGGCATCAACTTTGTGGTCAGAGTTATTCGTACTGGCTTGTCGGCAGTCAATGGTGTTCTAGGCTTCTTCGGTGTTAAAAAAGTTGGTCTGCCATCATACGTACACTTTGCCCAGGGCGGCGAAGTTGGTAAAGATGGTACGCAATTGGCTATGGTAAACGATGACGGTAGCGAGCATTACAAAGAATTGATCCACAAGAAGCGCACAAATCAGTGGATATATGCTGAAAAGCGCAACGCTATTCTTCCACTTGAGACTGGCGACCGTGTTTATAACGGGCGGGAAAGCAAGGCCATCGCTAACATGTATGGCATTCCCGGCTTTGCACAAGGCGGCATCATCGGCAGTGTGTGGGATGGCGTTAAAGACGCTAGTTCGTGGGTAGTCGATAAGGCTGAAGATGTTGGCAAATGGATCGGTGATAAGTTCGAAGCAATTGTAGATTGGATCGCTCACCCGGTTAAGCATGTAACTGATCTAATCAGTAGCAGTATAAAAGGGATTGTTAGCTCATCTCCAGTAAAAGCATTTGGAGACTTAGGAGTTGGCATTTTCAAACATGCATATAACGGAATTGGCAATTGGATCAAAAAAGAGCTTAAAAAAATAGAAGATTCCATGGCCAATCCCGGTGGCTCAGGCGTGCAACGTTGGAAGCCATATGTTATTCAAGCTTTAAAGGCTAATGGATTTGATGCCTCGGCATACCAAGTTGCTGCATGGATGCGAGTTATCCAGCGTGAATCAAATG
GTAATCCTAGGGCAATTAACTTGTGGGATAGCAACGCTAAAGCCGGCATACCTTCAATGGGGCTTGTACAAACCATTGGGCCAACGTTCAATGCGTTTAAGTTCCCCGGCCACAACGATGTTTATAACGGCTATGACGATCTGCTTGCTGGTATTCACTACATGAAGGCTATCTACGGCTCTGGAAGTTCTGCTTTTGCTCGTGTCAGTGGCCCTGAAGGCTACGCTAATGGTGGCTTGATTAAACGGCCAATCCACGCGCTTGTTGGCGAAGATGGCCCAGAAACAATCTTGCCGTTAACTAAAACAAGCCGCGCTTGGCAACTACTAGGTCAGGCTGTTACCAACATCAATCACAACTTGGGTAATGGTTCCGTTGCTGAAAGTGAAAGTAGCAGTACCGATGATTTAGGAAAGAAGTTGGACAATATTGCTGATCTTCTCACGAAACTTAGCTTTGTTCTGCAAGTTGGTGACGACCAGTTTTATCCAAAAGTTGCGCCAAAAGTTAAGCAGTACAACGACAGAACAGACAGGTTCAATGCTTATTGGAAAGGAGGAACCGTTTAA
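The CDS starts with GTG rather than ATG, yet the protein starts with M: in bacteria and phages GTG can serve as an alternative start codon and is then read as methionine. A minimal sketch that translates the first and last codons of the CDS above using the standard codon assignments (which NCBI translation table 11 shares apart from its start codons); the `translate` helper and its `is_start` flag are illustrative:

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons enumerated in TCAG order with the third base varying fastest.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna: str, is_start: bool = False) -> str:
    """Translate an in-frame DNA string; optionally read the first codon as Met."""
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if is_start and codons and codons[0] in {"ATG", "GTG", "TTG"}:
        protein = "M" + protein[1:]
    return protein

five_prime = "GTGGCACAACAAATTAACGCAACAATGAGC"   # first 30 nt of the CDS
three_prime = "GGAGGAACCGTTTAA"                 # last 15 nt of the CDS

print(translate(five_prime, is_start=True))  # MAQQINATMS
print(translate(three_prime))                # GGTV*
```

Both results agree with the ends of the protein sequence listed at the top of the record (MAQQINATMS... and ...GGTV).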

Gene Ontology

GO ID        Description           Category             Evidence (source)
GO:0098003   viral tail assembly   biological process   None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00005c9414_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
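The legend maps per-residue pLDDT scores to four bands. It uses strict inequalities, so the handling of scores that fall exactly on a boundary (90, 70, 50) is an assumption in the sketch below, as is the helper name `plddt_band`:

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence bands of the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:      # assumption: exactly 90 falls in "High"
        return "High"
    if plddt > 50:      # assumption: exactly 70 falls in "Low"
        return "Low"
    return "Very low"   # assumption: exactly 50 falls in "Very low"

for score in (95.2, 82.0, 61.5, 43.0):
    print(score, plddt_band(score))
```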

The structures below correspond to the cluster representative (7zt3x) rather than this protein.
PDB ID
7zt3x
Method AlphaFoldv2
Resolution 53.78
Chain position -