Protein
- Protein accession
- G9FHZ7 [UniProt]
- Representative
- 7AFQR
- Source
- UniProt (cluster: phalp2_3994)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MPSGANQGNAYVDVFPNMSGYFNRVRNAINSTPITHQIEVEASQASLTRLRDQIRDLLAGPPAFQISPELDSSRLDQQWRDWMARHRNESFDVQANVQVDNRALAALKKTFSGLGSAIGTVGKAGGLLGAAGLAIGAIGPAALAAGPALLGMASAMAVVKLGADGIKEAFSAITPQVDALKAQVGGVFKTELAPAVSQFGTALTNLTPQFKAVASSMAGSMSQVMNLFSSAGGQEAFTGIFDDMAKAISNATPGLVSFLETLRDLAASAGLDQLGTAFGSLFQSLSDAFAKIDAGQIVGIFSQILTAVGPLLGSLIEIFNTLGVTLGPIIAQLLTGLAPVISSLAGPLATAGKALGDGLLKIFEALAPVIPVVVGALADLFAMLAPIIAQLVSALAPVLAAIAPVLVQIGQVLADAWISAFEQIQPMLPELTTAFVQLVQALIPLLPPILELATTLLPRLLPLIIQLLTIFTDLVNILVPILVPILEKLTEVAGWLADMFAGKLSEALDKVHGLVTKLGEIVSAAPGWFNTAKEAVSQFATSAVEWIGNFLNTIQEIPGKIGNAFAGAGTWLKNAGRAIIDGLKAGISEAWSSLTSWFKDKLNSLVPGGGIVARILPGLANGGPVPARADGGGLVWGGRAAGQLSAPGGPRADRGVFLGSNDEFVVNARAMRGPWGGVVEAINSNRLPRFADGGSIGGGKVDTKKADWFKFLESIGYKPPLDEVTGARIGEQVTQGITSYKDAVSNIKGWADALINGGEIPGMEDAARSVLGLGNPDGGASTPPDYGTSTMGGSDNSGLTGGLSDLEGPGSIGSDIGAAVQSALSFAQGESGKAYQYGGVGNPSWDCSGFMSGIYATIRGLDPYTRWFTTEADFTSSSLGFVRGLGGSDGFSIGVHNGGGGQYSHMAGTLAGNSVESGANGVLVGGKALNASSDQFENRYYLPIVREALSGNVAAAKELANIEGAGAARWRDMAIAAMKRQGFDADDPAQVDAMVRQIESESGGDPLIVQQVQDVNSGGNEGVGLLQIIPNTYAAYRDPELVDDRRNGWSNMNAALRYYKDRYGLDLTQYWGKGTGYDNGGTINGMGLFGKWTTKPEEVLSPSMTQDFHDMLPFMSTVADSLRGRAAIAGYDDAAGGSGTTVNMRFGDVRTNSWEDAQTSINRDARRGMRSVLGGTAGGR
- Physico-chemical properties
- protein length: 1180 AA; molecular weight: 122371,5 Da; isoelectric point: 4,92; hydropathy: 0,12
Representative Protein Details
- Accession
- 7AFQR
- Protein name
- 7AFQR
- Sequence length
- 1863 AA
- Molecular weight
- 197527,62450 Da
- Isoelectric point
- 9,62392
- Sequence
-
MAEDTVYIPLAPSAKGFMATVVKEASGAARAGSAAMEKEFARGGRESGRSAARGVDDGLSRGNIGVNSVKARMAKLAAQMRGIGRTAGHQGALGINDGLNHIDYTRVGDEHGRSYGRGFVRGVRNGLVGIAATFGLVNAGVRGTVRHIGTIATATMWASRIMRGFATQVMAGAVAMQLLAGQGLAKLAGWLKTVAFLAGRLARDVARATAAVLVLSAAVRTLGRVMRVTRVIGMLTVGLAALIGLASTAAPALAALSAAIVTLGSAAGGIAIAGLSALGATIAGLKVGLMGVGDAFKQMGTSGAGSAAKVVDNTKDIARAERGLTKAVEAEKDAQEDVSKARDDARKKLRDLDLQLRGAALSERDAQLSLREARADLAKGGFETGTERERAVLAVQEAELRLAEVQRDNNDLAKDAASTRRKGVEGSDEVVAAQERLRDATEATRDAQEALADARQPKDTGASAAADKQAEAMAKLSTNARSFVESAMGVKPAWDAIQRGGQDTLFAGLAQRLPQLADTWLPRLGAAINTVNGGFNTGARSVVDWMNSAQGIPIVSSWLRTSSGMAAQAGTALGALAPGLASIAAGAGEAFAPMVAGATEGAKSLSNMLVQAQQSGRIKQYFTDAFNQVKTVIQNVTAVVGPLWAAFMRLGQISASGLAPGMRSVGAAITQATPGLVQMAERLMPALGQALTNLAPIIPGIVQAFSPWATILAVMAPHIATVMSHLGPMAPLLLTLAVTVKAITMAMTLYNAVMAVASVAQGVFFAATGRSTAGLQGNMIALAAHRVAMLAGAVASGIFAGALALATSPITWIIVAIGALVAGLVWFFTKTELGQKIWTTVWNSIKSAVQSVWEFLKPVFQWIGNAFGTVVGFIRDHWRLILPIIMGPLGLLISVVSKYWTQIKTAFSVAFQAIGAVVMWLWRNVVTPAFNGIKMVIGVAWNVIKFFFGLWVGLFRNVIGPVVMWLWNTVIGPAMRGIGGVIGWVWNTLIKPAWDSFRRSLDILGEAFKFLWNNVIKPTWDALGAGIRWVVDNIITPAWDALKSGLSAVGGFFDTIVTGIGNAWDKIKSFVAKPINFVLGTVWNKGLLPAWNTIAGFLPGLNPMKPVAEVAFKDGGPVPMGSGAKRGKDSVHALMMPDEHVWDVRDVRRAGGHGAMYRMRNMVDSGRPFTWTPGGLSPVSEGGPLPRFEKGGAVAAGQKLSPMPGEGGLQAIGQLMRRIIFKLWPKIKDIGGYRQDNFDEHPSGRALDVMVGSDKKLGDQVNAFAHANNPKFPLQHSIWQQAMWYPPKMRREPMGDRGSPTQNHMDHPHLWWKPQNVNPNVVPEGLVTDGFGGPSTAEMLNIVKKKISEIIDKALNPIKQGLTSIVGSPPPEWLGIPPKIFDITKTKAIETAFNLAAKLGDKLKGAYDAAKKVTSIVTNVVKQPFKAIGGLFRDQGGYLPKGLSLVRNETGKPEAVLNWDQLTTVKDMMEAFRAVFSGQSPEAASAAQQRISDEMTARHEQEIKGLKGRQLDEAQKRHDMERKALEDSTARIEGYRAGATAIRDTPLVAAESMAKDTADFFGFGKIFDTIAGLIPRPGDAASAGTAGGAGTSALSTTTTPSATDPVYGDGTTIEQGQTPSTTVMPDLNHEYDPKGGAEQWRPMAKEAMKRVGFDYNNTAQVDAMIKQIESESGGNPGIVQGVQDVNSGGNEAVGLLQIIPGTFATHRDPSLPDDRRNPMANMVASLRYYKSRYGMDLTTTWGHGHGYDSGGWLNPGLTMAVNKTLKPEAVLTAGQWASIDSMLESLPSAAEFKSVADLGAAAMRSSGRMPNEDEDAQSSSGHRDAPLVWVENQYTHDPDEAALKTGREVRRATRSEQLVGGWG
Other Proteins in cluster: phalp2_3994
Total (incl. this protein): 49; avg length: 1786,6 AA; avg pI: 8,58

| Protein ID | Length (AA) | pI |
|---|---|---|
| 7AFQR | 1863 | 9,62392 |
| 1Yf8R | 1914 | 6,00728 |
| 1dMtS | 1816 | 9,23988 |
| 4YHgR | 1815 | 8,29968 |
| 6FC4Z | 1687 | 9,35792 |
| 6Qpjo | 1390 | 9,26786 |
| 72ukV | 1691 | 9,43226 |
| 7Aebk | 1877 | 9,65545 |
| 7Am7j | 1875 | 9,58840 |
| 7AuK7 | 1883 | 9,75557 |
| 7dhSx | 1830 | 9,73597 |
| 7gRaR | 1915 | 6,15665 |
| 7jGYT | 1704 | 9,84853 |
| 7jHbo | 1847 | 9,69580 |
| 7mBce | 1937 | 6,59323 |
| 7pb0M | 1705 | 5,40109 |
| 7qGY7 | 1739 | 9,40357 |
| 7tGoh | 1680 | 9,50111 |
| 7tGpR | 1607 | 9,50588 |
| 7vaZ0 | 1717 | 5,68313 |
| 7vnjh | 1749 | 5,59559 |
| 7w0kW | 1919 | 6,11891 |
| 7w0kn | 1911 | 6,23679 |
| 7xjmf | 1793 | 5,94765 |
| 7yeUa | 1956 | 5,81545 |
| 7zSjR | 1875 | 9,59091 |
| 8Itva | 1704 | 9,69432 |
| 8Itvf | 1847 | 9,68961 |
| 8Itvi | 1847 | 9,66589 |
| 8Itvp | 1830 | 9,60490 |
| 8MGSJ | 1840 | 9,75918 |
| 8MHYs | 1872 | 9,68562 |
| 8MI2F | 1873 | 9,71108 |
| 8MIY4 | 1839 | 9,69264 |
| 8MJnK | 1966 | 6,26777 |
| 8MQ28 | 1829 | 9,74809 |
| 8MmBC | 1840 | 9,68375 |
| 8MpF3 | 1847 | 9,68420 |
| A0A7T0M0T0 | 1863 | 9,62392 |
| A0A2P1N2Q9 | 1830 | 9,60490 |
| A0A160DCU0 | 1829 | 9,74809 |
| A0A1B3AZ86 | 1863 | 9,62934 |
| A0A345L307 | 1824 | 9,75434 |
| A0A4Y6EFQ2 | 1829 | 9,61599 |
| A0A514TZV4 | 1830 | 9,62366 |
| A0A649V4K6 | 1828 | 9,60503 |
| A0A8T8IZ31 | 1158 | 5,02459 |
| A0AAE8XA19 | 1678 | 9,38758 |
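
The summary figures above (average length, average pI) can be recomputed from the table rows. The following is a minimal sketch, assuming the table is available as pipe-delimited text and that pI values use a decimal comma as shown on this page; the function names `parse_decimal_comma` and `parse_cluster_rows` are hypothetical helpers, not part of any PhaLP tooling. The sample uses only three rows from the table, so its averages are not the cluster-wide values.

```python
from statistics import mean


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '9,62392' to a float."""
    return float(text.strip().replace(",", "."))


def parse_cluster_rows(table: str) -> list[tuple[str, int, float]]:
    """Extract (protein ID, length, pI) from pipe-delimited table text."""
    rows = []
    for line in table.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip the header, the |---| separator, and blank or malformed lines.
        if len(cells) != 3 or not cells[1].isdigit():
            continue
        rows.append((cells[0], int(cells[1]), parse_decimal_comma(cells[2])))
    return rows


# Three rows copied from the table above, for illustration only.
sample = """
| Protein ID | Length (AA) | pI |
|---|---|---|
| 7AFQR | 1863 | 9,62392 |
| 1Yf8R | 1914 | 6,00728 |
| A0A8T8IZ31 | 1158 | 5,02459 |
"""

rows = parse_cluster_rows(sample)
mean_length = mean(length for _, length, _ in rows)
mean_pi = mean(pi for _, _, pi in rows)
print(f"{len(rows)} rows, mean length {mean_length:.1f} AA, mean pI {mean_pi:.2f}")
```

Note that the page reports a total of 49 including this protein, while the table lists 48 entries; whether the published averages include this protein (1180 AA, pI 4,92) is not stated here, so a full recomputation may need to try both.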
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_39181 (3bZcI) | 8 | 35,2% | 1237 | 6.551E-233 |
| 2 | phalp2_18324 (5inLv) | 5 | 27,2% | 1706 | 5.412E-194 |
| 3 | phalp2_3995 (7AGUw) | 47 | 32,8% | 1284 | 1.589E-186 |
| 4 | phalp2_33467 (7m6oK) | 34 | 25,4% | 1713 | 6.404E-106 |
| 5 | phalp2_12266 (72uge) | 6 | 23,5% | 1906 | 1.656E-103 |
| 6 | phalp2_24330 (3Pu9x) | 29 | 26,3% | 1216 | 1.943E-92 |
| 7 | phalp2_21847 (4rAYx) | 3 | 25,1% | 1298 | 4.898E-90 |
| 8 | phalp2_36905 (7yda0) | 31 | 23,9% | 1343 | 5.318E-82 |
| 9 | phalp2_36243 (7zfxz) | 7 | 24,8% | 1747 | 4.395E-80 |
| 10 | phalp2_9474 (HzvZ) | 51 | 24,2% | 1581 | 7.282E-71 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Nocardia phage NBR1 [NCBI] | 1109711 | No lineage information |
| Host | Nocardia brasiliensis [NCBI] | 37326 | Actinobacteria > Actinobacteria > Corynebacteriales > Nocardiaceae > Nocardia |
Coding sequence (CDS)
CDS Source ID
JN116828 [NCBI]
CDS location
range 13557 -> 17099
strand +
CDS
GTGCCAAGCGGAGCCAACCAGGGCAATGCCTACGTTGACGTATTCCCGAACATGTCGGGCTACTTCAACCGCGTGCGGAACGCGATCAACTCCACCCCGATCACGCACCAGATCGAAGTGGAGGCCAGCCAGGCGTCCCTGACCCGGCTCCGTGACCAGATCCGCGACCTACTGGCCGGGCCGCCCGCCTTCCAGATCAGCCCGGAGCTGGACAGCTCACGCCTGGATCAGCAGTGGCGCGACTGGATGGCCCGGCACCGGAACGAATCGTTCGACGTTCAGGCCAACGTCCAGGTGGACAACCGCGCGCTGGCCGCCCTGAAGAAGACCTTCAGCGGGCTGGGGTCGGCCATCGGCACCGTGGGCAAGGCGGGCGGTCTGCTGGGCGCGGCCGGGCTGGCCATCGGCGCGATCGGCCCCGCCGCGCTGGCGGCCGGTCCCGCGCTGCTGGGCATGGCGTCGGCCATGGCCGTGGTGAAGCTGGGGGCCGACGGCATCAAGGAAGCCTTTTCCGCGATCACCCCCCAGGTGGACGCGCTGAAGGCCCAGGTGGGCGGCGTCTTCAAGACGGAGTTGGCCCCGGCCGTCTCCCAGTTCGGCACCGCGCTGACCAACCTCACGCCCCAGTTCAAGGCCGTGGCCAGCTCCATGGCCGGGTCCATGTCCCAGGTCATGAACCTGTTCAGCTCCGCCGGTGGACAGGAAGCCTTCACGGGCATTTTCGATGACATGGCCAAGGCCATCAGCAACGCCACGCCGGGGCTGGTGTCGTTCCTCGAGACGCTTCGGGACCTGGCAGCGTCGGCCGGGCTGGACCAGCTGGGTACGGCCTTCGGGTCGCTGTTCCAGTCGCTATCCGACGCCTTCGCCAAGATCGACGCCGGGCAAATCGTGGGGATCTTCTCCCAGATCCTGACCGCCGTCGGCCCGTTGCTCGGTTCGCTCATCGAGATTTTCAATACCCTGGGCGTCACCCTGGGGCCGATCATTGCCCAGCTGCTGACCGGGCTGGCCCCGGTGATTTCCTCCCTGGCCGGACCGTTGGCCACGGCGGGCAAGGCGCTGGGTGACGGTTTGCTGAAGATTTTCGAGGCCCTAGCCCCGGTGATCCCCGTCGTGGTCGGCGCGCTGGCCGACCTGTTCGCCATGCTGGCCCCCATCATCGCGCAGCTGGTCAGCGCGCTAGCCCCCGTGCTGGCCGCCATCGCGCCGGTGCTGGTCCAGATTGGCCAGGTATTAGCTGACGCCTGGATTTCTGCCTTCGAGCAGATCCAGCCCATGCTTCCCGAACTGACCACGGCCTTCGTCCAGCTGGTCCAGGCCCTGATCCCGCTGCTTCCGCCCATCCTCGAGCTGGCCACGACGCTGCTTCCCCGGCTGCTCCCGCTCATCATCCAGTTGCTGACCATCTTCACCGACCTGGTGAACATCCTGGTGCCGATCCTGGTCCCCATCCTGGAGAAGCTGACGGAGGTGGCCGGGTGGCTGGCCGACATGTTCGCGGGCAAGCTCTCCGAAGCGCTCGATAAGGTCCACGGCCTGGTCACGAAGCTGGGCGAAATCGTGTCCGCCGCGCCGGGCTGGTTCAACACCGCCAAGGAAGCGGTGTCCCAGTTCGCCACCTCCGCCGTGGAGTGGATCGGCAACTTCCTGAACACCATCCAGGAAATTCCGGGCAAGATCGGAAACGCCTTCGCCGGTGCGGGCACCTGGCTGAAGAACGCCGGACGCGCCATCATCGACGGCCTGAAGGCCGGTATCTCCGAAGCCTGGTCCAGCCTGACGTCCTGGTTCAAGGACAAGCTGAACAGCCTGGTGCCCGGCGGCGGCATCGTCGCGCGCATCCTGCCCGGCCTGGCCAACGGTGGCCCCGTGCCCGCGCGCGCCGACGGCGGCGGCCTGGTCTGGGGCGGCCGGGCGGCCGGGCAGCTCTCCGCGCCGGGCGGCCCGCGCGCCGACCGTGGGGTGTTCCTGGGGTCGAACGACGAATTCGTGGTGAA
CGCGCGCGCGATGCGCGGTCCCTGGGGCGGCGTGGTGGAGGCCATCAACTCCAACCGGCTGCCCAGGTTCGCCGACGGCGGCTCCATCGGCGGCGGCAAGGTGGACACGAAGAAGGCCGACTGGTTCAAGTTCCTGGAGTCCATCGGCTACAAGCCGCCGCTGGATGAGGTCACCGGCGCGCGCATCGGTGAACAGGTCACCCAGGGCATCACCAGCTACAAGGACGCCGTGTCCAACATCAAGGGCTGGGCGGACGCGCTGATCAACGGCGGGGAAATCCCCGGCATGGAGGACGCCGCGCGGTCCGTGCTGGGCCTGGGCAACCCCGACGGGGGCGCGTCCACCCCGCCCGACTACGGCACCAGCACCATGGGCGGCAGTGACAACAGTGGGTTGACAGGCGGGCTGTCGGATCTGGAGGGGCCAGGGTCGATAGGCTCCGACATCGGCGCGGCCGTCCAGTCGGCACTGAGCTTCGCCCAGGGGGAGAGCGGGAAGGCGTACCAGTACGGCGGCGTAGGGAATCCGTCCTGGGATTGCTCCGGCTTCATGAGCGGCATCTACGCGACCATTCGCGGGCTGGACCCCTACACCCGGTGGTTCACCACGGAAGCCGACTTCACCAGCTCCAGCTTGGGCTTCGTGCGCGGGCTGGGCGGAAGCGACGGCTTCAGCATCGGCGTGCACAACGGCGGCGGCGGGCAGTACTCCCACATGGCCGGGACGCTGGCGGGCAACTCCGTGGAGTCCGGGGCCAACGGTGTGCTGGTCGGCGGCAAGGCGCTGAACGCTTCGTCCGACCAGTTCGAGAACCGCTATTACCTCCCGATCGTGCGCGAAGCGCTGTCCGGGAACGTGGCCGCCGCCAAGGAACTGGCGAACATCGAGGGAGCCGGGGCCGCGCGCTGGCGGGACATGGCCATTGCCGCCATGAAGCGCCAGGGCTTCGACGCCGACGACCCGGCCCAGGTGGACGCGATGGTCCGTCAGATCGAATCGGAATCCGGCGGTGACCCGCTCATCGTCCAGCAGGTCCAGGACGTGAACAGCGGCGGGAACGAAGGCGTCGGGCTGCTCCAGATCATCCCCAACACCTACGCGGCCTACCGTGACCCGGAGCTGGTGGACGACCGCCGGAACGGCTGGTCCAACATGAACGCCGCCCTTCGCTACTACAAGGACCGGTACGGCCTGGACCTCACCCAGTACTGGGGCAAGGGGACCGGGTACGACAACGGCGGCACGATCAACGGCATGGGCCTGTTCGGCAAGTGGACGACGAAGCCGGAAGAAGTGCTGTCCCCGTCCATGACCCAGGACTTCCACGACATGCTTCCGTTCATGTCCACGGTGGCCGACAGCTTGCGCGGCCGGGCGGCCATCGCGGGGTACGACGACGCCGCCGGGGGCAGCGGCACCACGGTGAACATGCGCTTCGGTGATGTTCGGACCAACAGCTGGGAGGACGCCCAGACGTCCATCAATCGGGACGCACGGCGCGGAATGCGCTCCGTGCTCGGAGGAACGGCGGGCGGACGATGA
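
The CDS location can be cross-checked against the reported protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, though this page does not state it) and that the final codon is a stop codon, which matches the `TGA` at the end of the sequence above. The function names are hypothetical.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range (assumed)."""
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS spanning start..end."""
    n = cds_length_nt(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    # The stop codon terminates translation and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = expected_protein_length(13557, 17099)
print(length)  # 1180
```

Under these assumptions the range spans 3543 nt, or 1181 codons including the stop, which agrees with the protein length of 1180 AA listed under the physico-chemical properties.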
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi00023eedda_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
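
The confidence bands in the legend can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch, not the viewer's own logic: the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption made here. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for s in (95.2, 80.0, 61.5, 30.0):
    print(s, plddt_band(s))
```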
The structures below correspond to the cluster representative (7AFQR) rather than this protein.