Protein
- Protein accession
- A0A6G5YCE9 [UniProt]
- Representative
- 4SCHa
- Source
- UniProt (cluster: phalp2_16240)
- Protein name
- NlpC/P60 domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MAELQHDMAEQVSLNVDAAVKSFKTLTSAVKANTAEWQANAAAAKRNGESQKSQQIKIDGLNKSIELQKAKLDDLKKQQAAIDTSTEKGTKQYYDLTAQITKTNVQIDKQTEQLNKAKSGMSYYTTGLADLQKSYRQATQLSQSYITRLEAEGKTAEANRAKLDSYSNTVEHYTKQLQIQQTELSKVAAASGKNSDAYREQEIRVNKTATSLANAKKQMDDLSSSLAKPKPVGFLDKIKAQLHGTEDAAKETKTSITDIVKGSAIGTGLSNIVGSLSSTLIDAAKQGFNLAKAGKEISENWEHIGVNEEGVKELTGQIGEIRGVSDASGAAVTKLQTSIYGLTSGNIKETKALTNELYAFGKAGGASEDQIVQIGGKLTRIFSAAKVNLSSFNKTFATMPGLKTAIQKASGMTKDAFNDALANGKISGVQMKKYMLDAAKGSGEAWAKFGETTEGKIAKTKGTWTNFTAAVMKPLANTALDGLSKGLDKIIGKNGQLNATGQHIQSIAGALSQNVGKGLITAIDFIAKHTTAVKAMGVAFATYFAVSKFTKMATTMITFVTGIQKVITAVREWTVVQKLLDVVMAANPIGIAVAAVAALAAGFVLLYKNCKPFRNYINGLGVQIKKAFSGLPSVIKNATKLFTKLYTSVKSTFNRLIKSIKSAWNSITKGFNNFKKSFKKNWDKFWDTIHDFFKKSWKDILSVFKDWAKDIDAGLKSFSKNFKKGWNSLWDGVGSIFTKAWKSIKNLGKNAMNGLIDIVNGGINAIDSVIHAFGGSKQTIKLLSHVKLASGTDSILKSLSNPITKPVIATLNDGNDSPATGNREMLVDDAGNAGIVQGRNTQMLLTPGMHVINARETAMFTSFLSALGHKRYAKGTDSILGQIGDAVNGAVSGIGNWISKTANNLKKYFDLAVKIVSHPVKYVEGLFKWTNPKNVNGAMQDLAHGAFDHAQDAAKDWWSALWQMAGGSLNGTSSALLKAVEKYGEGHRYVWGASGPTTFDCSGLVLYALKKGFGISYPHYSGAQYEQTQHISKSQAHSGDLVFWGKGGSEHVGVYAGGNKYFSAQSPSQGIHMNTLDSVVGYGAPKFGRVQGLKQDTDTAKATTGLQKYIKNQVGNGFFSFISKLGSMFGVQDGGGQPSGSHKNWLAEAGFKPSDFGYITYIVDHESGWNPKATNPSSGAYGLPQSLPGNKMAAAGSDWRTNPITQLRWMKSYVNSAYGGARQAYQFWLKNHAYANGGIVSTSGIYQLAEQNMPEMIIPLDTSKHNRATQLLDQTTKIVKGNDGIKSEIQSEKISNMLNTIIALLGTIANSDKADKIIELLKLEVQNPIKVDTKVNLDGKTLARQLEKYQVRRQQGGQAGYAF
- Physico‐chemical
properties -
protein length: 1363 AA molecular weight: 147046,5 Da isoelectric point: 9,61 hydropathy: -0,32
Representative Protein Details
- Accession
- 4SCHa
- Protein name
- 4SCHa
- Sequence length
- 1216 AA
- Molecular weight
- 132960,79530 Da
- Isoelectric point
- 10,26641
- Sequence
-
MAANIPMGSMSTEIKLNGSQSVKTLRELKQAVTQATSAWKAQRAELSTIGESTKAAEAKYKGLAETIKRQKDYISGLREAQKHLQEAQKSVDRSTREGRQEYGKYNEALQKNETRVHSAEQRLAGLANQQSKAHKSLDYYKSGLADVQKQLKSSESVTRSYVNRLQSEGKSYEAAKAKLAGYRSSLDNLNKQQKIQEAELSKIASTAGKSSDAFKRQQVRVNETATSINKTKSNMSSLNDTIKKTNPSVFDRLKDKLHGTSNEAKDTSHNIMDIAKGSAIGNMVSNGFSSLGSALWSAAKNGFKLDEAGEELKKRWSDLGLPKREVKGLMDQIGKIRGASNASGASINQLQRSLYNLTNGNVGHVKALTNELFAFGKQAGLSDKEIAGMGPKLTRVFSQSKVRLSAFNRAFGQMPGLRNAIIKASGMSKKAFNNALANGKISGTKLQELMIKASSKSGKAWERFGSTTKGQLARAQGQYTNMTAMFMKPIEFVSIKALNGVMAQLVNKRGGLTKTGSAIQGIVKNLSKNIGKGITNTIDFIVHHIQGIKTFGKVLAGAFAAKKIYDFTGGIQKGIGDVADSIDAFKKLAKSTSIADAAQKLLNLDEKANPVGLIAVAVVALGVAFYELYKHCKPFRNFVNSIGKQAVKVFNGVVKFFKNNWKEIGELIVNPIQGGFDLLYKNNKGFRKWVNGLVKGFKNAWKGVGKWFGNIGKGIQKSWKGMTKWFSKLGKNMSKGIKSSWKGVTKWFSKIGKNIQKSWKGMTKWFKTLGKNMAKGLKSAWKSMVKWFSGIAKNIKKAWHAMTSWFSKLGHGMASGLKSAWKGVTKWFSNIGKGIRNAWRSMTGWFTRLGRNMSKGLKATWHSITSWFGNIVDGIKNAWSGVTSFFGKIGSNSVRVFKSAWHGITSWFKNIVDGIKNAWDGFWDKVSGPIKFIGKVFSGKAKIGKIHFAEGTDWRKHYGVPAVVNDAPGSKYREGLLTNGQIIPFPDKRNLPFWLLPGQDIVNGDDMAKIFGSAIHYADGTVTPATNLSSTPAGLSNQSGLGLLINITDDILQAITGETVSNFTPTKTAIPGINSIRSTPIATPLSKSPINNKNSDKAGKLGKSTNTKDKADIKDLKEQTAEMQKAVVVSKQFVKSIASVEKQVKALYATLKKNPFGKYISSQATKAVKALKGKGNFAKVIKSMNSKMSKDIKKTNSANLKNIKRFSSSMIKTFKS
Other Proteins in cluster: phalp2_16240
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_39680
eQI
|
275 | 29,2% | 1233 | 3.342E-165 |
| 2 |
phalp2_10036
7fPkw
|
316 | 31,3% | 1140 | 6.129E-140 |
| 3 |
phalp2_3948
7g15J
|
9 | 28,0% | 1170 | 3.401E-134 |
| 4 |
phalp2_1109
7vQjn
|
20 | 25,7% | 991 | 4.468E-83 |
| 5 |
phalp2_23533
7euEk
|
5 | 26,0% | 900 | 8.438E-76 |
| 6 |
phalp2_12070
5hYmz
|
8 | 24,2% | 1033 | 1.229E-71 |
| 7 |
phalp2_27797
7wN5Z
|
2 | 22,3% | 1133 | 7.805E-57 |
| 8 |
phalp2_37988
5tToF
|
60 | 23,8% | 842 | 4.433E-47 |
| 9 |
phalp2_12320
7vQnK
|
16 | 19,7% | 1221 | 3.393E-44 |
| 10 |
phalp2_36980
8KjJ4
|
4 | 20,7% | 1155 | 1.352E-43 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Caudoviricetes sp [NCBI] |
2832643 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MN855847
[NCBI]
CDS location
range 1911 -> 6002
strand +
strand +
CDS
ATGGCTGAGTTGCAACACGATATGGCTGAGCAAGTTTCATTGAACGTAGATGCTGCAGTTAAAAGTTTTAAAACACTCACTAGTGCGGTTAAGGCTAATACTGCTGAATGGCAAGCTAATGCTGCTGCGGCTAAACGAAACGGCGAAAGCCAAAAGTCTCAGCAGATTAAGATTGACGGGTTGAACAAATCAATAGAGTTGCAAAAAGCCAAATTAGACGATTTAAAAAAACAGCAAGCAGCTATTGACACGTCTACAGAAAAAGGGACTAAGCAATACTATGATTTAACAGCTCAAATTACTAAAACGAATGTCCAAATTGATAAGCAAACTGAGCAGCTCAACAAAGCAAAGTCTGGGATGTCCTATTACACGACTGGTCTCGCTGACTTACAAAAAAGCTATAGACAGGCTACTCAGTTGTCGCAGTCGTACATTACTAGGTTAGAAGCTGAGGGTAAAACAGCGGAAGCTAATCGAGCTAAACTAGATAGCTATAGTAATACTGTTGAACATTACACTAAGCAATTACAAATACAGCAGACAGAATTGTCTAAAGTTGCAGCAGCAAGCGGTAAAAATTCCGATGCATATCGGGAACAGGAAATACGTGTTAACAAAACCGCCACCAGCTTGGCAAATGCTAAAAAACAGATGGATGATTTATCGTCCAGTCTAGCAAAGCCTAAACCAGTTGGTTTTTTAGACAAAATTAAAGCACAGTTACATGGAACCGAAGATGCAGCCAAGGAAACTAAGACAAGTATCACAGATATTGTTAAGGGGTCTGCCATCGGAACAGGTCTATCTAATATTGTTGGTAGTCTAAGCTCAACTCTAATTGATGCAGCTAAACAGGGTTTTAACTTAGCTAAAGCAGGCAAAGAAATTAGCGAAAACTGGGAGCATATTGGCGTTAATGAAGAGGGCGTTAAAGAGCTAACAGGTCAGATTGGTGAAATTCGTGGTGTTAGTGATGCCAGTGGAGCTGCTGTTACTAAATTACAGACTAGTATATATGGCTTGACTAGTGGAAATATTAAAGAAACTAAGGCCTTGACAAATGAGCTGTATGCTTTTGGTAAAGCTGGTGGAGCCAGCGAAGATCAAATTGTGCAGATTGGCGGTAAGTTAACTCGTATTTTCTCAGCTGCAAAAGTTAATCTATCTAGTTTTAACAAAACGTTTGCAACTATGCCTGGTCTTAAAACTGCTATCCAAAAAGCTAGTGGTATGACCAAGGACGCCTTTAATGATGCATTAGCTAACGGCAAAATTAGTGGCGTTCAGATGAAAAAGTACATGCTCGATGCGGCAAAGGGTTCTGGTGAAGCGTGGGCAAAATTCGGAGAAACAACTGAAGGCAAGATTGCTAAGACTAAAGGTACCTGGACTAACTTTACTGCAGCCGTAATGAAACCGTTAGCTAATACAGCTCTAGATGGTTTAAGCAAAGGGTTGGACAAAATTATTGGCAAAAATGGCCAGTTAAATGCTACAGGTCAGCATATCCAGTCAATTGCTGGTGCCTTATCGCAAAATGTTGGTAAAGGACTAATAACAGCTATCGACTTTATCGCTAAGCACACAACTGCTGTCAAAGCAATGGGGGTAGCATTTGCAACGTACTTTGCAGTGTCTAAGTTTACTAAAATGGCCACAACAATGATTACATTTGTGACTGGGATTCAAAAAGTAATCACTGCGGTACGTGAGTGGACAGTTGTACAAAAGCTCTTAGATGTAGTAATGGCGGCTAATCCTATCGGCATAGCAGTTGCAGCAGTAGCTGCTCTAGCAGCCGGATTCGTCTTACTATATAAAAACTGTAAGCCATTTCGGAATTATATTAATGGCTTAGGTGTCCAAATTAAAAAGGCTTTTAGTGGCCTTCCCAGCGTCATTAAAAATGCCACTAAATTATTCACTAAGTTATATACAAGTGTTAAATCCACTTTCAATCGTTTAATTAAGTCGATTAAATCTGCATGGAACTCTATCACTAAAGGCTTCAACAACTTTAAAAAGTCTTTCAAGAAAAATTGGGATAAATTTTGGGATACTATCCATGACTTTTTTAAAAAATCGTGGAAAGACATATTATCGGTTTTTAAAGATTGGGCAAAAGACATTGATGCTGGGCTAAAGTCTTTTAGCAAGAATTTCAAGAAAGGCTGGAATAGTCTGTGGGATGGTGTAGGTTCGATTTTCACTAAAGCATGGAAGTCCATCAAAAACCTAGGCAAAAATGCCATGAACGGACTAATAGACATAGTCAATGGCGGTATTAATGCTATTGATAGTGTTATCCATGCATTTGGTGGCAGTAAACAGACTATTAAGCTACTTAGTCACGTAAAATTAGCTAGTGGTACAGATTCGATTTTAAAATCACTGTCAAATCCAATAACTAAGCCGGTAATTGCGACATTAAACGATGGTAATGATAGTCCAGCTACTGGCAATAGAGAAATGCTAGTAGATGATGCAGGTAATGCAGGTATTGTTCAGGGACGTAACACACAAATGCTGCTAACGCCTGGGATGCACGTAATAAATGCACGTGAAACTGCCATGTTTACAAGTTTTTTGTCAGCATTGGGACATAAGCGCTATGCAAAAGGGACTGATTCAATTCTTGGACAGATTGGTGATGCAGTTAACGGTGCTGTTAGTGGCATTGGCAATTGGATTTCAAAAACAGCTAATAATCTTAAAAAGTATTTTGACCTAGCGGTTAAAATCGTTTCTCATCCAGTTAAGTATGTTGAGGGACTTTTTAAGTGGACCAATCCTAAAAACGTCAATGGCGCTATGCAAGACCTAGCACATGGTGCATTTGATCATGCTCAAGATGCGGCTAAAGACTGGTGGTCAGCTCTTTGGCAAATGGCAGGGGGCAGTCTTAATGGCACTAGCTCCGCATTACTTAAAGCTGTCGAGAAGTATGGTGAAGGTCACAGATATGTCTGGGGTGCCTCTGGCCCAACAACGTTTGATTGTTCCGGCTTAGTGCTATATGCGCTTAAAAAGGGCTTTGGCATAAGCTATCCACACTATAGTGGTGCACAATATGAGCAGACCCAACATATTAGCAAATCTCAAGCACATAGTGGTGACCTTGTATTTTGGGGCAAAGGTGGTTCTGAGCACGTTGGCGTGTATGCAGGCGGCAACAAATATTTCAGCGCACAGTCACCAAGTCAAGGGATACATATGAACACCTTAGACTCAGTAGTGGGCTACGGTGCGCCTAAATTTGGTCGAGTCCAAGGTCTAAAGCAAGATACTGATACTGCAAAAGCAACTACTGGGTTACAAAAATACATTAAAAATCAAGTCGGGAACGGCTTTTTTAGTTTTATCAGCAAGCTGGGCTCAATGTTCGGCGTCCAAGATGGTGGTGGCCAACCAAGTGGTTCACATAAGAACTGGTTAGCAGAAGCTGGCTTTAAACCAAGCGATTTTGGCTACATTACGTATATCGTGGATCATGAATCAGGTTGGAATCCAAAAGCTACTAATCCTAGTTCAGGAGCATACGGTCTACCTCAGTCATTACCAGGCAACAAAATGGCCGCTGCAGGTAGTGACTGGCGAACTAATCCGATTACGCAACTGCGATGGATGAAGAGTTACGTTAACAGCGCCTATGGTGGTGCTAGACAGGCATATCAGTTTTGGCTAAAAAATCATGCGTATGCTAATGGCGGTATAGTCAGCACATCTGGCATCTATCAGCTAGCAGAGCAAAATATGCCAGAAATGATTATCCCGTTGGATACTAGCAAACATAATAGGGCTACCCAGCTGCTAGACCAGACAACTAAGATTGTTAAGGGCAATGACGGTATCAAATCCGAAATTCAATCAGAAAAGATTAGTAATATGCTTAACACAATAATTGCATTGCTGGGGACTATTGCTAATTCGGATAAAGCTGACAAGATTATTGAGCTATTAAAGCTTGAAGTTCAAAATCCGATTAAAGTTGACACTAAAGTCAATCTTGACGGTAAAACATTGGCAAGACAACTTGAAAAATATCAGGTAAGGAGACAGCAAGGAGGTCAAGCAGGTTATGCGTTCTAG
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0008234 | cysteine-type peptidase activity | molecular function | None (UniProt) |
| GO:0016020 | membrane | cellular component | None (UniProt) |
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4SCHa)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50