Protein

Protein accession
A0A6G5YCE9 [UniProt]
Representative
4SCHa
Source
UniProt (cluster: phalp2_16240)
Protein name
NlpC/P60 domain-containing protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAELQHDMAEQVSLNVDAAVKSFKTLTSAVKANTAEWQANAAAAKRNGESQKSQQIKIDGLNKSIELQKAKLDDLKKQQAAIDTSTEKGTKQYYDLTAQITKTNVQIDKQTEQLNKAKSGMSYYTTGLADLQKSYRQATQLSQSYITRLEAEGKTAEANRAKLDSYSNTVEHYTKQLQIQQTELSKVAAASGKNSDAYREQEIRVNKTATSLANAKKQMDDLSSSLAKPKPVGFLDKIKAQLHGTEDAAKETKTSITDIVKGSAIGTGLSNIVGSLSSTLIDAAKQGFNLAKAGKEISENWEHIGVNEEGVKELTGQIGEIRGVSDASGAAVTKLQTSIYGLTSGNIKETKALTNELYAFGKAGGASEDQIVQIGGKLTRIFSAAKVNLSSFNKTFATMPGLKTAIQKASGMTKDAFNDALANGKISGVQMKKYMLDAAKGSGEAWAKFGETTEGKIAKTKGTWTNFTAAVMKPLANTALDGLSKGLDKIIGKNGQLNATGQHIQSIAGALSQNVGKGLITAIDFIAKHTTAVKAMGVAFATYFAVSKFTKMATTMITFVTGIQKVITAVREWTVVQKLLDVVMAANPIGIAVAAVAALAAGFVLLYKNCKPFRNYINGLGVQIKKAFSGLPSVIKNATKLFTKLYTSVKSTFNRLIKSIKSAWNSITKGFNNFKKSFKKNWDKFWDTIHDFFKKSWKDILSVFKDWAKDIDAGLKSFSKNFKKGWNSLWDGVGSIFTKAWKSIKNLGKNAMNGLIDIVNGGINAIDSVIHAFGGSKQTIKLLSHVKLASGTDSILKSLSNPITKPVIATLNDGNDSPATGNREMLVDDAGNAGIVQGRNTQMLLTPGMHVINARETAMFTSFLSALGHKRYAKGTDSILGQIGDAVNGAVSGIGNWISKTANNLKKYFDLAVKIVSHPVKYVEGLFKWTNPKNVNGAMQDLAHGAFDHAQDAAKDWWSALWQMAGGSLNGTSSALLKAVEKYGEGHRYVWGASGPTTFDCSGLVLYALKKGFGISYPHYSGAQYEQTQHISKSQAHSGDLVFWGKGGSEHVGVYAGGNKYFSAQSPSQGIHMNTLDSVVGYGAPKFGRVQGLKQDTDTAKATTGLQKYIKNQVGNGFFSFISKLGSMFGVQDGGGQPSGSHKNWLAEAGFKPSDFGYITYIVDHESGWNPKATNPSSGAYGLPQSLPGNKMAAAGSDWRTNPITQLRWMKSYVNSAYGGARQAYQFWLKNHAYANGGIVSTSGIYQLAEQNMPEMIIPLDTSKHNRATQLLDQTTKIVKGNDGIKSEIQSEKISNMLNTIIALLGTIANSDKADKIIELLKLEVQNPIKVDTKVNLDGKTLARQLEKYQVRRQQGGQAGYAF
Physico-chemical properties
protein length: 1363 AA
molecular weight: 147046.5 Da
isoelectric point: 9.61
hydropathy: -0.32
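The page does not say how the hydropathy value is computed. A common choice is the GRAVY score (mean Kyte-Doolittle hydropathy per residue), so the sketch below assumes that definition; it is not confirmed as the database's method. The helper name `gravy` is hypothetical, and the demo runs on only the first 20 residues of the protein sequence, so it does not reproduce the full-length value.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed GRAVY definition)."""
    seq = "".join(seq.split()).upper()  # tolerate wrapped or spaced input
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


# First 20 residues of the protein sequence, used only as a short demo.
n_term = "MAELQHDMAEQVSLNVDAAV"
print(len(n_term), round(gravy(n_term), 3))
```

Negative scores indicate a hydrophilic protein overall, which is the direction of the value reported for this entry.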
Representative Protein Details
Accession
4SCHa
Protein name
4SCHa
Sequence length
1216 AA
Molecular weight
132960.79530 Da
Isoelectric point
10.26641
Sequence
MAANIPMGSMSTEIKLNGSQSVKTLRELKQAVTQATSAWKAQRAELSTIGESTKAAEAKYKGLAETIKRQKDYISGLREAQKHLQEAQKSVDRSTREGRQEYGKYNEALQKNETRVHSAEQRLAGLANQQSKAHKSLDYYKSGLADVQKQLKSSESVTRSYVNRLQSEGKSYEAAKAKLAGYRSSLDNLNKQQKIQEAELSKIASTAGKSSDAFKRQQVRVNETATSINKTKSNMSSLNDTIKKTNPSVFDRLKDKLHGTSNEAKDTSHNIMDIAKGSAIGNMVSNGFSSLGSALWSAAKNGFKLDEAGEELKKRWSDLGLPKREVKGLMDQIGKIRGASNASGASINQLQRSLYNLTNGNVGHVKALTNELFAFGKQAGLSDKEIAGMGPKLTRVFSQSKVRLSAFNRAFGQMPGLRNAIIKASGMSKKAFNNALANGKISGTKLQELMIKASSKSGKAWERFGSTTKGQLARAQGQYTNMTAMFMKPIEFVSIKALNGVMAQLVNKRGGLTKTGSAIQGIVKNLSKNIGKGITNTIDFIVHHIQGIKTFGKVLAGAFAAKKIYDFTGGIQKGIGDVADSIDAFKKLAKSTSIADAAQKLLNLDEKANPVGLIAVAVVALGVAFYELYKHCKPFRNFVNSIGKQAVKVFNGVVKFFKNNWKEIGELIVNPIQGGFDLLYKNNKGFRKWVNGLVKGFKNAWKGVGKWFGNIGKGIQKSWKGMTKWFSKLGKNMSKGIKSSWKGVTKWFSKIGKNIQKSWKGMTKWFKTLGKNMAKGLKSAWKSMVKWFSGIAKNIKKAWHAMTSWFSKLGHGMASGLKSAWKGVTKWFSNIGKGIRNAWRSMTGWFTRLGRNMSKGLKATWHSITSWFGNIVDGIKNAWSGVTSFFGKIGSNSVRVFKSAWHGITSWFKNIVDGIKNAWDGFWDKVSGPIKFIGKVFSGKAKIGKIHFAEGTDWRKHYGVPAVVNDAPGSKYREGLLTNGQIIPFPDKRNLPFWLLPGQDIVNGDDMAKIFGSAIHYADGTVTPATNLSSTPAGLSNQSGLGLLINITDDILQAITGETVSNFTPTKTAIPGINSIRSTPIATPLSKSPINNKNSDKAGKLGKSTNTKDKADIKDLKEQTAEMQKAVVVSKQFVKSIASVEKQVKALYATLKKNPFGKYISSQATKAVKALKGKGNFAKVIKSMNSKMSKDIKKTNSANLKNIKRFSSSMIKTFKS
Other Proteins in cluster: phalp2_16240
Total (incl. this protein): 5; Avg length: 1332.4 AA; Avg pI: 10.12

Protein ID Length (AA) pI
4SCHa 1216 10.26641
3xQO5 1294 10.20027
b1ac 1340 10.16855
yjLK 1449 10.34739
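The cluster averages can be checked by hand. The sketch below assumes the fifth member is this protein (A0A6G5YCE9, 1363 AA, pI 9.61), which the table omits but the "incl. this protein" total counts.

```python
# (length in AA, pI) for the four listed members plus this protein.
members = {
    "A0A6G5YCE9": (1363, 9.61),   # this protein; not listed in the table
    "4SCHa": (1216, 10.26641),
    "3xQO5": (1294, 10.20027),
    "b1ac": (1340, 10.16855),
    "yjLK": (1449, 10.34739),
}

avg_length = sum(length for length, _ in members.values()) / len(members)
avg_pi = sum(pi for _, pi in members.values()) / len(members)

print(round(avg_length, 1), round(avg_pi, 2))  # 1332.4 10.12
```

Both values agree with the averages reported for the cluster.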
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_39680 (eQI) 275 29.2% 1233 3.342E-165
2 phalp2_10036 (7fPkw) 316 31.3% 1140 6.129E-140
3 phalp2_3948 (7g15J) 9 28.0% 1170 3.401E-134
4 phalp2_1109 (7vQjn) 20 25.7% 991 4.468E-83
5 phalp2_23533 (7euEk) 5 26.0% 900 8.438E-76
6 phalp2_12070 (5hYmz) 8 24.2% 1033 1.229E-71
7 phalp2_27797 (7wN5Z) 2 22.3% 1133 7.805E-57
8 phalp2_37988 (5tToF) 60 23.8% 842 4.433E-47
9 phalp2_12320 (7vQnK) 16 19.7% 1221 3.393E-44
10 phalp2_36980 (8KjJ4) 4 20.7% 1155 1.352E-43

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

Name Taxonomy ID Lineage
Phage Caudoviricetes sp [NCBI] 2832643 No lineage information
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MN855847 [NCBI]
CDS location
range 1911 -> 6002
strand +
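The CDS range can be cross-checked against the reported protein length. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the range includes the stop codon; the function name is hypothetical.

```python
def cds_protein_length(start: int, end: int) -> int:
    """Protein length implied by a 1-based, inclusive CDS range ending in a stop codon."""
    n_nt = end - start + 1
    if n_nt <= 0 or n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a positive multiple of 3")
    return n_nt // 3 - 1  # the final codon is the stop codon


print(cds_protein_length(1911, 6002))  # 1363
```

The range spans 4092 nt, or 1364 codons including the stop, which matches the 1363 AA protein length listed for this entry.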
CDS
ATGGCTGAGTTGCAACACGATATGGCTGAGCAAGTTTCATTGAACGTAGATGCTGCAGTTAAAAGTTTTAAAACACTCACTAGTGCGGTTAAGGCTAATACTGCTGAATGGCAAGCTAATGCTGCTGCGGCTAAACGAAACGGCGAAAGCCAAAAGTCTCAGCAGATTAAGATTGACGGGTTGAACAAATCAATAGAGTTGCAAAAAGCCAAATTAGACGATTTAAAAAAACAGCAAGCAGCTATTGACACGTCTACAGAAAAAGGGACTAAGCAATACTATGATTTAACAGCTCAAATTACTAAAACGAATGTCCAAATTGATAAGCAAACTGAGCAGCTCAACAAAGCAAAGTCTGGGATGTCCTATTACACGACTGGTCTCGCTGACTTACAAAAAAGCTATAGACAGGCTACTCAGTTGTCGCAGTCGTACATTACTAGGTTAGAAGCTGAGGGTAAAACAGCGGAAGCTAATCGAGCTAAACTAGATAGCTATAGTAATACTGTTGAACATTACACTAAGCAATTACAAATACAGCAGACAGAATTGTCTAAAGTTGCAGCAGCAAGCGGTAAAAATTCCGATGCATATCGGGAACAGGAAATACGTGTTAACAAAACCGCCACCAGCTTGGCAAATGCTAAAAAACAGATGGATGATTTATCGTCCAGTCTAGCAAAGCCTAAACCAGTTGGTTTTTTAGACAAAATTAAAGCACAGTTACATGGAACCGAAGATGCAGCCAAGGAAACTAAGACAAGTATCACAGATATTGTTAAGGGGTCTGCCATCGGAACAGGTCTATCTAATATTGTTGGTAGTCTAAGCTCAACTCTAATTGATGCAGCTAAACAGGGTTTTAACTTAGCTAAAGCAGGCAAAGAAATTAGCGAAAACTGGGAGCATATTGGCGTTAATGAAGAGGGCGTTAAAGAGCTAACAGGTCAGATTGGTGAAATTCGTGGTGTTAGTGATGCCAGTGGAGCTGCTGTTACTAAATTACAGACTAGTATATATGGCTTGACTAGTGGAAATATTAAAGAAACTAAGGCCTTGACAAATGAGCTGTATGCTTTTGGTAAAGCTGGTGGAGCCAGCGAAGATCAAATTGTGCAGATTGGCGGTAAGTTAACTCGTATTTTCTCAGCTGCAAAAGTTAATCTATCTAGTTTTAACAAAACGTTTGCAACTATGCCTGGTCTTAAAACTGCTATCCAAAAAGCTAGTGGTATGACCAAGGACGCCTTTAATGATGCATTAGCTAACGGCAAAATTAGTGGCGTTCAGATGAAAAAGTACATGCTCGATGCGGCAAAGGGTTCTGGTGAAGCGTGGGCAAAATTCGGAGAAACAACTGAAGGCAAGATTGCTAAGACTAAAGGTACCTGGACTAACTTTACTGCAGCCGTAATGAAACCGTTAGCTAATACAGCTCTAGATGGTTTAAGCAAAGGGTTGGACAAAATTATTGGCAAAAATGGCCAGTTAAATGCTACAGGTCAGCATATCCAGTCAATTGCTGGTGCCTTATCGCAAAATGTTGGTAAAGGACTAATAACAGCTATCGACTTTATCGCTAAGCACACAACTGCTGTCAAAGCAATGGGGGTAGCATTTGCAACGTACTTTGCAGTGTCTAAGTTTACTAAAATGGCCACAACAATGATTACATTTGTGACTGGGATTCAAAAAGTAATCACTGCGGTACGTGAGTGGACAGTTGTACAAAAGCTCTTAGATGTAGTAATGGCGGCTAATCCTATCGGCATAGCAGTTGCAGCAGTAGCTGCTCTAGCAGCCGGATTCGTCTTACTATATAAAAACTGTAAGCCATTTCGGAATTATATTAATGGCTTAGGTGTCCAAATTAAAAAGGCTTTTAGTGGCCTTCCCAGCGTCATTAAAAATGCCACTAAATTATTCACTAAGTTATATACAAGTGTTAAATCCACTTTCAATCGTTTAATTAAGTCGATTAAATCTGCATGGAACTCTAT
CACTAAAGGCTTCAACAACTTTAAAAAGTCTTTCAAGAAAAATTGGGATAAATTTTGGGATACTATCCATGACTTTTTTAAAAAATCGTGGAAAGACATATTATCGGTTTTTAAAGATTGGGCAAAAGACATTGATGCTGGGCTAAAGTCTTTTAGCAAGAATTTCAAGAAAGGCTGGAATAGTCTGTGGGATGGTGTAGGTTCGATTTTCACTAAAGCATGGAAGTCCATCAAAAACCTAGGCAAAAATGCCATGAACGGACTAATAGACATAGTCAATGGCGGTATTAATGCTATTGATAGTGTTATCCATGCATTTGGTGGCAGTAAACAGACTATTAAGCTACTTAGTCACGTAAAATTAGCTAGTGGTACAGATTCGATTTTAAAATCACTGTCAAATCCAATAACTAAGCCGGTAATTGCGACATTAAACGATGGTAATGATAGTCCAGCTACTGGCAATAGAGAAATGCTAGTAGATGATGCAGGTAATGCAGGTATTGTTCAGGGACGTAACACACAAATGCTGCTAACGCCTGGGATGCACGTAATAAATGCACGTGAAACTGCCATGTTTACAAGTTTTTTGTCAGCATTGGGACATAAGCGCTATGCAAAAGGGACTGATTCAATTCTTGGACAGATTGGTGATGCAGTTAACGGTGCTGTTAGTGGCATTGGCAATTGGATTTCAAAAACAGCTAATAATCTTAAAAAGTATTTTGACCTAGCGGTTAAAATCGTTTCTCATCCAGTTAAGTATGTTGAGGGACTTTTTAAGTGGACCAATCCTAAAAACGTCAATGGCGCTATGCAAGACCTAGCACATGGTGCATTTGATCATGCTCAAGATGCGGCTAAAGACTGGTGGTCAGCTCTTTGGCAAATGGCAGGGGGCAGTCTTAATGGCACTAGCTCCGCATTACTTAAAGCTGTCGAGAAGTATGGTGAAGGTCACAGATATGTCTGGGGTGCCTCTGGCCCAACAACGTTTGATTGTTCCGGCTTAGTGCTATATGCGCTTAAAAAGGGCTTTGGCATAAGCTATCCACACTATAGTGGTGCACAATATGAGCAGACCCAACATATTAGCAAATCTCAAGCACATAGTGGTGACCTTGTATTTTGGGGCAAAGGTGGTTCTGAGCACGTTGGCGTGTATGCAGGCGGCAACAAATATTTCAGCGCACAGTCACCAAGTCAAGGGATACATATGAACACCTTAGACTCAGTAGTGGGCTACGGTGCGCCTAAATTTGGTCGAGTCCAAGGTCTAAAGCAAGATACTGATACTGCAAAAGCAACTACTGGGTTACAAAAATACATTAAAAATCAAGTCGGGAACGGCTTTTTTAGTTTTATCAGCAAGCTGGGCTCAATGTTCGGCGTCCAAGATGGTGGTGGCCAACCAAGTGGTTCACATAAGAACTGGTTAGCAGAAGCTGGCTTTAAACCAAGCGATTTTGGCTACATTACGTATATCGTGGATCATGAATCAGGTTGGAATCCAAAAGCTACTAATCCTAGTTCAGGAGCATACGGTCTACCTCAGTCATTACCAGGCAACAAAATGGCCGCTGCAGGTAGTGACTGGCGAACTAATCCGATTACGCAACTGCGATGGATGAAGAGTTACGTTAACAGCGCCTATGGTGGTGCTAGACAGGCATATCAGTTTTGGCTAAAAAATCATGCGTATGCTAATGGCGGTATAGTCAGCACATCTGGCATCTATCAGCTAGCAGAGCAAAATATGCCAGAAATGATTATCCCGTTGGATACTAGCAAACATAATAGGGCTACCCAGCTGCTAGACCAGACAACTAAGATTGTTAAGGGCAATGACGGTATCAAATCCGAAATTCAATCAGAAAAGATTAGTAATATGCTTAACACAATAATTGCATTGCTGGGGACTATTGCTAATTCGGATAAAGCTGACAAGATTATTGAGCTATTAAAGCTTGAAGTTCAAAATCCGATTAAAGTTGACACTA
AAGTCAATCTTGACGGTAAAACATTGGCAAGACAACTTGAAAAATATCAGGTAAGGAGACAGCAAGGAGGTCAAGCAGGTTATGCGTTCTAG
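As a consistency check, the CDS can be translated and compared with the protein sequence. The following is a minimal sketch, not the database's pipeline: it builds the standard codon table (the bacterial and phage table 11 uses the same codon-to-amino-acid assignments) and strips whitespace first, since the CDS above is wrapped across several lines. Only the first and last few codons are translated here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # rejoin a line-wrapped sequence
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGGCTGAGTTGCAACACGATATG"))  # MAELQHDM (start of the CDS)
print(translate("GGTTATGCGTTCTAG"))           # GYAF* (end of the CDS)
```

Both fragments agree with the first and last residues of the protein sequence listed for this entry.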

Gene Ontology

GO ID Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008234 cysteine-type peptidase activity molecular function None (UniProt)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4SCHa) rather than this protein.
PDB ID 4SCHa
Method AlphaFoldv2
Resolution 53.43
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
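The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; the legend uses strict inequalities, so assigning the exact boundary values (90, 70, 50) to the lower band is an assumption, and the function name is hypothetical.

```python
def plddt_confidence(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"  # boundary values fall into the lower band (assumption)


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_confidence(score))
```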