Protein

Protein accession
A0A2K9V347 [UniProt]
Representative
1kAuJ
Source
UniProt (cluster: phalp2_29719)
Protein name
Endolysin, N-acetylglucosaminidase domain
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MQAEELKYLSHEAVIKKVAPLATLDNVTSGIPAAITLAQFIIESFWGRSPLASASNNCFGMKKNLSGNNWPGSTWTGKSMTWVSSEASSGETVRQPSEFRVYASVEDSIADHSAYLAGAMNGTDLRYKGLRWQLDYRTAAQIIKDGGYATAPDYVEVLCAMIERYNLTQYNVAQPPFLVRVTVPMVAARKGPGSEYPATVVVRGPNIFTITEVQGSYGRLKSGAGWLNLHYAEWLCSE
Physico‐chemical
properties
protein length:238 AA
molecular weight:26072,3 Da
isoelectric point:6,84
hydropathy:-0,16
Representative Protein Details
Accession
1kAuJ
Protein name
1kAuJ
Sequence length
844 AA
Molecular weight
92804,87940 Da
Isoelectric point
7,70813
Sequence
LEDNVNIIKNTNFTAYRNCRERVEKVKYIVIHYTGGEGTAADQVKYFNNGNRSASAHYFVDRSGEIREYCDPKKWYAWHCGGSLESSHHPYYGKCTNSNSIGVEICTHNNGKTWEFTKEAVAAAKELTKYLMKEYDVSADNVIRHYDVTGKSCPRVPGWGAVGGNAEWEKFKKALGAETVDTPATEKKKWYRVRKSWDDAASQIGAYLILDNAIAEANNSGPKYAVYDWTGKEIYRYVEASGTKEKEIRFFYPGYTRTSSPDDRQGAGCVWHDQHKNCFVYDAYQKNTDAGNNLIQYILDNGLRSIDGCGSHAHGDHLGGFFKMLESGKITIENFYCYDPDSLKLAGTGSANARSAKEDKEYLQSLIDKLKARGTKIHFVKTGDTIVCGEMVFEVCRNQPASFGQYDTGKAWGYLNDGSINLYERACHVLLCGDGGGRKAAQYFNGDIDVCESEHHGNGDGSGKVEEVKSRGCGLAIECNNEKNGPGSCGFTQYGARRFKEAGIPVWMLNADINGIAKGGKLTVTQGSGSKTFDVPFGLVFYRVRKSWADAGSQIGAYTILDKAKECADQHPGYIVFDETGKAVYPVEGSGAAPEPAATKPKTEQEIFIETVGTMARDDMAKNGILACITIAQAILESGWGTSELAVNAKNLFGMKKSLSGNTWSGSTWGGKSYSKLTQEVYTSGPAVVRADFRAYESWAESVGDHSAYLAGAKNGSRLRYAGLIGCTDYRKAAQIIKDGDYATAPDYVEKLCKLIEQYNLTEWNSVAAVPAGSGALDEYHTVKWTGMTKRACSGYMYPDARSKVTVEVHQGVKAGVCKGIGKFYLCKVDDTYGYIHKSHITKL
Other Proteins in cluster: phalp2_29719
Total (incl. this protein): 6 Avg length: 530,3 Avg pI: 7,50

Protein ID Length (AA) pI
1kAuJ 844 7,70813
2VjBz 786 9,11140
3iBYE 838 7,06136
A0AAE9UUR2 238 7,12758
A0AAF0A405 238 7,12758
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28540
41m0Z
1 34,0% 690 1.044E-101
2 phalp2_15411
21scl
1 28,1% 750 3.013E-63

Domains

Domains [InterPro]
Unannotated
Unannotated
Unannotated
Unannotated
Unannotated
Disordered region
Representative sequence (used for alignment): 1kAuJ (844 AA)
Member sequence: A0A2K9V347 (238 AA)
1 844 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Faecalibacterium phage FP_Epona
[NCBI]
2070182 Eponavirus > Eponavirus epona
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MG711462 [NCBI]
CDS location
range 44999 -> 45715
strand +
CDS
ATGCAAGCAGAAGAACTGAAGTATTTGTCCCACGAAGCGGTCATCAAAAAGGTCGCACCACTGGCCACCTTGGACAACGTCACGTCCGGCATCCCGGCAGCGATCACCCTCGCCCAGTTCATCATCGAAAGTTTCTGGGGCCGGTCTCCGCTGGCCTCGGCTTCCAACAACTGCTTCGGGATGAAGAAGAACCTCTCCGGCAACAACTGGCCCGGCTCCACATGGACCGGGAAAAGCATGACGTGGGTATCTTCAGAGGCCAGCAGCGGAGAGACCGTCCGGCAGCCCTCCGAGTTCCGGGTGTACGCCAGCGTCGAGGACTCCATCGCCGACCACAGCGCATATCTCGCCGGGGCGATGAACGGCACCGACCTGCGGTACAAGGGACTGCGCTGGCAGCTGGACTACCGCACCGCCGCCCAGATCATCAAGGACGGAGGGTACGCCACCGCCCCGGACTACGTCGAGGTTCTCTGCGCCATGATCGAGCGGTACAACCTGACCCAGTACAACGTGGCGCAGCCGCCCTTTCTGGTTCGGGTGACCGTCCCGATGGTCGCCGCCCGGAAAGGCCCCGGCAGCGAGTATCCCGCCACCGTGGTCGTCCGTGGCCCGAACATCTTCACCATCACCGAGGTGCAGGGCAGCTACGGCAGACTCAAGAGCGGAGCCGGGTGGCTCAACCTCCACTACGCAGAGTGGCTCTGCAGCGAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000bec673e_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1kAuJ) rather than this protein.
PDB ID
1kAuJ
Method AlphaFoldv2
Resolution 69.43
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50