Protein

Protein accession
E5G068 [UniProt]
Representative
6ndmR
Source
UniProt (cluster: phalp2_30607)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIIGSRTGHSKKCLGAVSLRNEWECMNILEREVNKILKAHGHTIIDCNSSASTENGELSEGCRKANAQHIDIFLSYHMNASKDHKGHGTECWVHPNARASCKEIASRISSNLSKLGFYNRGVKTGLLYEMKNVNAPNIIIETCFCDNEKDIGIWSPTPYEVMARHIANALDPSIPIEEKPKRWQVRVYAFTSKEEAQKYSDRITKELKAYNVVEEI
Physico‐chemical
properties
protein length:216 AA
molecular weight:24423,6 Da
isoelectric point:8,08
hydropathy:-0,52
Representative Protein Details
Accession
6ndmR
Protein name
6ndmR
Sequence length
343 AA
Molecular weight
37291,86580 Da
Isoelectric point
5,61861
Sequence
MKIAIDKGHCLYGFDTSADGRSVGGFSESEYDRILGDKIINLLRANGHEVIDVTVDNGNQFSSMYGSLGARTSKANANDVDLYIAIHFNAGGGRGTEVYLAPRSYYGSDSSYNTNLGYASRVQQKMVSLGFMNRGVKTEEFYVLVNTNAHAILIETCFCDSVADKQLYDSLGADKVAKAIVEGILNISVSTSTPSISTTTPTTSNAATNSDAPFRVRKSDNSPSSQIFASGTLEGAKANCPAGYCVFDKNGTLRYSNIPQASNSKTYAEDGTFYFTTNVQIRTQPCDGYETGLCYYAGESVIYHTVILDKNGFNWIVYNRSNGQQGYMKVRDLSTGEKYGYAE
Other Proteins in cluster: phalp2_30607
Total (incl. this protein): 8 Avg length: 269,4 Avg pI: 8,15

Protein ID Length (AA) pI
6ndmR 343 5,61861
1034D 317 8,88872
10D2I 317 9,00773
1pyu0 225 8,83580
3Y9NF 303 9,61644
A0AAU8KZ58 217 7,59337
A0AAU8L113 217 7,59337
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_32076
5RDQF
9 40,1% 254 9.519E-52
2 phalp2_2703
7cyBj
192 45,5% 215 2.114E-50
3 phalp2_35186
67E6C
93 32,0% 278 6.782E-33
4 phalp2_12323
7wjSB
3 30,9% 220 7.721E-27
5 phalp2_24413
4m1OU
1246 29,6% 243 3.430E-19
6 phalp2_32087
63Erl
43 30,4% 220 3.430E-19
7 phalp2_11988
35NoL
2 26,2% 255 8.309E-19
8 phalp2_23918
1NTlO
42 27,9% 265 1.264E-15
9 phalp2_21446
82r2O
3 25,7% 260 1.264E-15
10 phalp2_38795
1mhnb
2 24,0% 295 1.281E-09

Domains

Domains [InterPro]
Ami3
Unannotated
SH3_5
Representative sequence (used for alignment): 6ndmR (343 AA)
Member sequence: E5G068 (216 AA)
1 343 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01520, PF08460

Taxonomy

  Name Taxonomy ID Lineage
Phage Clostridium phage CpV1
[NCBI]
926066 Guelinviridae > Capvunavirus > Capvunavirus CpV1
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HM640230 [NCBI]
CDS location
range 9896 -> 10546
strand -
CDS
ATGATTATAGGAAGTAGAACAGGACATAGTAAAAAATGTTTAGGGGCGGTAAGCTTACGTAATGAGTGGGAGTGTATGAATATTTTGGAGCGTGAGGTAAATAAAATACTTAAAGCTCACGGGCATACCATAATAGATTGTAATAGCTCCGCAAGTACAGAAAATGGGGAACTTTCGGAAGGGTGTAGGAAAGCAAACGCTCAACACATAGATATATTTTTATCCTATCATATGAACGCAAGCAAAGACCATAAGGGTCACGGTACAGAGTGTTGGGTACACCCTAACGCAAGAGCATCATGTAAAGAAATAGCTTCAAGAATTTCAAGTAACCTTTCAAAATTAGGTTTTTATAATAGAGGTGTAAAAACTGGTTTATTATATGAAATGAAGAATGTAAATGCACCTAATATAATAATAGAAACTTGTTTTTGTGATAATGAGAAAGATATTGGAATATGGTCGCCTACACCATATGAGGTTATGGCTAGACATATAGCAAACGCCCTTGACCCTTCAATACCAATAGAAGAAAAACCAAAAAGGTGGCAAGTTAGAGTTTACGCATTTACTAGCAAAGAAGAAGCACAAAAATACTCTGATAGAATAACAAAAGAATTAAAAGCGTATAACGTAGTAGAGGAGATATAA

Gene Ontology

Description Category Evidence (source)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
E5G068
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi0001f3a50f_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6ndmR) rather than this protein.
PDB ID
6ndmR
Method AlphaFoldv2
Resolution 88.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50