Protein

Protein accession
B7SSN6 [UniProt]
Representative
4hcut
Source
UniProt (cluster: phalp2_1915)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNISQAGINLIKSFEGLRTKAYKAVPTEKYYTIGYGHYGSDVRVDQVISEKEAEKLLYDDVQSFVDAVNKLLKVDVTQNQFDALVSFAYNVGVGALKSSTLLEYLNTGNFQKAADEFLRWNKSGGKVYSGLVKRREQERTLFLTGGSKNVSRETSKPKTTATNTYVVEKGDSLSEIATKNKTTVERLLRLNPHIKNPNKIYVGQNINTGSPVKSTLKYKIKRGETLSGVAKKNKTTVSQLMKLNPSIKNANNIYAGQTIRLK
Physico‐chemical
properties
protein length:262 AA
molecular weight:29171,0 Da
isoelectric point:9,84
hydropathy:-0,53
Representative Protein Details
Accession
4hcut
Protein name
4hcut
Sequence length
279 AA
Molecular weight
31354,83530 Da
Isoelectric point
9,91525
Sequence
VKLSNTGLSLIKSFEGVRLTAYKAVPTEEHWTIGYGHYGPDIKQNMRITQDQADEYFRKDVARFEKAVNDNVKVPLNQNQFDALVSFTYNVGAGALQRSTLLELLNQGKYEAAANQFDVWNKSSGKVLAGLVKRRAKEKELFLKDLPKEKVVKPNVEKVSAEIVTKPNKPSTTKKVIKTYKVKSGDALSKLASKWGTTVKRLQELNGIKNPDKIYIGQTLKYETTGNVDNIKRYHTVVNGDNVTKIAKKYGTTVSAIKKLNPGIKNINLIYPKQKIRIK
Other Proteins in cluster: phalp2_1915
Total (incl. this protein): 54 Avg length: 258,7 Avg pI: 9,18

Protein ID Length (AA) pI
4hcut 279 9,91525
15vXF 259 5,33357
1QQ5d 263 9,81468
1cfpB 277 9,38629
1dLsd 274 9,41253
1jYRN 263 6,65888
2LNgT 346 6,20610
2LO6I 346 8,39657
2VOjr 201 9,68162
2ZNwG 333 9,13003
2foVQ 263 8,83483
325Kd 345 7,72967
39y6o 200 9,73081
3Ll4z 348 8,88698
3c16v 266 7,83499
4EmrS 205 9,69800
4OECt 260 6,14193
4v79f 260 8,47322
5hT09 204 9,69664
5i3aE 207 9,73867
5ioMn 219 9,89449
5nN6j 197 7,87721
6DdoW 333 9,03358
6PkbE 257 7,71762
7ASWe 258 10,02092
7AaS1 258 9,94491
7AwAu 262 9,83731
7Awm6 262 9,86748
7g7HI 274 9,94156
7lFdv 280 9,76962
8LKhV 258 10,02040
8LKlW 263 9,90868
PGIb 353 8,82574
d0qD 332 7,67749
A0A222Z0F1 258 10,02092
A0A6M9Z9S7 258 9,94491
A0A142IG96 262 9,86748
P07540 258 10,02040
Q37896 263 9,90868
A0A7T8EP28 258 9,97108
Q94ML9 258 9,97108
A0A6J5MCC1 154 9,85639
A0A889IMY6 258 10,02040
A0A889INZ4 284 10,21297
A0A976N061 266 9,94162
A0AAE9FF95 258 10,09512
A0AAE9FK70 258 10,02040
A0AAE9FMV5 258 10,04780
A0AAE9JUA4 258 10,04780
P11187 258 9,99507
A0A6G5Y4H0 144 9,51452
A0A6G5Y5U5 148 9,01824
A0A6J5RUX8 144 6,31091
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_30084
2YvPg
3 38,5% 288 2.977E-67
2 phalp2_17145
2TusA
116 43,5% 186 9.252E-44
3 phalp2_23976
29bMx
1 42,7% 180 3.010E-40
4 phalp2_36216
7pp4x
2 37,0% 186 4.927E-39
5 phalp2_14179
2AnJ3
6 36,0% 205 4.927E-39
6 phalp2_29634
8aoUy
1 37,7% 175 1.147E-35

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4hcut (279 AA)
Member sequence: B7SSN6 (262 AA)
1 279 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959, PF01476

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Nf (Bacteriophage Nf)
[NCBI]
2992639 Salasmaviridae > Beecentumtrevirus > Beecentumtrevirus Nf
Host Bacillus subtilis
[NCBI]
1423 Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus subtilis group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
EU622808 [NCBI]
CDS location
range 15083 -> 15871
strand +
CDS
ATGAATATTTCACAAGCTGGTATCAATTTGATTAAGAGTTTTGAAGGGTTAAGAACAAAAGCTTACAAAGCTGTGCCGACAGAGAAATATTATACGATTGGTTACGGACATTATGGTTCTGATGTTCGTGTTGATCAAGTGATAAGTGAGAAAGAAGCCGAGAAACTATTGTATGATGATGTGCAGTCATTTGTTGACGCCGTTAATAAACTACTTAAAGTAGATGTTACACAAAATCAGTTTGACGCTCTTGTGTCATTTGCATATAACGTAGGGGTAGGTGCGTTAAAATCATCCACATTGTTGGAATATTTAAACACTGGAAACTTTCAGAAGGCGGCTGATGAATTCCTGAGGTGGAATAAGTCGGGAGGTAAGGTGTACAGTGGACTTGTGAAGAGAAGAGAACAGGAAAGAACGTTGTTTCTTACAGGTGGAAGTAAAAATGTTTCACGTGAAACATCAAAACCGAAAACAACTGCAACAAACACGTATGTGGTTGAGAAGGGAGACTCACTCAGTGAGATTGCTACGAAAAATAAAACAACTGTTGAAAGATTATTGAGATTAAATCCTCACATCAAAAATCCAAACAAAATATATGTTGGACAAAATATTAATACTGGTTCACCAGTTAAATCAACATTGAAATATAAAATTAAGCGTGGCGAAACACTTAGTGGGGTTGCTAAAAAGAATAAAACAACTGTATCTCAGTTGATGAAATTAAACCCAAGTATTAAGAACGCAAACAACATTTATGCAGGGCAAACAATCAGACTTAAATAG

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
B7SSN6
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi000188d2d9_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4hcut) rather than this protein.
PDB ID
4hcut
Method AlphaFoldv2
Resolution 83.86
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50