Protein

Protein accession
I1TJX3 [UniProt]
Representative
3xQtT
Source
UniProt (cluster: phalp2_34303)
Protein name
Endolysin CS74L
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKIGIDMGHTLSGADYGVVGLRPESVLTREVGTKVIYKLQKLGHVVVNCTVDKASSVSESLYTRYYRANQANVDLFISIHFNATPGGTGTEVYTYAGRQLGEATRIRQEFKSLGLRDRGTKDGSGLAVIRNTKAKAMLVECCFCDNPNDMKLYNSESFSNAIVKGITGKLPNGESGNNNQGGNKVKAVVIYNEGADRRGAEYLADYLNCPTISNSRTFDYSCVEHVYAVGGKKEQYTKYLKTLLSGANRYDTMQQILNFINGGK
Physico‐chemical
properties
protein length:264 AA
molecular weight:28935,4 Da
isoelectric point:9,08
hydropathy:-0,40
Representative Protein Details
Accession
3xQtT
Protein name
3xQtT
Sequence length
274 AA
Molecular weight
30711,69100 Da
Isoelectric point
8,02014
Sequence
MKKIADISNLNGNVDVKLLFNLGYIGIIAKASEGGTFVDKYYKQNYTNTKAQGKITGAYHFANFTTIAKAQQEANFFLSCIAGTTPDFVVLDLEQPCTGDITEACLAFLNIVAKKFRCVVYCNPSFIKEHLNSKICAYPLWIANYGVATPAFTLWTKYAMWQFTEKGQVSGISGYIDFSYITDEFIKYIKGEDEVENLVVYNDGADQRAAEYLADRLACPTINNARKFDYSNVKNVYAVGGNKEQYTSYLKTLITGSTRYATMQAVLDYIKNLK
Other Proteins in cluster: phalp2_34303
Total (incl. this protein): 28 Avg length: 281,3 Avg pI: 7,55

Protein ID Length (AA) pI
3xQtT 274 8,02014
3wVRd 306 5,39319
3xPJg 306 7,80856
4SvpE 279 8,22960
5HGc1 261 8,82110
5iZVb 276 8,44222
5tD2m 276 5,83472
5tIB0 273 8,56445
7BUDL 274 7,47355
7BdJw 234 9,01025
7BdPF 306 5,76719
7Bkgh 274 9,04790
7BvO8 306 5,58411
7cIYZ 276 8,44924
7cIZh 276 6,22014
7d3P3 313 8,51822
7h4pL 274 9,03249
7kjUF 276 9,00509
7kzGy 306 5,06807
7kzJy 294 5,28639
7kzN4 306 5,16254
7ou3W 277 8,84082
7u29a 274 8,56645
7uZxx 274 8,75141
8Kfkk 272 6,59215
D9ZNF3 274 7,47355
A0AAF0PIW3 274 7,47355
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_30706
7cHI2
66 41,8% 287 3.661E-86
2 phalp2_39458
4VDTp
155 33,9% 206 7.770E-72
3 phalp2_36157
6TiaL
25 33,6% 199 6.102E-42
4 phalp2_5615
4g2xP
40 30,2% 205 7.794E-39
5 phalp2_39912
1jXPT
300 33,5% 188 2.700E-38
6 phalp2_10644
2AeOK
60 29,5% 200 5.278E-36
7 phalp2_3425
3TK4D
923 29,4% 224 1.170E-34
8 phalp2_20360
2RVWF
69 32,1% 196 3.522E-33
9 phalp2_6199
5X5xO
102 28,0% 200 6.538E-33
10 phalp2_30061
2NCcc
23 29,3% 208 6.538E-33

Domains

Domains [InterPro]
GH25
Unannotated
Representative sequence (used for alignment): 3xQtT (274 AA)
Member sequence: I1TJX3 (264 AA)
1 274 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Clostridium phage phi8074-B1
[NCBI]
1147137 No lineage information
Host Clostridium sporogenes
[NCBI]
1509 Firmicutes > Clostridia > Clostridiales > Clostridiaceae > Clostridium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JQ246028 [NCBI]
CDS location
range 17678 -> 18472
strand +
CDS
ATGAAGATAGGTATTGATATGGGGCATACCCTAAGTGGGGCAGATTATGGAGTAGTTGGACTTAGACCAGAAAGCGTATTGACTAGAGAAGTAGGTACAAAGGTCATATACAAGCTTCAGAAGCTCGGACACGTCGTTGTTAACTGTACAGTAGATAAGGCATCAAGCGTAAGTGAAAGCTTATACACACGCTATTACAGGGCTAATCAAGCTAACGTAGACCTATTTATATCTATACACTTTAATGCAACACCGGGCGGAACAGGAACAGAAGTATACACATACGCAGGACGCCAACTAGGTGAAGCGACAAGAATAAGACAAGAGTTTAAATCTTTAGGACTAAGAGACAGAGGGACAAAAGATGGTAGTGGATTAGCAGTCATCAGAAATACAAAAGCAAAAGCTATGTTAGTGGAATGTTGTTTCTGCGACAACCCTAACGATATGAAATTATATAACTCCGAGAGTTTTTCCAACGCTATAGTAAAAGGCATAACAGGAAAATTACCGAATGGAGAATCTGGAAACAATAATCAAGGGGGTAATAAAGTGAAAGCAGTAGTAATTTATAACGAAGGTGCAGACAGAAGAGGAGCAGAATACTTAGCAGACTATTTAAATTGTCCAACCATATCTAACTCAAGAACATTCGACTATAGTTGCGTAGAACATGTATATGCAGTCGGGGGTAAAAAAGAACAATATACAAAGTATCTAAAAACATTATTATCTGGTGCTAATCGATATGATACAATGCAACAAATATTAAACTTCATCAACGGAGGTAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
I1TJX3
Method SMR
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi00025c0c49_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (3xQtT) rather than this protein.
PDB ID
3xQtT
Method AlphaFoldv2
Resolution 96.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50