Protein

Protein accession
2cu97 [EnVhog]
Representative
2UWwJ
Source
EnVhog (cluster: phalp2_11964)
Protein name
2cu97
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MFRKFVTSNTFIAIVLTVIAMGVFAVMDFFPRLSASVSENSIERPDMLSEVFFESEVPFRQIGPLRQPETDDQKIIRFSSIIEKRQPRLDPAIAEQIAKCTLISSKRYGFPPEMILALMKRESTFIPTLVSSAGCKGLMQVYPEKHLKKLKRRGISKDSPKIFYIAPNIDIGCEILREYYDSAKGNVKQALKQYVGGKHDTYLTDVTVEFTSLMLEKFDSSNM
Physico-chemical properties
Protein length: 223 AA
Molecular weight: 25382.3 Da
Isoelectric point: 9.03
Hydropathy: -0.12
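
The length and hydropathy values can be cross-checked against the sequence itself. The sketch below assumes that "hydropathy" refers to the GRAVY score, i.e. the mean Kyte-Doolittle hydropathy per residue; the page does not state which scale it uses, and the names `KYTE_DOOLITTLE`, `SEQUENCE` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per amino acid. Using this scale is an
# assumption: the page does not say how its "hydropathy" value is derived.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 2cu97 as listed on this page, split for readability.
SEQUENCE = (
    "MFRKFVTSNTFIAIVLTVIAMGVFAVMDFFPRLSASVSENSIERPDMLSE"
    "VFFESEVPFRQIGPLRQPETDDQKIIRFSSIIEKRQPRLDPAIAEQIAKC"
    "TLISSKRYGFPPEMILALMKRESTFIPTLVSSAGCKGLMQVYPEKHLKKL"
    "KRRGISKDSPKIFYIAPNIDIGCEILREYYDSAKGNVKQALKQYVGGKHD"
    "TYLTDVTVEFTSLMLEKFDSSNM"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


if __name__ == "__main__":
    print(f"Protein length: {len(SEQUENCE)} AA")
    print(f"GRAVY (Kyte-Doolittle): {gravy(SEQUENCE):.2f}")
```

The printed length should match the 223 AA listed above. The GRAVY value can be compared with the listed hydropathy, keeping in mind that the page may use a different scale or rounding.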
Representative Protein Details
Accession
2UWwJ
Protein name
2UWwJ
Sequence length
226 AA
Molecular weight
26627.87140 Da
Isoelectric point
9.42878
Sequence
MREGRNVDFETVGLVMIFVIVFVLLHHIFIITNKIEVYRTTNEIKANNIEQKVEELSNKIDKISSSIEDVKKQVNYLIKQENKTKKLSLFIKEVNPYLPNHLVKVISKTIIQSSNKYSVPVEVILAVAWQESHFKVNVVSSAGCIGIMQINPEVWTKKLNIPEEVLWYPQINIEVGTYILRYYYEKEGSWEKAIERYYGKDWFGKKYRKMVLRKIRKVRDIIKDIS
Other Proteins in cluster: phalp2_11964
Total (incl. this protein): 5
Avg length: 222.2 AA
Avg pI: 9.36

Protein ID  Length (AA)  pI
2UWwJ       226          9.42878
2UTlZ       226          9.21655
2UrWQ       229          9.56493
2V4UX       207          9.53244
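
The cluster totals follow from the per-member values. The sketch below recomputes them from the figures on this page; `MEMBERS` and `cluster_summary` are illustrative names, and because the pI of 2cu97 is listed only to two decimals, the recomputed mean pI may differ from the page's value in the last digit.

```python
from statistics import mean

# (protein ID, length in AA, isoelectric point) as listed on this page.
# 2cu97 is this protein; its pI is only available rounded to two decimals.
MEMBERS = [
    ("2cu97", 223, 9.03),
    ("2UWwJ", 226, 9.42878),
    ("2UTlZ", 226, 9.21655),
    ("2UrWQ", 229, 9.56493),
    ("2V4UX", 207, 9.53244),
]


def cluster_summary(members):
    """Return (member count, mean length, mean pI) for a list of members."""
    lengths = [length for _, length, _ in members]
    pis = [pi for _, _, pi in members]
    return len(members), mean(lengths), mean(pis)


if __name__ == "__main__":
    total, avg_len, avg_pi = cluster_summary(MEMBERS)
    print(f"Total: {total}  Avg length: {avg_len:.1f}  Avg pI: {avg_pi:.2f}")
```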
Similar Clusters (pHMM search)
#   Cluster                # Members  Identity (%)  Alignment Length  E-value
1   phalp2_38948 (3NAZh)   14         30.4%         151               1.087E-31
2   phalp2_23989 (3MMD6)   17         32.6%         156               1.477E-26
3   phalp2_8985 (3QkV7)    9          30.5%         193               2.871E-24
4   phalp2_9885 (2tkXx)    10         31.2%         160               1.635E-20
5   phalp2_38521 (1AQSM)   1          24.8%         181               1.625E-16
6   phalp2_30261 (4i1tO)   1          26.8%         149               7.476E-16
7   phalp2_19205 (258Vx)   30         28.4%         144               3.905E-14
8   phalp2_27300 (4hoPU)   11         30.5%         154               4.939E-12
9   phalp2_28534 (40dtQ)   23         26.3%         167               1.818E-10
10  phalp2_7492 (7FU2r)    1          25.5%         188               7.758E-06

Domains

Representative sequence (used for alignment): 2UWwJ (226 AA)
Member sequence: 2cu97 (223 AA)
Domain positions follow the representative sequence above (residues 1 to 226); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01464

Taxonomy

       Name                            Taxonomy ID     Lineage
Phage  Unknown from Metagenome [NCBI]  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 2cu97
Method: AlphaFold v2
Resolution: 82.07
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
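
The confidence legend can be expressed as a small lookup. This is a minimal sketch, assuming the number shown under "Resolution" for an AlphaFold model is read as a mean pLDDT score, which the page does not state; `plddt_band` is a hypothetical helper, and its handling of the exact boundary values 90, 70 and 50 is also an assumption, since the legend uses strict inequalities.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    # Assumption: the legend leaves 90, 70 and 50 unassigned, so boundary
    # values are placed in the higher band here.
    if score >= 90:
        return "Very high"
    if score >= 70:
        return "High"
    if score >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Values listed on this page for 2cu97 and the representative 2UWwJ.
    for accession, score in [("2cu97", 82.07), ("2UWwJ", 94.16)]:
        print(f"{accession}: {score} -> {plddt_band(score)}")
```

Under these assumptions, 82.07 falls in the "High" band and 94.16 in the "Very high" band.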

The structures below correspond to the cluster representative (2UWwJ) rather than this protein.
PDB ID: 2UWwJ
Method: AlphaFold v2
Resolution: 94.16
Chain position: -