Protein

Protein accession
C998 [EnVhog]
Representative
6U1kO
Source
EnVhog (cluster: phalp2_7734)
Protein name
C998
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGKVLLVLLAMLIVTLMLFTTRGEAVIVTPVLERIAQCETHRNDQHSTRDYISRYGIFRPSWNEYRPEWVHAVPSHKKAGERIPTRKEQDAVANHIARTAGLTAWGCYRQYSWVRNG
Physico‐chemical
properties
protein length:117 AA
molecular weight:13506,4 Da
isoelectric point:9,75
hydropathy:-0,32
Representative Protein Details
Accession
6U1kO
Protein name
6U1kO
Sequence length
117 AA
Molecular weight
13628,31260 Da
Isoelectric point
9,86690
Sequence
LKKALVIITVLATLVVTPAGNTGNNWQWQHKKWLPPVWQRIAQCETGTNWKHNSGTYQGAFGFYHGSWDAFNKFGYPREAYNATPWQQYKVALAIYKRYGFTGWGCYMNRSWVRNGY
Other Proteins in cluster: phalp2_7734
Total (incl. this protein): 4 Avg length: 121,5 Avg pI: 8,53

Protein ID Length (AA) pI
6U1kO 117 9,86690
2QkEw 140 9,60303
5xZ16 112 4,89562
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_32050
5GXAc
4 56,8% 88 3.644E-29
2 phalp2_35710
3QBh9
6 54,7% 84 9.849E-26
3 phalp2_18471
6HX4b
2 56,7% 74 4.370E-24
4 phalp2_33110
4KaxU
1 42,3% 92 2.308E-17
5 phalp2_26654
2j3QA
256 34,4% 93 8.650E-11
6 phalp2_13763
6RU1K
25 37,9% 108 1.185E-10
7 phalp2_40292
36Ynu
19 30,8% 120 3.769E-09
8 phalp2_34704
2W6sQ
31 31,0% 116 9.675E-09
9 phalp2_9748
88qrj
107 40,0% 95 2.232E-07
10 phalp2_22809
4CLdy
17 34,6% 78 3.053E-07

Domains

Domains
Disordered region
Transgly
Representative sequence (used for alignment): 6U1kO (117 AA)
Member sequence: C998 (117 AA)
1 117 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF06737

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6U1kO) rather than this protein.
PDB ID
6U1kO
Method AlphaFoldv2
Resolution 88.41
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50