Protein

Protein accession: 4yvcs [EnVhog]
Representative: 4yvcs (this protein)
Source: EnVhog (cluster: phalp2_3540)
Protein name: 4yvcs
Lysin probability: 75%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MGVVLLALAIKTQYGASGETTEASIQSSVALSVTMEAVTAVETTLPTDVTPIPTATATKAPETYYKMKLTKKDSYLLAKIAMAEAEGEPLKGKELVIMVVLNRMLDDEFPDNVHDVIYQEKQFSPIADGRFDKVEPNADCWKALKKVKSLENDFSEGALYFENCANADNWHSRNLTFLYEYGNHKFYK
Physico-chemical properties
protein length: 188 AA
molecular weight: 20958.6 Da
isoelectric point: 5.04
hydropathy: -0.27
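The length, molecular weight and hydropathy listed above can be recomputed from the sequence. The following is a minimal sketch, assuming average residue masses and the Kyte-Doolittle hydropathy scale (GRAVY); the page does not state which method it used, so small differences from the listed values are possible. The function names `molecular_weight` and `gravy` are illustrative, not part of any PhaLP tooling.

```python
# Sketch: recompute basic physico-chemical properties from the sequence.
# Assumptions (not stated on the page): average residue masses, Kyte-Doolittle scale.

SEQ = "MGVVLLALAIKTQYGASGETTEASIQSSVALSVTMEAVTAVETTLPTDVTPIPTATATKAPETYYKMKLTKKDSYLLAKIAMAEAEGEPLKGKELVIMVVLNRMLDDEFPDNVHDVIYQEKQFSPIADGRFDKVEPNADCWKALKKVKSLENDFSEGALYFENCANADNWHSRNLTFLYEYGNHKFYK"

# Average mass of each amino acid residue (amino acid minus one water), in Da.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the free N- and C-termini

# Kyte-Doolittle hydropathy index per residue.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da of an unmodified linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle index over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


print(f"length: {len(SEQ)} AA")
print(f"molecular weight: {molecular_weight(SEQ):.1f} Da")
print(f"hydropathy (GRAVY): {gravy(SEQ):.2f}")
```

The isoelectric point is left out because it depends on the chosen pKa set, which the page does not specify.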
Other Proteins in cluster: phalp2_3540
Total (incl. this protein): 35; Avg length: 191.3 AA; Avg pI: 5.18

Protein ID  Length (AA)  pI
13GqH       232          4.65582
13tZE       233          4.61075
1NAo7       195          5.98022
1cBSl       183          4.67020
1cMMX       190          4.97542
1dR6r       141          4.42124
1icJA       191          4.42221
1kYnY       189          4.32212
2VmRG       169          4.57164
2XLz2       208          4.91966
3NIk        199          5.80436
3dNep       203          5.55734
3pJx9       210          4.42971
3sJpN       207          4.40669
3ulAt       209          4.48451
41bNp       167          4.79752
4NGTK       224          5.08097
4kaQe       157          9.31731
5TOsS       190          4.26755
5UDuQ       208          4.54760
5i1ks       220          6.21889
6mMMT       208          4.93626
7PsqD       209          9.09670
84SpN       150          4.38083
8869M       150          4.38083
8dU1N       184          4.54771
8nzwF       184          4.54771
8pCco       208          4.51662
8qOC8       150          4.44330
8rC0A       175          5.63743
8rYmW       154          4.58323
CVsk        197          4.35696
niic        207          5.31583
ocjl        208          9.03365
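The cluster summary (35 proteins, average length and average pI) can be checked against the member table. A minimal sketch follows, assuming the averages are plain arithmetic means over the 34 listed members plus this protein (188 AA, pI 5.04); the helper `parse_rows` is hypothetical and accepts either a decimal comma or a decimal point in the pI column.

```python
# Sketch: recompute the cluster averages from the member table.
# Assumption: the summary uses simple arithmetic means including this protein.

MEMBERS = """\
13GqH 232 4.65582
13tZE 233 4.61075
1NAo7 195 5.98022
1cBSl 183 4.67020
1cMMX 190 4.97542
1dR6r 141 4.42124
1icJA 191 4.42221
1kYnY 189 4.32212
2VmRG 169 4.57164
2XLz2 208 4.91966
3NIk 199 5.80436
3dNep 203 5.55734
3pJx9 210 4.42971
3sJpN 207 4.40669
3ulAt 209 4.48451
41bNp 167 4.79752
4NGTK 224 5.08097
4kaQe 157 9.31731
5TOsS 190 4.26755
5UDuQ 208 4.54760
5i1ks 220 6.21889
6mMMT 208 4.93626
7PsqD 209 9.09670
84SpN 150 4.38083
8869M 150 4.38083
8dU1N 184 4.54771
8nzwF 184 4.54771
8pCco 208 4.51662
8qOC8 150 4.44330
8rC0A 175 5.63743
8rYmW 154 4.58323
CVsk 197 4.35696
niic 207 5.31583
ocjl 208 9.03365
"""

THIS_PROTEIN = ("4yvcs", 188, 5.04)  # length and pI from the properties section


def parse_rows(text):
    """Turn 'ID length pI' lines into (id, int, float) tuples."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue
        protein_id, length, pi = line.split()
        # Normalise a decimal comma to a decimal point before converting.
        rows.append((protein_id, int(length), float(pi.replace(",", "."))))
    return rows


rows = parse_rows(MEMBERS) + [THIS_PROTEIN]
avg_length = sum(r[1] for r in rows) / len(rows)
avg_pi = sum(r[2] for r in rows) / len(rows)

print(len(rows), round(avg_length, 1), round(avg_pi, 2))  # 35 191.3 5.18
```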
Similar Clusters (pHMM search)
#   Cluster               # Members  Identity (%)  Alignment Length  E-value
1   phalp2_21750 (3Q2vo)  389        46.4%         170               1.391E-53
2   phalp2_33788 (1cBeC)  29         46.9%         147               4.098E-48
3   phalp2_9804 (8oqJz)   6          47.7%         159               5.614E-48
4   phalp2_33822 (1kYXn)  17         49.6%         129               1.337E-40
5   phalp2_19006 (135l5)  10         42.7%         124               1.088E-32
6   phalp2_1335 (14NLd)   89         35.7%         151               1.337E-31
7   phalp2_34576 (4ZO4j)  9          37.2%         118               2.753E-29
8   phalp2_40085 (7WX1M)  32         33.0%         121               2.465E-28
9   phalp2_34905 (4k7zF)  37         37.8%         119               2.465E-28
10  phalp2_32638 (21P5a)  8          39.1%         120               1.053E-26
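The similar-cluster hits can be filtered and ranked programmatically. The sketch below is illustrative only: the 45% identity cut-off and the `summarize` helper are hypothetical choices, and the "span" value (alignment length divided by the 188-residue query) is a rough indicator, since the page does not say whether the alignment length includes gaps.

```python
# Sketch: filter pHMM hits by identity and rank them by E-value.
# Assumptions: the identity threshold and the span calculation are illustrative.

QUERY_LENGTH = 188  # length of 4yvcs in amino acids

# (cluster, members, identity %, alignment length, E-value as printed on the page)
HITS = [
    ("phalp2_21750", 389, 46.4, 170, "1.391E-53"),
    ("phalp2_33788", 29, 46.9, 147, "4.098E-48"),
    ("phalp2_9804", 6, 47.7, 159, "5.614E-48"),
    ("phalp2_33822", 17, 49.6, 129, "1.337E-40"),
    ("phalp2_19006", 10, 42.7, 124, "1.088E-32"),
    ("phalp2_1335", 89, 35.7, 151, "1.337E-31"),
    ("phalp2_34576", 9, 37.2, 118, "2.753E-29"),
    ("phalp2_40085", 32, 33.0, 121, "2.465E-28"),
    ("phalp2_34905", 37, 37.8, 119, "2.465E-28"),
    ("phalp2_32638", 8, 39.1, 120, "1.053E-26"),
]


def summarize(hits, min_identity=45.0):
    """Keep hits at or above min_identity, sorted by ascending E-value."""
    kept = []
    for cluster, _members, identity, aln_len, evalue in hits:
        if identity >= min_identity:
            # float() parses scientific notation such as '1.391E-53' directly.
            kept.append((cluster, identity, float(evalue), aln_len / QUERY_LENGTH))
    return sorted(kept, key=lambda hit: hit[2])


for cluster, identity, evalue, span in summarize(HITS):
    print(f"{cluster}: identity {identity}%, E-value {evalue:.3e}, span {span:.2f}")
```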

Domains

Domain architecture of 4yvcs (residues 1-188): disordered region; Hydro_2 domain
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF07486

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  [NCBI] UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 4yvcs
Method: AlphaFoldv2
Resolution: 77.48
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
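The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, assigning the boundary values 90, 70 and 50 to the lower band is an assumption. The example call uses 77.48, the figure the page lists under Resolution; treating it as a mean pLDDT is also an assumption, since the page does not say what that number measures.

```python
# Sketch: map a pLDDT score (0-100) to the confidence bands in the legend.
# Assumption: scores exactly on a boundary (90, 70, 50) go to the lower band.

def plddt_band(score: float) -> str:
    """Return the legend label for a pLDDT score."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


print(plddt_band(77.48))  # High
```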