Protein

Protein accession
8i5VK [EnVhog]
Representative
4f3sa
Source
EnVhog (cluster: phalp2_40429)
Protein name
8i5VK
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MATSYNGYEVISNYGDPRLVLLTANGKKFAPEPGILGGPVGVVMNFLADFLDREVEPMDLTQQLDDWAFEPKTVTGGSGWSCHASATAFDYNALNHPQGASNTWTSVQLSKVDAFLANVLEGCVRWGEHYTYPTKRDGMHYEVMAPINVMIRIADKVSGIKVITPIPVLSPAPTPTPVKDELNMIQVYFVNVAGTIYEANLLSGTKRGLHTTKELEDRRYILSKAGIQTGDWNKLVPVDDPDAFGTTIT
Physico‐chemical
properties
protein length:249 AA
molecular weight:27281,7 Da
isoelectric point:5,09
hydropathy:-0,16
Representative Protein Details
Accession
4f3sa
Protein name
4f3sa
Sequence length
241 AA
Molecular weight
26151,16130 Da
Isoelectric point
6,58067
Sequence
VPVSQNGYTANDTSALDTYTVPGSRVRLRLRKGPPATVLLYLASRFDDEVEDIDTAGTYIQDAAPSIPGGEPSTLADDWSYAPRPIRGSTTTLSNHASGTAIDLNATQHPRGSARTFTAEQTRRVRAILASLRDPLTGRPVVRWGQDYVSAPTDGMHFEVDADPAAVARVAHHITAAAAPTPLPEEAPMIVIRREKGWWYLALGGKLVGLYAGQKIDKTIPRITVDEKQWTRLAQAVEVVM
Other Proteins in cluster: phalp2_40429
Total (incl. this protein): 2 Avg length: 245,0 Avg pI: 5,84

Protein ID Length (AA) pI
4f3sa 241 6,58067
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36143
6QfrR
13 46,6% 193 1.777E-36
2 phalp2_5098
1IyeS
221 44,0% 209 1.021E-34
3 phalp2_28421
1FBxb
7 46,1% 182 1.021E-34
4 phalp2_38848
1Icxu
17 46,5% 176 1.904E-34
5 phalp2_33832
1p4wJ
21 44,9% 189 9.033E-34
6 phalp2_19831
5lDZw
29 41,8% 196 2.682E-27
7 phalp2_34617
5knDd
1 37,9% 211 1.135E-21
8 phalp2_39385
4JjMD
8 37,3% 174 9.602E-19
9 phalp2_19694
4HC63
25 37,3% 166 2.400E-18
10 phalp2_11272
6Q92e
9 36,9% 157 8.129E-18

Domains

Domains
PET_M15
Unannotated
Representative sequence (used for alignment): 4f3sa (241 AA)
Member sequence: 8i5VK (249 AA)
1 241 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
8i5VK
Method AlphaFoldv2
Resolution 74.51
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4f3sa) rather than this protein.
PDB ID
4f3sa
Method AlphaFoldv2
Resolution 79.41
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50