Protein

Protein accession
4bjU1 [EnVhog]
Representative
2jgvd
Source
EnVhog (cluster: phalp2_2020)
Protein name
4bjU1
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MAYKEGIGENGKLSSSVLKSVGGNHALFKTAADSYLRMKAAAEDSGVTFNLVSSYRKCGRKGDYTERNCKGGATQWCLWEKYKAGKNAVAADPSTSKGCKSNHGYGLAIDLYPKSAQDWVKKNGEKYGWMWDEGKSIGEDWHFRYSPSNDTFNKSGLTATQYRNIAIGLGLTAFALAFWYYTRPESK
Physico‐chemical
properties
protein length:187 AA
molecular weight:20707,0 Da
isoelectric point:9,30
hydropathy:-0,67
Representative Protein Details
Accession
2jgvd
Protein name
2jgvd
Sequence length
240 AA
Molecular weight
27493,79150 Da
Isoelectric point
9,08761
Sequence
MAVLAQTKKIKDITLVNGQLPNEILISIGGNEKLFQPAAESFFKMMEAAKKSNLKYYIEDTYRLCGEPGDGEKYLKGETDFTQWAAWDLSQLKKNNPKDPRYSKYNLAADPTEGCKSKHGYGLAIDIYNNPKVITPNFYKGIYTTVDKKKIYSINPKQDELQTWIRNNGGIYGWVWTGVNFPTIEPWHFDYFYEKDQTKSSYSAPLIAKTPTQINKSTTQDQNKYAVNKNVTKNINSFFS
Other Proteins in cluster: phalp2_2020
Total (incl. this protein): 6 Avg length: 236,2 Avg pI: 8,55

Protein ID Length (AA) pI
2jgvd 240 9,08761
2jf9M 285 6,86072
7TvRt 194 9,31402
87soV 279 7,67067
Jz3z 232 9,07929
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36221
7r53d
5 26,8% 186 1.298E-13
2 phalp2_29744
1svLh
1 26,8% 194 5.211E-11
3 phalp2_33152
7COCJ
1 23,6% 186 3.109E-10
4 phalp2_26267
LAL7
1473 26,7% 161 3.331E-09
5 phalp2_29962
8rfJR
9 25,5% 180 1.086E-08
6 phalp2_37797
4C1UL
98 24,4% 213 3.527E-08
7 phalp2_36869
7lOzC
4 22,2% 247 2.053E-07
8 phalp2_37143
1r1gj
3 23,1% 177 6.619E-07
9 phalp2_9358
bQoR
17 22,8% 188 6.619E-07
10 phalp2_35030
7HQbI
2 22,5% 186 2.127E-06

Domains

Domains
Unannotated
Disordered region
Representative sequence (used for alignment): 2jgvd (240 AA)
Member sequence: 4bjU1 (187 AA)
1 240 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4bjU1
Method AlphaFoldv2
Resolution 87.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2jgvd) rather than this protein.
PDB ID
2jgvd
Method AlphaFoldv2
Resolution 73.71
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50