Protein

Protein accession
4Mdza [EnVhog]
Representative
3gQoC
Source
EnVhog (cluster: phalp2_17197)
Protein name
4Mdza
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKADEFARIAESFLNVPTCYLLGFWGQYLSLAEYNRVNSLRGIAGKNDKFNNKRFCGTNVFAFDCINFIKGILGGTTTKKRVPYENIKKCPIGDCTNQEFLDMMKKSGVKPKDATRGMGLATSGHAAIALGGGKWIDANFTGSQNGVAIHTTGIDQFTAAGYIDGIDYTEASDIQIGDIMEMEVYEIRDGFAYGKVPISVPVPKQIEPGSKVTINKGAKAGGTNPDYRGKYINEAYTNGKFVDTVDRIETHYGNKEALLKALITWVALESLTLVE
Physico‐chemical
properties
protein length:275 AA
molecular weight:30072,9 Da
isoelectric point:7,57
hydropathy:-0,24
Representative Protein Details
Accession
3gQoC
Protein name
3gQoC
Sequence length
276 AA
Molecular weight
30034,57420 Da
Isoelectric point
6,88715
Sequence
MNNLEFAQNAESFASVLTCYLYGFFGQPLTQEEYNRVNALPAIHGSNNKFGNQKYIGTGVFAFDCICYVKALLSGATTQRRVDYNAIKNCPIHDCTNQEFLEMMQKDNINPKNATRGMGLASSSHAAVALGNGRWADANFTSGQNGVKIHDSGIEQFTCAGRIYGIDYLDDVKVGDIIPMKVTRIEVGEMGTVAYGQAEINPVYDTIKVGSKVTIAKGAVSGGGNPEYANKPILSKYANGKYVDTVAELGTFNGEQQARLKNINTWVAYKYLTVVG
Other Proteins in cluster: phalp2_17197
Total (incl. this protein): 7 Avg length: 277,3 Avg pI: 6,78

Protein ID Length (AA) pI
3gQoC 276 6,88715
3gK9r 276 6,30898
4OdmS 276 7,00128
4kcFg 276 6,00080
7DP9J 273 5,59366
7DR0a 289 8,11704
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15715
3fRXN
1 36,6% 169 1.216E-41
2 phalp2_30278
4kbSa
2 26,8% 276 3.483E-16
3 phalp2_29540
3v8t1
106 20,9% 244 2.745E-11
4 phalp2_16170
4yiWs
67 22,9% 187 2.862E-10
5 phalp2_3306
2Vjdq
8 23,6% 186 2.952E-09
6 phalp2_13292
3o9d4
478 23,3% 193 5.281E-09
7 phalp2_16010
3o4Sp
262 21,5% 260 5.372E-08
8 phalp2_31586
4l17x
11 23,8% 335 5.407E-07
9 phalp2_21023
7YTt
3 21,1% 289 3.034E-06
10 phalp2_9253
6XprS
108 22,4% 174 1.694E-05

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 3gQoC (276 AA)
Member sequence: 4Mdza (275 AA)
1 276 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3gQoC) rather than this protein.
PDB ID
3gQoC
Method AlphaFoldv2
Resolution 77.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50