Protein

Protein accession
gXVh [EnVhog]
Representative
4fOqa
Source
EnVhog (cluster: phalp2_26883)
Protein name
gXVh
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MDGLITLEQAIARYGQIVNGVWANEAKFCTLLEVPPALQPTLINSATGKPTTHIYCNKDIEVYLLCALSNVVSRGLASELKTFDGCFEPRDIRGEPGKLSTHSYALAVDFNAKENPLGGPSRMPKEL
Physico‐chemical
properties
protein length:127 AA
molecular weight:13821,7 Da
isoelectric point:5,64
hydropathy:-0,10
Representative Protein Details
Accession
4fOqa
Protein name
4fOqa
Sequence length
151 AA
Molecular weight
16765,98200 Da
Isoelectric point
8,26035
Sequence
MTRTECSNKYGPIVDGVWALEAKHCVSIQVPAPFNRFTKNSVTGRVWTSVYCNRDMAKPLLAALENLIACGRAQELITFDGCFNIRPVRGEPTLLSAHSWALAVDFNAPFNPLGGPSRMTPEFVRCFTKEGFTWGGAFAREDPQHYSFAGF
Other Proteins in cluster: phalp2_26883
Total (incl. this protein): 24 Avg length: 158,5 Avg pI: 7,23

Protein ID Length (AA) pI
4fOqa 151 8,26035
1MmmM 155 7,82042
1ojJe 192 5,65420
1opns 150 5,60702
1ovAd 193 5,85091
1qtQP 152 8,80259
3NSjW 150 6,57442
4DsSl 138 8,95268
4Ivjq 157 9,04409
4Y8de 151 5,00345
4dPOb 159 8,42848
4dPqi 131 7,70137
4fYBW 151 9,03597
5ngWn 172 8,36511
6HV6i 160 5,76480
6PXYm 163 9,06704
6VGQt 160 5,14776
6W48c 155 6,07657
6Y5ke 158 5,76537
8i3qF 154 5,88564
8kaeI 153 6,41641
8o8jc 151 8,70383
8r40 221 9,86349
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28777
4SR8r
700 44,0% 125 2.840E-51
2 phalp2_35615
1iltm
50 38,7% 142 3.561E-40
3 phalp2_2616
6M0Mp
4 35,6% 132 5.167E-34
4 phalp2_20305
2fycX
1 31,6% 155 5.295E-31
5 phalp2_29493
1NtQH
15 33,7% 145 3.272E-23
6 phalp2_39364
4EoD5
60 31,5% 111 2.398E-20
7 phalp2_19861
5zIVL
30 34,5% 110 2.158E-19
8 phalp2_19828
5klLF
236 29,8% 114 1.271E-17
9 phalp2_23138
4Fgaj
4 29,1% 137 1.558E-16
10 phalp2_33629
xXFp
1153 25,6% 160 1.698E-14

Domains

Domains
Representative sequence (used for alignment): 4fOqa (151 AA)
Member sequence: gXVh (127 AA)
1 151 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4fOqa) rather than this protein.
PDB ID
4fOqa
Method AlphaFoldv2
Resolution 97.35
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50