Protein

Protein accession
7XENF [EnVhog]
Representative
4GGll
Source
EnVhog (cluster: phalp2_35838)
Protein name
7XENF
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFLLPLTILPRVSSSFGPRVDPYFGKSHTHNGIDLSVPEGTPVLAAAAGTVSKEWTATADGRAQPNGNALKIDHGNGYATAYLHLSRKAVGMGARVSAGQVIGYVGSTGASTGPHLHFMVYQNGTPVDPKPFIVWSVGSAVTQTGAAVVKGAATTWWVWGGVAGIALLLILRARSRSTGASPATASPSPAA
Physico-chemical properties
protein length: 191 AA
molecular weight: 19559.0 Da
isoelectric point: 10.06
hydropathy: 0.14
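The record does not say how the hydropathy value is derived. One common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that convention; it is not confirmed as the method used by this database, and the function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy (GRAVY) of a protein sequence.

    Assumption: the sequence contains only the 20 standard residues.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Member sequence 7XENF, copied from the record above.
SEQ_7XENF = "MFLLPLTILPRVSSSFGPRVDPYFGKSHTHNGIDLSVPEGTPVLAAAAGTVSKEWTATADGRAQPNGNALKIDHGNGYATAYLHLSRKAVGMGARVSAGQVIGYVGSTGASTGPHLHFMVYQNGTPVDPKPFIVWSVGSAVTQTGAAVVKGAATTWWVWGGVAGIALLLILRARSRSTGASPATASPSPAA"

if __name__ == "__main__":
    # Prints the sequence length and the GRAVY score rounded to two decimals.
    print(len(SEQ_7XENF), round(gravy(SEQ_7XENF), 2))
```

A positive score indicates a sequence that is hydrophobic on average, a negative score a hydrophilic one. If the result differs from the value listed in the record, the database may be using a different scale or averaging scheme.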
Representative Protein Details
Accession
4GGll
Protein name
4GGll
Sequence length
186 AA
Molecular weight
19332.84260 Da
Isoelectric point
9.86639
Sequence
VAEAFPVALASPLPLGSYKVTSPFGPRINPVTKEPQLHNGIDLGAPNGTPIYAAAAGKVTTASVGPVTGNWVKIDHGSGIATAYLHMSAIVARPGITVNAGDLIGYVGSTGRSTGNHLHFIVYVNGKEVDPAPYVRWDVLASALVGAGYGAARNLAYSWALTTAVGLLVLGGYGLWRWRSTRAGRR
Other Proteins in cluster: phalp2_35838
Total (incl. this protein): 4; Avg length: 192.0 AA; Avg pI: 9.35

Protein ID Length (AA) pI
4GGll 186 9.86639
45icx 194 8.80852
6jOq2 197 8.66689
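The cluster summary can be checked against the listed members. The sketch below assumes the averages are plain arithmetic means over all four proteins, taking the length and pI of 7XENF from its physico-chemical properties earlier in this record.

```python
from statistics import mean

# (protein ID, length in AA, isoelectric point), as listed in this record.
members = [
    ("7XENF", 191, 10.06),
    ("4GGll", 186, 9.86639),
    ("45icx", 194, 8.80852),
    ("6jOq2", 197, 8.66689),
]

avg_length = mean([float(length) for _, length, _ in members])
avg_pi = mean([pi for _, _, pi in members])

print(f"{len(members)} members, avg length {avg_length:.1f} AA, avg pI {avg_pi:.2f}")
# prints: 4 members, avg length 192.0 AA, avg pI 9.35
```

Both values agree with the summary line, which suggests the reported averages do include this protein.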
Similar Clusters (pHMM search)
#   Cluster (protein ID)   Members  Identity (%)  Alignment length  E-value
1   phalp2_14848 (7w0BH)   74       41.6          137               1.697E-31
2   phalp2_15404 (1Utrm)   46       40.9          127               1.367E-29
3   phalp2_19030 (17y4D)   9        43.9          132               1.675E-28
4   phalp2_36864 (7kPyj)   9        39.5          134               5.863E-28
5   phalp2_27075 (2pChT)   5        44.7          123               8.019E-28
6   phalp2_31190 (87Snf)   9        45.3          130               5.246E-27
7   phalp2_3247 (2k2so)    54       44.0          118               2.508E-26
8   phalp2_9303 (7pYyV)    24       36.6          142               8.762E-26
9   phalp2_35850 (4Jl8n)   12       46.3          136               3.060E-25
10  phalp2_18623 (3ebV)    83       41.6          137               5.718E-25

Domains

PET_M23
Disordered region
Representative sequence (used for alignment): 4GGll (186 AA)
Member sequence: 7XENF (191 AA)
Axis: residues 1 to 186 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

       Name                            Taxonomy ID     Lineage
Phage  Unknown from Metagenome [NCBI]  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4GGll) rather than this protein.
PDB ID: 4GGll
Method: AlphaFoldv2
Resolution: 81.67
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
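The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of scores exactly at 90, 70, or 50 is an assumption (they are placed in the lower band here).

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    since the legend does not say where they belong.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.2, 81.67, 63.0, 41.5):
        print(value, plddt_band(value))
```

If the 81.67 listed under Resolution is in fact a mean pLDDT for the AlphaFold model, which the record does not state, it would fall in the "High" band.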