Protein

Protein accession
6Gq5j [EnVhog]
Representative
6G1uI
Source
EnVhog (cluster: phalp2_16457)
Protein name
6Gq5j
Lysin probability
95%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MLLCVTHALFALLVVFTPPRGGEPVLAPHVRARGGEAYVTELGAWVHSAAELEHVSPSLLAFLLYLESSGDSSKVEPKTGSEGLMQLNPGSPWGMAWKHACELEPDRCEELNVVFGARALRAGFDACGDDAVQAVSFYRSGRCVPKKRKVRRRSAWVVERAAQLESGHGST
Physico‐chemical
properties
protein length:171 AA
molecular weight:18528,0 Da
isoelectric point:7,01
hydropathy:-0,09
Representative Protein Details
Accession
6G1uI
Protein name
6G1uI
Sequence length
161 AA
Molecular weight
17014,37630 Da
Isoelectric point
9,82790
Sequence
MSITAAILALSTIFVGAHGPVHSKALREHALGAVERAEQLAAYVHAAARDAGVDAALLTVLLYLESSFRAHAVHPETGAYGLGALHPRSVWARALQGGCARSPSTCEQMSVVWSARALARGIQGCGGELEGVGWYRSGRCVAGPRARFAMRLRERVWKGAA
Other Proteins in cluster: phalp2_16457
Total (incl. this protein): 7 Avg length: 173,7 Avg pI: 9,69

Protein ID Length (AA) pI
6G1uI 161 9,82790
3O0dE 174 9,42781
6EVYP 165 9,60510
6GJOx 174 9,88940
6GqJB 182 10,84405
k2zU 189 11,25684
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7453
4N7RB
7 37,1% 140 1.677E-25
2 phalp2_5676
4AQE5
7 32,2% 149 1.673E-22
3 phalp2_34826
3QB0u
3 31,1% 109 1.339E-12
4 phalp2_4231
3O2J1
27 29,7% 121 5.524E-11
5 phalp2_34906
4kB5k
11 34,1% 120 3.060E-09
6 phalp2_34514
4NmGW
2 27,9% 154 5.665E-09
7 phalp2_25410
2zxe8
226 28,8% 142 3.584E-08
8 phalp2_26388
1vB2c
5 31,4% 121 5.647E-07
9 phalp2_39377
4GGn4
3 28,1% 160 1.915E-06
10 phalp2_8654
44Rd9
6 29,4% 156 8.778E-06

Domains

Domains
Unannotated
Representative sequence (used for alignment): 6G1uI (161 AA)
Member sequence: 6Gq5j (171 AA)
1 161 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6G1uI) rather than this protein.
PDB ID
6G1uI
Method AlphaFoldv2
Resolution 95.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50