Protein

Protein accession
5i3si [EnVhog]
Representative
7ul2t
Source
EnVhog (cluster: phalp2_3977)
Protein name
5i3si
Lysin probability
90%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGGIPNGRLDESTMVNLGSGWDKYGQWFHLAPPGTAARWQELVRLAHEKYGVRLRVTPGWNIYRPLRIQYLYRADLGIWAAVPGTSSHGGEYQGRECCAIDVQNWGDLGWARFVALCRIVGLTVNFVKPQELWHVGDFNPWTVPQFAAINIKPLPIIQPEPEEDEMGFYYGPTDGSKPFMFFNQARGKSRSISSAEWSMLRAFQRGAAPVLPMDVHMVSTYWYDEAVRLGTY
Physico‐chemical
properties
protein length:232 AA
molecular weight:26330,7 Da
isoelectric point:6,96
hydropathy:-0,28
Representative Protein Details
Accession
7ul2t
Protein name
7ul2t
Sequence length
230 AA
Molecular weight
25964,99570 Da
Isoelectric point
9,49602
Sequence
MAYRNGEVPLHKLIHVQNNIWLPAGTLARWNWAVQQGVKKYGVRLRITGQNSTGKYTWNGYRPLSAQKLYRNAFGQLAAVPGWSSHGGWYHNQEVFAIDVDNWAEMGWARFAALMRLAGLRVDFVSPREQWHVGDFNNPWVIPAFASSGSGSHLNPKPAPKPAPKPDESEEDDMEPSYVYNRAPYDRVYVIHPITGKKRPLSKAEWEAAKLNGAKAWDATKAQVDEIKNA
Other Proteins in cluster: phalp2_3977
Total (incl. this protein): 4 Avg length: 229,8 Avg pI: 7,77

Protein ID Length (AA) pI
7ul2t 230 9,49602
5i3pp 227 7,72558
5i3sf 230 6,90380
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3758
5Emni
17 43,5% 218 7.673E-65
2 phalp2_39492
5i0H3
1 29,7% 195 2.115E-21
3 phalp2_22603
1KKhU
23 33,8% 192 2.878E-21
4 phalp2_32540
1rt6v
7 30,6% 222 1.824E-18
5 phalp2_26065
7gnet
1 32,1% 230 4.262E-14
6 phalp2_16194
4ErvX
37 30,1% 169 7.827E-07
7 phalp2_6293
6RzlU
1 25,3% 260 1.407E-06

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 7ul2t (230 AA)
Member sequence: 5i3si (232 AA)
1 230 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5i3si
Method AlphaFoldv2
Resolution 74.06
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7ul2t) rather than this protein.
PDB ID
7ul2t
Method AlphaFoldv2
Resolution 77.37
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50