Protein

Protein accession
4oX8J [EnVhog]
Representative
5cQF6
Source
EnVhog (cluster: phalp2_30496)
Protein name
4oX8J
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFSVTLTKKAKLLVGACCRFLLGGIMKKVGVFAGKVVIGMAICVLVTTGLKIALNFTIAKASQAKEKVLDALPKREVAQSPLSLEQSIAKASRVNGVDPLILHVISEKESSNGNQRALYRFEPHLFSRLRGEKSYRSLSDSEIRMLASSHGAFHILGLTAERECGLHFSKLYDTEKSAMCAARIVKRIDESVKAKATEHRLREIFRQYNGQGPAAENYAKDAMVRLAAILYQRTNG
Physico‐chemical
properties
protein length:236 AA
molecular weight:25983,1 Da
isoelectric point:9,83
hydropathy:-0,06
Representative Protein Details
Accession
5cQF6
Protein name
5cQF6
Sequence length
239 AA
Molecular weight
27345,26160 Da
Isoelectric point
8,96306
Sequence
MEKHVMEATEKNKSQRHARKFKLIIFGLKLTFFLCICLCVSRLVDLAFEVASVHAMRARDALLEKVTVVKTITEYREVDDATLGDIIVKTAKEFDVDPLILMVLAEKESRGGDQNSLYAFEAKKFEELRNNKKYRTTSTNELRMIASSHGVFHVMGYSAKDYCNLHWSRLYDVWTAARCSALIVQQKSKEIDGIKDPTVRIREVFRRYNGGGEEADKYADDAMSRLAGILYERVSKAKS
Other Proteins in cluster: phalp2_30496
Total (incl. this protein): 11 Avg length: 225,3 Avg pI: 9,56

Protein ID Length (AA) pI
5cQF6 239 8,96306
2iAd1 211 9,48609
51tdD 234 10,00448
56eHu 236 9,71585
5EkHz 239 9,21100
6KULm 236 9,63179
6XyL 214 9,58447
80MUO 211 9,51252
80MkY 211 9,57589
88PT4 211 9,62347
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9502
Zsqg
18 29,2% 181 1.073E-19
2 phalp2_6731
24Qeo
1 28,8% 149 5.124E-11
3 phalp2_19668
4A3S3
3 24,5% 163 8.759E-07

Domains

Domains
Unannotated
Unannotated
Unannotated
Representative sequence (used for alignment): 5cQF6 (239 AA)
Member sequence: 4oX8J (236 AA)
1 239 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5cQF6) rather than this protein.
PDB ID
5cQF6
Method AlphaFoldv2
Resolution 87.26
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50