Protein

Protein accession
Yzq4 [EnVhog]
Representative
3KG4d
Source
EnVhog (cluster: phalp2_21391)
Protein name
Yzq4
Lysin probability
98%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MARDPQLLRKAYKKSSALTVIPKRVDQAEEKLIDEDIDPKILELLGIDDPTGLDYSDYISLLKEKMITDRMGGQVDSGSSELLTDEFKRIKGKTGKFTVKNQKIKADTFVAKKKEPSSAARVVTPIPLLPEEVEEDIKPERQSVDKTLALIASKLNIVSNNVQQTTENIQKKDAIETKQDDRERIDTEIAANQERENKAERRNLLVGAVGAIKKTTKPITDILGGFFDFFKRLGFATFIMELIKFLENPAKYVNGISEFINKQIEKLEKSVENFVIDKMIKPLNGQISGLNTAIKDFTAGINTQLEQLKNIPLIGDKIKTITAPEVPLINESIIKDKVSLGRVPLVDDNFLGGGFGGKEETSSKSFANTGTGQDGGFGLGTHGKGRPSDPGVKPQETLMGEEPSIGGGGGMGRGGLSPQQKA
Physico-chemical properties
protein length: 422 AA
molecular weight: 46154,2 Da
isoelectric point: 6,74
hydropathy: -0,48
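The hydropathy figure above can in principle be recomputed from the sequence. The sketch below assumes that "hydropathy" means the GRAVY score, the mean Kyte-Doolittle hydropathy per residue. The page does not say which scale it uses, so that is an assumption. The helper names `gravy` and `parse_decimal_comma` are illustrative and are not part of any database API.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the sequence contains only the 20 standard residues;
    anything else (for example 'X') raises a KeyError.
    """
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma value such as '-0,48' to a float."""
    return float(text.replace(",", "."))
```

Passing the protein sequence listed above to `gravy` and comparing the result with `parse_decimal_comma("-0,48")` would show whether the page uses this scale. The representative sequence ends in `XXXX`, so it would need those residues removed first.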
Representative Protein Details
Accession
3KG4d
Protein name
3KG4d
Sequence length
510 AA
Molecular weight
N/A
Isoelectric point
4,84219
Sequence
MADKNSDNLDDLLNSIRNEEEGVPEGLDDLLNDIRSEKVVEDIQENLDEKLTETSDSIMNEIDKLIAEYQNKVKSVRTSATQVKKKIEKKSAALAVIPKRVEQAEEELINEDIDPKILELLGIDDPTGLDYSDYKSLLKERMVANQMGSGLGLDQEKLKDEFKRVRGKTGKFTVKNQKIKAETFVSNSKKPSTSARVVTPIPLLPGEIDEDIQPERQSFDKTIALLAGKLNSVDSNVKQTVDNIQKKDAVEAKQDEQDRIDAERIANNERETRTERRNLLVGVVGGIKKTVKPVTDMLGGFFDFFKRLGFAVFILELLEFLQDPKKYLNGIIEFVNKQIEKLEKTIENFIIDKLVSPMNGVIDGFNQKVKEFVSFINPLLSKLKGLGIDAQLDAETMMIPNIDPDAIRKGFDLPSIPTIGDDGPKLTEEQKEIKLREQREATEAAMRGDPRMDPSAVQVEPQETMMGQPRPTSGSNNQQALLDTIAYAEGTSGPDGYNTWFGGRTXXXXN
Other Proteins in cluster: phalp2_21391
Total (incl. this protein): 5; Avg length: 529,2 AA; Avg pI: 5,35

Protein ID Length (AA) pI
3KG4d 510 4,84219
2P1Gq 554 5,04443
4sP5t 581 5,04443
yBk6 579 5,06563
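The cluster averages can be checked by adding this protein (422 AA, pI 6,74) to the four members listed in the table. A minimal sketch of that arithmetic, with the values copied from this page and decimal commas written as points:

```python
from statistics import mean

# (protein ID, length in AA, pI): the four listed members plus Yzq4 itself.
members = [
    ("Yzq4", 422, 6.74),
    ("3KG4d", 510, 4.84219),
    ("2P1Gq", 554, 5.04443),
    ("4sP5t", 581, 5.04443),
    ("yBk6", 579, 5.06563),
]

avg_length = round(mean(length for _, length, _ in members), 1)
avg_pi = round(mean(pi for _, _, pi in members), 2)

print(avg_length, avg_pi)  # 529.2 5.35
```

Both results match the reported averages, which confirms that the totals include this protein.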
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3330 (36d3y) 20 63,9% 441 7.234E-188
2 phalp2_12407 (8Ej3z) 18 36,4% 335 5.251E-54
3 phalp2_12654 (1Stlx) 3 26,8% 387 3.717E-18
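Each hit above consists of a rank, a cluster ID, a protein ID, and four numeric fields. The sketch below parses one hit once its fields are joined into a single whitespace-separated string. The `ClusterHit` type and its field names are hypothetical. In particular, reading the protein ID as the cluster's representative is an assumption the page does not state.

```python
from typing import NamedTuple


class ClusterHit(NamedTuple):
    rank: int
    cluster: str
    protein_id: str  # assumed to be the cluster representative
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float


def parse_hit(row: str) -> ClusterHit:
    """Parse one hit written as seven whitespace-separated fields."""
    rank, cluster, protein_id, members, identity, aln_len, e_value = row.split()
    return ClusterHit(
        rank=int(rank),
        cluster=cluster,
        protein_id=protein_id.strip("()"),  # tolerate an optional "(ID)" form
        members=int(members),
        # Identity uses a decimal comma and a trailing percent sign.
        identity_pct=float(identity.rstrip("%").replace(",", ".")),
        alignment_length=int(aln_len),
        e_value=float(e_value),
    )


hit = parse_hit("1 phalp2_3330 36d3y 20 63,9% 441 7.234E-188")
```

Sorting parsed hits by `e_value` or `identity_pct` then ranks the similar clusters numerically rather than as strings.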

Domains

No domain annotations available.

Taxonomy

Name Taxonomy ID Lineage
Phage Unknown from Metagenome UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3KG4d) rather than this protein.
PDB ID
3KG4d
Method AlphaFoldv2
Resolution 58.84
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
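The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend leaves the exact boundary values (90, 70, 50) unassigned, so placing each of them in the lower band is an assumption, and `plddt_band` is an illustrative name.

```python
def plddt_band(plddt: float) -> str:
    """Return the confidence band for a pLDDT score on the 0-100 scale.

    Assumption: scores exactly equal to 90, 70, or 50 fall into the
    lower of the two neighbouring bands.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"
```

For example, `plddt_band(95.2)` returns `"Very high"` and `plddt_band(42.0)` returns `"Very low"`.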