Protein

Protein accession
5GC0k [EnVhog]
Representative
5GC0k (this protein)
Source
EnVhog (cluster: phalp2_23361)
Protein name
5GC0k
Lysin probability
77%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
MSGTQVSSAPNVLFDPMMALRQQEGQNQLLQQQQDISGHEMEMASRLANTVLDPTLYPTPEARAAAYPTLLANARQTAPGYFKNAPETYPGDDVAHALARMGTPSQTQAEWAANIAANRAISAAGNTTAQPAAGGGAAGGGTAAPAAPPYGGGPGNVTVPPEYMPYFQEASQRTGIPVDLLIAQARQESGFNPGATGGAGEVGIMQIHPRTAADPGFGMTGVNPAVLRDPRTNINFGADYLAARGKAAGADFSTPGGQAIALRSYNAGGDPNYVSNVFRYMPGTGGAAPGGGAATTTAAAAKPVILGDSLASAGGLGGTGVVGAGPAAVRDAVKAAAQAGTLRGQPVVLSSGASNNPGAIDAVEEQLQAAKDGGAASVKLLGVGPGVEAQAPGTNARLQALAQKYGADFVPLPPSMMTAGGVHPTPQGYAALRAITTPSPGGAGGVAARTGGTDTAGPGAGSGTAAAAPPAVAPGQAAPAPGGAASAPQPPAPPAGGPAPPQLQPLNANGLTTRQQAILDAQGATGRLTGQQRIAAEQAYVNQNIQLAQKAFSDYMQQQQLAVSQGQLSNAQAETNLKYWQAQHPELAPNAREATIAYRTLQELAPKMRPGGGATQEETDRYNNAAIAYQDFKTVTDPISKAIVQVPSRPLPEGFPQPPGAAGSGGARPLTQGLSPAQQQVERDPAAYKVAEKQYERDAEDSKDLSTAVRKSQADQVRIKEMQDVLQRFSTGPGTEGRTAASAFLQRWLPSALTGWEKESANLSGSDAAQAFAKLALVGAGTQEQSVLGARGGYQAIKLFKDANPGVNLQDATNKSILDMQLISNQANADYAQAALSHFADNEQRFSQTHQYRSLAQFDRDWNTQRNPQVYAAAMGAISGQKPEQWTKGLSDSEYDRALQLVSRAKPDAVVNTKSGRFSMQPTAVTGGTPTKPGDTVIRYDATGNRVQ
Physico‐chemical
properties
protein length:948 AA
molecular weight:97248,8 Da
isoelectric point:6,46
hydropathy:-0,40
Other Proteins in cluster: phalp2_23361
Total (incl. this protein): 14 Avg length: 850,4 Avg pI: 6,04

Protein ID Length (AA) pI
2naUM 789 5,80561
2narV 806 6,30796
4N6m2 996 6,47774
5FqeP 828 6,00711
6D2An 938 6,27039
6DNzB 828 6,50348
6EmAB 882 6,40060
6IM8I 606 5,49056
6IPT1 816 5,73320
6JdfZ 807 5,42053
6TRM6 868 6,25237
6XPQT 872 5,48555
nMcz 922 5,92446
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26308
16Ndq
5 31,4% 709 1.979E-71
2 phalp2_29021
6F9Mp
4 31,9% 768 8.792E-68
3 phalp2_21887
4ER1o
8 24,3% 951 3.940E-44

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5GC0k
Method AlphaFoldv2
Resolution 72.42
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50