Protein

Protein accession
4EjW9 [EnVhog]
Representative
4EjW9 (this protein)
Source
EnVhog (cluster: phalp2_23127)
Protein name
4EjW9
Lysin probability
99%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAENVLESFVIKLGYQIDQASAKRFITTITDGLKGVLEFSGAIMGMAVAAEEAVRRTAAQMARLSFAATMGGTTVNKLREVGYAVEAVGGSYQDLIGAQTSLNQKLTESPWMKGLLRSMLHGHSMDIEGIIKRYHELSSGPLGEASAAAVNFSRVMREQYGVNMMIAERTHRHWKEYASVQEHNKQVTLLFQKQMDEAEKNSVKMQVSWTKLGFTVDTMWKSVTGGLSGILSHVFDKFDEWLQTSAPEINKFMEELNPLIDEITTSVLNWVDDLIKHPKKIKDAWEDVKSTWEDLKAIFWVVHDAVCKLIAAFNWLAEKIGPVNAVLAILVTYFAAPVIGGILGFSGAIVTLTGSMGGLLLAAAPVAAVLTTIAGLMWAIHPTSTGNKAEEEAEKKLGEQARKAAGGGVTIPLGQGSNKFINLTPAQAQQLLETGTNFSDEDKAALQWRASLPPNTPAGKIPGMQKGGIVPINAHAGEMVLPQPISQGLQSLFMGGGLSDSLDDMSRWLANDSSFQPFVTFATEVYDKITDAFEEALIRVGGTAGAGGAGAGGDGGGASPDGGGATPDQGGPAGAGAISQKAAELISRAEGTFTKAGINYNAMYSGEMKGLTEMTIAKVMEYQRQHMGSHTPIGAFQMTRDTIADTIKALHLDPETTKFSPEVQRQLAGYLLQHRGIQPWTNQHPELMRQVREMGAGAFETQGTGGGGSVGKFGKGSSSALLEIMNQAFTAALPAGYSVRQTSGARPGGDPSSGHYRGKAADFEIVGPDGKAIPNEGADRSGLYRKVAIQAYLAAMQKYGPEMAKQLGWGGHFGSHIHGGGAADLMHFDWLFKGIRGRIGNMYQEYN
Physico-chemical properties
protein length: 847 AA
molecular weight: 90609.0 Da
isoelectric point: 6.12
hydropathy: -0.17
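The page does not say how the hydropathy value was derived. A common definition is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that definition; the function name `gravy` is illustrative and not part of any PhaLP or EnVhog tool.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the sequence contains only the 20 standard one-letter
    codes; any other character raises KeyError.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the sequence above, used only as a short demo.
    fragment = "MAENVLESFV"
    print(len(fragment), gravy(fragment))
```

Passing the full sequence from the record to `gravy` and `len` gives values that can be compared with the listed hydropathy and protein length, provided the database used the same scale.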
Other Proteins in cluster: phalp2_23127
Total (incl. this protein): 4    Avg length: 823.0    Avg pI: 6.07

Protein ID   Length (AA)   pI
254qM        670           5.38478
25jUD        860           6.26664
4EZDO        915           6.48529
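The cluster summary can be cross-checked from the four member entries. The sketch below is a minimal check using the lengths and pI values shown on this page; the variable names are illustrative. Because the pI of 4EjW9 is displayed to only two decimals, the recomputed mean pI (about 6.06) can differ in the last digit from the listed average.

```python
# (protein ID, length in AA, pI) for all four members of phalp2_23127,
# copied from the values shown in this record.
members = [
    ("4EjW9", 847, 6.12),      # this protein; pI shown to two decimals only
    ("254qM", 670, 5.38478),
    ("25jUD", 860, 6.26664),
    ("4EZDO", 915, 6.48529),
]

# Arithmetic means over the cluster members.
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(f"members: {len(members)}")
print(f"avg length: {avg_length:.1f}")  # 823.0
print(f"avg pI: {avg_pi:.2f}")
```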
Similar Clusters (pHMM search)
#   Cluster        Protein ID   # Members   Identity (%)   Alignment Length   E-value
1   phalp2_11566   16h5G        7           32.6           692                9.535E-108
2   phalp2_29071   6RWnL        4           29.1           903                3.700E-81
3   phalp2_12429   jGso         2           27.7           869                3.479E-78
4   phalp2_20129   25lh2        4           24.8           776                5.918E-39
5   phalp2_37815   4ERO0        2           26.7           624                2.974E-37
6   phalp2_35231   6FTTB        1           23.1           677                5.201E-37
7   phalp2_9175    6D7BW        1           24.3           645                7.186E-30
8   phalp2_32146   6Ia6o        1           21.2           938                2.017E-28
9   phalp2_5020    1jlsB        2           22.3           569                1.479E-21

Domains

No domain annotations available.

Taxonomy

        Name                             Taxonomy ID      Lineage
Phage   Unknown from Metagenome [NCBI]   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID 4EjW9
Method AlphaFoldv2
Resolution 55.49
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
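The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is hypothetical, and because the legend leaves the exact boundary values (90, 70, 50) unassigned, placing them in the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores exactly equal to 90, 70, or 50 fall into the lower
    band, since the legend uses strict inequalities on both sides.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Illustrative scores, not taken from this model.
    for score in (95.0, 80.0, 60.0, 40.0):
        print(score, plddt_band(score))
```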