Protein

Protein accession
4E7Ez [EnVhog]
Representative
4E7Ez (this protein)
Source
EnVhog (cluster: phalp2_19681)
Protein name
4E7Ez
Lysin probability
99%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MPTETEELKLRVTLDDHASAQLQAVRQHMTQMGSGPTGQALQNITSKASEAERAAGKLGREFLGIGRSAMDFAKVIGPWPVALATIGFEFVRQAAAMKEWSGQMVQINNAARTAGMNIGTFRNITSQLRESGVSADQAAGMMQNLTRSLSEAIRPGSQVRENLIQMAGINATGMLNQLDKLEQMTDATDRLNEIRNMGLNVYAHEFEAHHDKMRATAMQLMFLQQFGQEQLINRKTAFLQQDKQTKANMEWESGKQDKLNTALEKEADTRQRIHDILMSSMADYETAGAKAMERIEAGFRKALERKEAGGPLLQGTAQTIIGGALRAAPSGAGAVGSYLGTELGKGARRLLGFQHGGIITRPTMGMVGEAGPEAVMPLGQMGGGASTQRHTDTVDENTKQLRMVNDQLEELLDPSISKYAQGMGALRGLGTATGGGGGGGGGGGGGGLGGGTGRTGMTGTPGTPGGPAIDMRSAMTGVNPMWMDPTRGAAGNRPGGFLGPTVAGGTELHAGMKAMAWPSQGGGGLGGGVQGGGALTGGAGAGAPVDPQALYRSAVERFRNSKLNGYIPQDGAKFGITTGAPEEWARMSVALAMQESGMRAHPPPEPGHTGTAGLYQFETGDLRNYGVKGDVTDPNAQLEAMARVMEKFVPSSGHLAGKAGAGAYFGPFRTQVDVIKHLAEAGKVAQGAGAVPTGPGGGVHAGLGPYGPPSGTGTGASTGGGTDLPSGDLGVARRMAPSGQDPLAFVVHHTGPAGSARGIVEDWRKNRPGIGAQYIVDRDGKIHDVKAEFGYGGTGHVLPSATPAEFLKKGYVNKNMIGVEVMAKDDKDVTPAQKQAVENLYSQYYAGTPVYGHGELNPGHREATEGATITKEIRAAQAQGRMLTRSTPNTIPAAQVAGPTSAAVPTEPPSPFSGPAEPAGKTLDRMMGKEVQHDVDTSGTITIKHDRRPPKFSMRKPKFRDVPMNRQGTMTPAASGVPENNAGDTVAI
Physico‐chemical
properties
protein length:988 AA
molecular weight:103002,0 Da
isoelectric point:8,73
hydropathy:-0,43
Other Proteins in cluster: phalp2_19681
Total (incl. this protein): 6 Avg length: 991,2 Avg pI: 7,36

Protein ID Length (AA) pI
4Efkd 942 6,84213
5nM8t 1144 6,34576
6XAFk 1025 6,90175
6XB6L 998 6,76835
6XzAw 850 8,56103
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20592
4EqYd
1 30,1% 641 1.728E-69
2 phalp2_34638
5sZ8Y
3 26,7% 982 2.190E-55
3 phalp2_28717
4ENu0
8 28,2% 767 1.115E-48
4 phalp2_4964
6I865
4 22,9% 889 2.069E-31
5 phalp2_31646
4EQbE
2 22,0% 663 1.100E-18

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4E7Ez
Method AlphaFoldv2
Resolution 57.52
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50