Protein

Protein accession
4F9Za [EnVhog]
Representative
4F9Za (this protein)
Source
EnVhog (cluster: phalp2_40517)
Protein name
4F9Za
Lysin probability
96%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MVKLTYQVNAASQASFVRSLVESASSSSQAASAMYRLGMAVANMARGMAEAGDKLYWMGQRLQSSVQGIESASYAMSQLGSSVSSAHASMEALGNFTRTYGPAASQYLRSIGITATDAGQRMEQLGNHLRSMGFAPGKEGTTGYAMGKQLASFLGVSENDARAVADPEFARRRAEQRAVAARAGLNVDVWDKSAQRLMTVFRQFDNLLDTIWKKFGQDVMPGLTDRFERLYKILLNYLPQITDFFNKMGKAIVWVVDLFLVFIDTVQKLDPALLKMIEQIGMVVAAVRILSSGLLASPLFWFITALTTLLLLMDDYNHWKNDQKNGTNTSAFDWSGFDSMKKGIDKWGEDVLGVKCLFDKLVIAMMALFAAPMLAGLGSSAAAVAGIGKAALSALPFLAKLSGWLSLLLLAGDTPGSGTKKELSPEQQKAVEEINKRHGYTGDFLHDMRKWLFDMLPDQTGAAGRALNPDTPGGGAGGAGGSSAPPGAVSTNTASGAFMQAGSGSKPSGQELLDYLMSPQGGSWTNVQAAGIIGNLMQESGLNPNAVGDGGAAYGMAQWHPDRQARFRERYGKPIQQASWREQIDFLNWELTGTESRAGAALRGAGSIETATMAALGMERPKDWNTSPAELNRRLPYARAALLPGSVAGQPMVAGGGGGAVGLPGGWQPGAPMGGAGGVVINQTTEINVAAGPTATDTAQRVAGAQTRVNESLVRNTRSALR
Physico‐chemical
properties
protein length:722 AA
molecular weight:76623,4 Da
isoelectric point:9,05
hydropathy:-0,12
Other Proteins in cluster: phalp2_40517
Total (incl. this protein): 7 Avg length: 708,6 Avg pI: 7,49

Protein ID Length (AA) pI
16jwa 583 9,06814
1Y2sx 695 5,78618
4E8wb 761 7,24120
4EOBu 734 6,89152
6XIY5 731 5,49732
QIN2 734 8,92966
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37802
4E7k5
12 30,2% 793 5.128E-110
2 phalp2_25971
6DOjR
4 30,1% 898 3.803E-108
3 phalp2_5038
1ogSO
1 28,0% 771 6.875E-74
4 phalp2_39305
4jkcC
213 28,9% 753 5.577E-70
5 phalp2_32382
wgVF
186 23,5% 725 5.821E-48
6 phalp2_38983
872UN
24 25,1% 624 1.623E-37
7 phalp2_30674
6PrQi
4 24,2% 730 4.644E-35
8 phalp2_40424
4eJZ6
2 24,0% 625 1.436E-34
9 phalp2_300
6LdWe
1 24,9% 638 4.434E-34
10 phalp2_1396
1lObC
1 22,6% 765 6.627E-31

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4F9Za
Method AlphaFoldv2
Resolution 51.31
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50