Protein
- Protein accession
- 8lxJj [EnVhog]
- Representative
- 3gPQA
- Source
- EnVhog (cluster: phalp2_2120)
- Protein name
- 8lxJj
- Lysin probability
- 99%
- PhaLP type
- endolysin
- Probability: 98% (predicted by ML model)
- Protein sequence
-
MSEFHIIDVSKWQGDIDWEKVKASGIDGAMLRAGYGAGNIDPKFVRNAKECTRLGIPFGVYWFSYAWTPMQAEDEAKYCMGAIAPYRLTLPVAFDWEYDSYNRAVRAGVTPSRALAVSMAKRFLSVVEQAGYVPMLYTNLDYQNQFFPPEELAGYDRWIAAYRATRPDTPLAMWQYTSRGRVDGIDGLVDCNRLYIDYPAIAEEREKTPDYAALVCEKLGLAGETREYIDGYRYANDLWRKLWDALNNFS
- Physico-chemical properties
- protein length: 250 AA; molecular weight: 28614.0 Da; isoelectric point: 5.01; hydropathy: -0.35
Representative Protein Details
- Accession
- 3gPQA
- Protein name
- 3gPQA
- Sequence length
- 294 AA
- Molecular weight
- 33987.39430 Da
- Isoelectric point
- 4.51344
- Sequence
-
MKQAFIYWPAIAPQDEPKKEEEKEVKQETSIVEEVSTEEKNIPEAIIEEPFVDDDTRFDINLSGNIIDISKHQGTIDFKALKQYVGLVIARASCGSDKDIKIDEYAKEMIKNRIPFGVYCYSYAGTVEKAKDEAQKLVAYAEQYDPLFYVLDAEEERLTTETIKAFVKELRNLTSERIGCYVAHHRYKAYKYDTLRDLFDFTWIPCYGKNNGTLEGSKEPSYPCELWQYTSTGKIAGIKGNCDMNVIHGEKTLEWFLTNYDPNYIEDENEDENYPDIEVDESNISYEYSEDNVG
Other Proteins in cluster: phalp2_2120
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_32637 (21xEV) | 86 | 47.2% | 201 | 2.787E-53 |
| 2 | phalp2_12693 (24JaA) | 16 | 45.9% | 198 | 1.178E-51 |
| 3 | phalp2_33310 (6kV0o) | 68 | 29.5% | 193 | 7.589E-41 |
| 4 | phalp2_32227 (7cHL2) | 31 | 38.0% | 205 | 1.120E-36 |
| 5 | phalp2_24235 (2VKgf) | 225 | 36.7% | 196 | 2.460E-35 |
| 6 | phalp2_24035 (8aWkT) | 56 | 28.6% | 199 | 3.350E-35 |
| 7 | phalp2_16055 (3TLHo) | 2 | 35.3% | 215 | 2.905E-34 |
| 8 | phalp2_39103 (2m4j4) | 107 | 34.0% | 197 | 2.905E-34 |
| 9 | phalp2_11865 (7owpA) | 74 | 33.7% | 228 | 2.514E-33 |
| 10 | phalp2_23950 (235xP) | 13 | 35.5% | 194 | 1.596E-32 |
Domains
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (3gPQA) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
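The confidence legend above can be read as a simple threshold lookup. The sketch below is illustrative only: the function name `plddt_band` is hypothetical, and because the legend does not say which band the exact boundary values (90, 70, 50) belong to, the code assumes each boundary falls into the band below its `>` comparison for 90 and into the higher band for 70 and 50.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Boundary handling is an assumption: the legend leaves 90, 70 and 50
    unassigned, so here 90 counts as "High", 70 as "High" and 50 as "Low".
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score >= 70:
        return "High"
    if score >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Example scores are made up for illustration, not taken from a model.
    for example in (95.2, 83.4, 61.0, 32.5):
        print(example, plddt_band(example))
```

A lookup like this is mainly useful for summarising a predicted structure, for example counting how many residues of the representative model fall into each band.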