Protein
- Protein accession
- 4JZTh [EnVhog]
- Representative
- 1gCiB
- Source
- EnVhog (cluster: phalp2_25082)
- Protein name
- 4JZTh
- Lysin probability
- 87%
- PhaLP type
- endolysin (probability: 95%, predicted by ML model)
- Protein sequence
-
MVKLIYWASCVMMIVISVILIVSQQPKVMAVENNPASIEAPTPDPITVRITEQNEPVNVYSEEIPLTPGWQAYTQDTCRRYGIDYALMLGLMETESSFRFDADSGWAYGICQIGYINEDVLATEGIDIYSRIGNIEAGCYILAGYLERYTESQALMAYNMGEYGASELWEQGIYESEYSRSVQEAAQKWRMIINGTIN
- Physico-chemical properties
- protein length: 198 AA; molecular weight: 22352.1 Da; isoelectric point: 4.22; hydropathy: -0.07
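The record does not say how the hydropathy value is computed. A minimal sketch, assuming it is the GRAVY score (the mean Kyte-Doolittle hydropathy index over all residues), is given below. The function name `gravy` is hypothetical and is not part of PhaLP or EnVhog; only the 20 standard amino acids are handled.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the 'hydropathy' field of the record is a GRAVY score.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"unsupported residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 10 residues of the protein sequence listed in this record.
    fragment = "MVKLIYWASC"
    print(len(fragment), round(gravy(fragment), 2))
```

Passing the full 198-residue sequence instead of the fragment gives a value that can be compared with the hydropathy listed above, if the GRAVY assumption holds.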
Representative Protein Details
- Accession
- 1gCiB
- Protein name
- 1gCiB
- Sequence length
- 188 AA
- Molecular weight
- 21496.37100 Da
- Isoelectric point
- 4.69828
- Sequence
-
MERKDVAIFAIALVLLVVLLVVKPPADEPEATPEEVWHEVAQVTIPDPPKMYAAVPISHETQDILKAACEEWGVSYELALAVCFRETGYKDLETEYDGKHYYGMMAVQLESATQYMEQCGVSDLSGMSERLRVGCCILGDYLSRYDVHLALMCYNLGENRAIGWWRHGIYETEYSRDIVEHWELLKKE
Other Proteins in cluster: phalp2_25082
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_600 (139Z6) | 763 | 36.7 | 158 | 7.174E-38 |
| 2 | phalp2_12998 (3nWhM) | 100 | 33.7 | 145 | 2.520E-37 |
| 3 | phalp2_29541 (3wlSl) | 72 | 31.7 | 164 | 3.833E-35 |
| 4 | phalp2_664 (1m3Kq) | 134 | 34.0 | 138 | 3.449E-34 |
| 5 | phalp2_980 (2m07J) | 181 | 36.1 | 191 | 2.037E-32 |
| 6 | phalp2_26221 (naqH) | 11 | 34.7 | 138 | 5.219E-32 |
| 7 | phalp2_36276 (81fJ) | 27 | 29.8 | 134 | 9.770E-32 |
| 8 | phalp2_17432 (7DOMS) | 1 | 27.0 | 155 | 4.121E-27 |
| 9 | phalp2_23955 (23NgV) | 32 | 31.4 | 159 | 1.145E-24 |
| 10 | phalp2_28346 (1dnWl) | 3 | 29.6 | 182 | 2.139E-24 |
Domains
Axis: 1 to 188 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
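The confidence bands above can be expressed as a small lookup, which may help when colouring or filtering residues by per-residue pLDDT. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend leaves the exact boundary values (90, 70, 50) undefined, the code assumes they fall into the lower band.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend.

    Assumption: the boundary values 90, 70 and 50 are assigned to the lower
    band, since the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```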
The structures below correspond to the cluster representative (1gCiB) rather than this protein.