Protein
- Protein accession
- Zsqg [EnVhog]
- Representative
- Zsqg (this protein)
- Source
- EnVhog (cluster: phalp2_9502)
- Protein name
- Zsqg
- Lysin probability
- 97%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MAYLGAGPLWQITTAQASSVYEQAIGKLTRVEVIKEYVQPVELSTDEMLERISISMSINPTITKAIARQESGLAYRADALRFEPHLEARFAKMARGPEERRMLATSIGVMQVIPGFHLKTCNLTSYAQLFDKRINITCGLQVLKACLEHNKALPTAQRFRRAFVCYNGSEEYADDIFNHIGQIVLEEGIQ
- Physico‐chemical
properties -
protein length: 190 AA molecular weight: 21353,4 Da isoelectric point: 6,90 hydropathy: -0,11
Other Proteins in cluster: phalp2_9502
| Total (incl. this protein): 18 | Avg length: 204,4 | Avg pI: 8,53 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 1MeN2 | 192 | 8,74470 |
| 1Monv | 207 | 8,25751 |
| 1MtBK | 206 | 8,48161 |
| 1OYQj | 248 | 7,66539 |
| 1Qpms | 219 | 6,91352 |
| 1QuQG | 186 | 9,65635 |
| 1Ydoj | 186 | 9,54366 |
| 2WVP1 | 166 | 9,90494 |
| 4UYUJ | 212 | 8,84721 |
| 4ztG4 | 160 | 8,72111 |
| 5l24j | 255 | 8,48580 |
| 6PhsV | 195 | 9,23698 |
| 7YRUa | 211 | 8,63240 |
| 7Z3TU | 213 | 6,90749 |
| 807eV | 205 | 9,63121 |
| 8eGNt | 232 | 8,74019 |
| ECfk | 196 | 8,34945 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_20913
71e0x
|
11 | 38,2% | 128 | 3.005E-32 |
| 2 |
phalp2_6731
24Qeo
|
1 | 28,6% | 143 | 4.112E-32 |
| 3 |
phalp2_19668
4A3S3
|
3 | 26,0% | 161 | 5.398E-26 |
| 4 |
phalp2_36591
1XLWM
|
1 | 27,7% | 155 | 1.008E-25 |
| 5 |
phalp2_5467
36wMG
|
4 | 23,3% | 167 | 1.677E-24 |
| 6 |
phalp2_203
5INPy
|
3 | 30,4% | 128 | 5.844E-24 |
| 7 |
phalp2_30496
5cQF6
|
11 | 29,0% | 162 | 9.125E-20 |
| 8 |
phalp2_11114
5jcjr
|
291 | 22,2% | 166 | 3.316E-17 |
| 9 |
phalp2_20310
2k1Qv
|
26 | 23,2% | 159 | 4.520E-17 |
| 10 |
phalp2_27050
2cnxE
|
19 | 24,0% | 158 | 3.427E-15 |
Domains
Domains
1
190
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50