Protein
- Protein accession
- M4W627 [UniProt]
- Representative
- 5hxHI
- Source
- UniProt (cluster: phalp2_30503)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
- endolysin (probability: 99%, predicted by ML model)
- Protein sequence
-
MAFIQKQGYWISENGWRMCDTAELDYSAIPGTDFKLGVRKGAPSVILKALIWRLHRIEPMITTQIGCYTSSNSMANSNHNSATAIDYNWNLHPYQKWGTWGSKRPAVDQVIKDFRGIVEFGGDWTSPRDEMHFELHFAEGHAGTEQLATDLRNGLWGIYADGTAPPPVVIPEGYLQLGSEGDQVRKLQAGMNKVFRNYKAMPLDEDGIFGAFTESAVKEFQRRSLIGVDGIVGPETKQRLATYGIILDGATEPTSPPPPVVPSFVYPSTDEMVKQVWEQHFGPKAEGWPQLGQTPEGKNRYLVDGVASLIAKEEA
- Physico-chemical properties
- protein length: 315 AA; molecular weight: 35005,0 Da; isoelectric point: 5,44; hydropathy: -0,42
Representative Protein Details
- Accession
- 5hxHI
- Protein name
- 5hxHI
- Sequence length
- 559 AA
- Molecular weight
- 61349,32280 Da
- Isoelectric point
- 5,65351
- Sequence
-
MSFRVVYGQSRSENGWRMCNADETDSLPIPGSSVRIPLRSGVPSTILRAFASRFNQLIEPLDQTQCGGWTPTNSVATSNHLAGSAVDLNWRLHPFRVKGTFGDKLPVLRNILREFEGTVFYGGDWANPIDEMHFQMNYDCFEGSKKAEDFAQRLLNGHLGIYASANPNDFPLPLGYYYGPLSGPNESVSGEWAGERPEWRAGLKRWQKALGLKQTGKWNDGVTAKAATTLQLAKRWPPHPDFGYGGVYLGEWDAVIKNGWRLPDGWKPDAVPATRVKWADVSQYQTATLDATYPYPVVAFRASVADRRDTKFLANMTAARKLVAAGKLQKVIAYHFWVPGYDNVGTFMSALESSGGIFPELAFMIDVEDGGTKWNITGDQSDGVNDFIDRVSERFGNPAAAVGYLNFRSNQDLWKTIPHGLKLIVPAYSGPESKPWVPEGYTAFGHQYADNEDTPPFGPCDINQAHMTLADFISAFGVQAVPDLVPDSGTTPVPPPTGAGSLTLTGRPLDPRIPDDLAGHVLSMRMEGLRTQALVAALCEANNIDVSDVFNKVKDAVTP
Other Proteins in cluster: phalp2_30503
Total (incl. this protein): 19; average length: 421,4 AA; average pI: 5,43

| Protein ID | Length (AA) | pI |
|---|---|---|
| 5hxHI | 559 | 5,65351 |
| 1IfsK | 503 | 4,39169 |
| 1Ifvi | 524 | 4,46461 |
| 416Bq | 508 | 5,38648 |
| 5svdG | 466 | 4,78007 |
| 7AJDF | 530 | 4,91966 |
| 7BtFW | 530 | 5,10706 |
| A0A249XT09 | 530 | 4,91966 |
| B3VM63 | 318 | 5,42974 |
| B5U509 | 321 | 5,75361 |
| B5LJL3 | 530 | 5,10706 |
| A0A2P1K0B0 | 321 | 6,10715 |
| S5YB17 | 315 | 5,56484 |
| A0A3S9U8E5 | 253 | 7,74451 |
| A0A514CWK8 | 321 | 6,01160 |
| A0A7D5JSM2 | 321 | 6,10715 |
| A0AAE7XGC8 | 527 | 4,86914 |
| A0AAE9GCD4 | 315 | 5,43923 |
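
The summary figures for the cluster can be cross-checked against the table. The sketch below is a minimal illustration, assuming the averages are plain arithmetic means over the 18 listed members plus this protein (315 AA, pI 5,44 from the physico-chemical properties above). The names `OTHER_MEMBERS`, `THIS_PROTEIN`, `parse_decimal` and `cluster_summary` are hypothetical and not part of PhaLP.

```python
# Rows copied from the cluster table: (protein ID, length in AA, pI).
# pI values keep the page's decimal commas and are parsed below.
OTHER_MEMBERS = [
    ("5hxHI", 559, "5,65351"), ("1IfsK", 503, "4,39169"),
    ("1Ifvi", 524, "4,46461"), ("416Bq", 508, "5,38648"),
    ("5svdG", 466, "4,78007"), ("7AJDF", 530, "4,91966"),
    ("7BtFW", 530, "5,10706"), ("A0A249XT09", 530, "4,91966"),
    ("B3VM63", 318, "5,42974"), ("B5U509", 321, "5,75361"),
    ("B5LJL3", 530, "5,10706"), ("A0A2P1K0B0", 321, "6,10715"),
    ("S5YB17", 315, "5,56484"), ("A0A3S9U8E5", 253, "7,74451"),
    ("A0A514CWK8", 321, "6,01160"), ("A0A7D5JSM2", 321, "6,10715"),
    ("A0AAE7XGC8", 527, "4,86914"), ("A0AAE9GCD4", 315, "5,43923"),
]

# This entry (M4W627); values taken from its physico-chemical properties.
THIS_PROTEIN = ("M4W627", 315, "5,44")


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '5,44' to a float."""
    return float(text.replace(",", "."))


def cluster_summary(members):
    """Return (member count, mean length, mean pI) for a list of rows."""
    lengths = [length for _, length, _ in members]
    pis = [parse_decimal(pi) for _, _, pi in members]
    n = len(members)
    return n, sum(lengths) / n, sum(pis) / n


count, avg_length, avg_pi = cluster_summary(OTHER_MEMBERS + [THIS_PROTEIN])
print(count, round(avg_length, 1), round(avg_pi, 2))
```

Under these assumptions the count, mean length and mean pI agree with the reported summary once rounded to the same precision.
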
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_39755 (8MRZu) | 55 | 29,4% | 502 | 1.790E-79 |
| 2 | phalp2_36515 (1p8L5) | 2 | 27,3% | 417 | 4.314E-40 |
Domains [InterPro]
Taxonomy
| Role | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Mycobacterium phage FF47 [NCBI] | 1305710 | Mapvirus > Mapvirus Ff47 |
| Host | Mycobacterium avium subsp. paratuberculosis ATCC 19698 [NCBI] | 1074454 | Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium > Mycobacterium avium complex (MAC) |
| Host | Mycobacterium avium subsp. paratuberculosis [NCBI] | 1770 | Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium > Mycobacterium avium complex (MAC) |
Coding sequence (CDS)
CDS Source ID
JX901189
CDS Source
[NCBI]
CDS location
range 23066 -> 24013
strand +
CDS
ATGGCCTTTATTCAGAAGCAGGGGTATTGGATCTCTGAGAATGGTTGGCGAATGTGCGACACCGCCGAGCTGGATTACTCAGCGATCCCCGGAACAGACTTCAAGCTCGGTGTTCGCAAGGGTGCCCCGTCGGTGATCCTCAAGGCGCTCATCTGGCGCCTGCATCGGATCGAACCGATGATTACCACACAGATCGGCTGCTACACCTCGTCGAACTCGATGGCGAACAGCAATCACAACTCTGCTACTGCCATCGACTACAACTGGAACCTTCATCCCTACCAGAAGTGGGGAACGTGGGGTAGCAAGCGCCCCGCAGTGGATCAGGTCATCAAGGACTTCCGGGGCATTGTCGAGTTCGGCGGCGACTGGACCTCACCGCGTGACGAGATGCACTTTGAGCTCCACTTCGCGGAGGGCCATGCCGGCACTGAGCAGCTCGCTACGGACTTGCGCAACGGCCTCTGGGGTATCTACGCGGATGGCACTGCGCCGCCGCCTGTCGTGATCCCTGAGGGCTACTTGCAGCTCGGCTCCGAGGGCGATCAGGTGCGCAAGCTCCAGGCCGGGATGAACAAGGTCTTCCGGAACTACAAGGCCATGCCTCTCGACGAAGACGGCATCTTCGGTGCCTTCACTGAGTCGGCGGTCAAGGAGTTCCAACGGCGTTCGCTGATCGGTGTGGACGGTATCGTCGGGCCCGAGACGAAGCAGCGCTTGGCGACGTACGGAATCATCCTCGATGGGGCGACTGAGCCGACCTCACCGCCTCCTCCTGTTGTCCCCTCGTTCGTCTATCCGTCGACCGACGAGATGGTCAAGCAGGTTTGGGAGCAGCACTTCGGCCCGAAGGCAGAAGGCTGGCCACAGCTCGGCCAGACTCCCGAAGGAAAGAATCGCTACCTCGTTGATGGGGTAGCGTCACTTATCGCCAAGGAGGAAGCGTGA
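
The CDS coordinates, the nucleotide sequence and the protein length can be checked against each other. The sketch below is illustrative only: it assumes the range 23066 -> 24013 is 1-based and inclusive, and that the standard codon assignments apply. The helper names `cds_length` and `translate` are hypothetical.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a range, assuming 1-based inclusive ends."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate uppercase DNA codon by codon; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


n_nt = cds_length(23066, 24013)   # nucleotides spanned by the CDS
n_codons = n_nt // 3              # includes the terminal stop codon
n_residues = n_codons - 1         # residues in the encoded protein

print(n_nt, n_codons, n_residues)
print(translate("ATGGCCTTTATTCAGAAGCAGGGGTATTGG"))  # first 30 nt of the CDS
print(translate("GAAGCGTGA"))                       # last 9 nt of the CDS
```

Under these assumptions the range spans 948 nucleotides, that is 316 codons including the stop, which is consistent with the stated protein length of 315 AA. The first ten codons translate to the start of the listed protein sequence, and the final codon is a stop.
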
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002c50898_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
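
The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The function name `confidence_band` is hypothetical, and because the legend does not say which band the exact values 90, 70 and 50 belong to, the code assumes each boundary value falls into the lower band.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Assumption: scores exactly equal to 90, 70 or 50 are placed in the
    lower band, since the legend leaves those boundaries unspecified.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Example scores are made up for illustration, not taken from the model.
for score in (95.2, 81.0, 63.5, 12.0):
    print(score, confidence_band(score))
```
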
The structures below correspond to the cluster representative (5hxHI) rather than this protein.