Protein
- Protein accession
- G4KKR9 [UniProt]
- Representative
- 4UYf9
- Source
- UniProt (cluster: phalp2_37921)
- Protein name
- G331 protein
- Lysin probability
- 97%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MFTTIKDYLIGSTIKAVIFTLAMVSVTVGISIPVTPASTIGTPTPDLSESDIQNLIKEADARNKELTTPAKPTGAFFNPKPVAENSKMYDGFRMDDIVDLIYYQHNKIVEREKISNLVQIVIEEAPKYGNPPIHVVLAIINKESTWNSKARSGSSYGPMQVHYRVWGDYCNLDSAKSLYNPRIGVRCGLKVFTYYLEANNGNVNKTLQRYRGSDSRTNNRYARDVLRTANKIKTVLSS
- Physico‐chemical
properties -
protein length: 238 AA molecular weight: 26633,1 Da isoelectric point: 9,36 hydropathy: -0,33
Representative Protein Details
- Accession
- 4UYf9
- Protein name
- 4UYf9
- Sequence length
- 190 AA
- Molecular weight
- 22387,93570 Da
- Isoelectric point
- 9,67595
- Sequence
-
MKKKQLLILMSVPIVLFVLFLTYHYSYTNYVTKLTAKTEAIYAYKWCLKNAKQYIPESEILHIIHVAQKYDNWKLILAIIQVESSFDRYVVSNKNASGLMQVTYKVWGQKLKIRSERALFTPEINIRAGYNILNEYLIKSNYDINDALFKYVGKDKGRKYEKKVLQQYASLSLYIKYNINKGKGKNNEEK
Other Proteins in cluster: phalp2_37921
| Total (incl. this protein): 2 | Avg length: 214,0 | Avg pI: 9,52 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4UYf9 | 190 | 9,67595 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_8611
20BaY
|
8 | 37,7% | 151 | 2.181E-14 |
| 2 |
phalp2_5081
1Diw1
|
14 | 34,0% | 141 | 1.881E-13 |
| 3 |
phalp2_1587
84qUt
|
11 | 35,6% | 132 | 2.558E-13 |
| 4 |
phalp2_24402
4hGCM
|
1 | 34,4% | 122 | 3.478E-13 |
| 5 |
phalp2_25221
2aelP
|
7 | 34,6% | 173 | 1.379E-11 |
| 6 |
phalp2_23358
5F9UJ
|
16 | 35,8% | 134 | 1.588E-10 |
| 7 |
phalp2_34246
3afXs
|
1 | 31,0% | 193 | 5.371E-10 |
| 8 |
phalp2_30261
4i1tO
|
1 | 33,0% | 139 | 2.049E-08 |
| 9 |
phalp2_600
139Z6
|
763 | 32,1% | 140 | 2.049E-08 |
| 10 |
phalp2_18090
3WXWI
|
1 | 36,1% | 119 | 5.072E-08 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Yersinia phage phiR1-37 [NCBI] |
331278 | No lineage information |
| Host |
Yersinia enterocolitica [NCBI] |
630 | Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Yersinia > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
AJ972879
[NCBI]
CDS location
range 247799 -> 248515
strand +
strand +
CDS
ATGTTTACCACAATAAAAGATTATTTAATTGGAAGTACTATTAAGGCTGTGATATTCACACTAGCAATGGTCTCCGTAACAGTAGGTATTTCGATTCCGGTAACTCCAGCTTCTACCATAGGAACTCCTACTCCAGATTTATCTGAATCAGATATTCAGAACTTGATTAAAGAAGCTGATGCACGTAATAAAGAACTCACAACTCCCGCTAAACCTACAGGAGCGTTCTTTAATCCTAAACCCGTAGCTGAAAACAGCAAAATGTATGATGGGTTTCGAATGGATGATATAGTAGATTTGATCTACTATCAACACAACAAAATTGTCGAGAGAGAAAAAATCTCTAATCTTGTACAAATAGTTATAGAGGAAGCACCTAAATATGGCAATCCGCCAATACACGTAGTATTAGCTATTATCAATAAAGAAAGTACATGGAATTCCAAAGCACGTTCTGGTAGCTCATATGGGCCTATGCAAGTACATTATCGTGTATGGGGTGACTACTGTAACTTAGATTCAGCAAAGAGTTTATATAACCCCAGAATAGGGGTACGATGTGGCTTAAAAGTCTTTACCTATTATTTGGAAGCAAATAATGGAAATGTAAATAAAACCTTGCAACGATATCGAGGAAGCGATTCTCGAACAAATAATCGATATGCAAGAGACGTTTTGCGAACTGCAAATAAAATAAAAACTGTTTTATCTTCTTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi00022dbdf6_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(4UYf9)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50