Protein

Protein accession
G4KKR9 [UniProt]
Representative
4UYf9
Source
UniProt (cluster: phalp2_37921)
Protein name
G331 protein
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFTTIKDYLIGSTIKAVIFTLAMVSVTVGISIPVTPASTIGTPTPDLSESDIQNLIKEADARNKELTTPAKPTGAFFNPKPVAENSKMYDGFRMDDIVDLIYYQHNKIVEREKISNLVQIVIEEAPKYGNPPIHVVLAIINKESTWNSKARSGSSYGPMQVHYRVWGDYCNLDSAKSLYNPRIGVRCGLKVFTYYLEANNGNVNKTLQRYRGSDSRTNNRYARDVLRTANKIKTVLSS
Physico‐chemical
properties
protein length:238 AA
molecular weight:26633,1 Da
isoelectric point:9,36
hydropathy:-0,33
Representative Protein Details
Accession
4UYf9
Protein name
4UYf9
Sequence length
190 AA
Molecular weight
22387,93570 Da
Isoelectric point
9,67595
Sequence
MKKKQLLILMSVPIVLFVLFLTYHYSYTNYVTKLTAKTEAIYAYKWCLKNAKQYIPESEILHIIHVAQKYDNWKLILAIIQVESSFDRYVVSNKNASGLMQVTYKVWGQKLKIRSERALFTPEINIRAGYNILNEYLIKSNYDINDALFKYVGKDKGRKYEKKVLQQYASLSLYIKYNINKGKGKNNEEK
Other Proteins in cluster: phalp2_37921
Total (incl. this protein): 2 Avg length: 214,0 Avg pI: 9,52

Protein ID Length (AA) pI
4UYf9 190 9,67595
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8611
20BaY
8 37,7% 151 2.181E-14
2 phalp2_5081
1Diw1
14 34,0% 141 1.881E-13
3 phalp2_1587
84qUt
11 35,6% 132 2.558E-13
4 phalp2_24402
4hGCM
1 34,4% 122 3.478E-13
5 phalp2_25221
2aelP
7 34,6% 173 1.379E-11
6 phalp2_23358
5F9UJ
16 35,8% 134 1.588E-10
7 phalp2_34246
3afXs
1 31,0% 193 5.371E-10
8 phalp2_30261
4i1tO
1 33,0% 139 2.049E-08
9 phalp2_600
139Z6
763 32,1% 140 2.049E-08
10 phalp2_18090
3WXWI
1 36,1% 119 5.072E-08

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4UYf9 (190 AA)
Member sequence: G4KKR9 (238 AA)
1 190 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Yersinia phage phiR1-37
[NCBI]
331278 No lineage information
Host Yersinia enterocolitica
[NCBI]
630 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Yersinia >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AJ972879 [NCBI]
CDS location
range 247799 -> 248515
strand +
CDS
ATGTTTACCACAATAAAAGATTATTTAATTGGAAGTACTATTAAGGCTGTGATATTCACACTAGCAATGGTCTCCGTAACAGTAGGTATTTCGATTCCGGTAACTCCAGCTTCTACCATAGGAACTCCTACTCCAGATTTATCTGAATCAGATATTCAGAACTTGATTAAAGAAGCTGATGCACGTAATAAAGAACTCACAACTCCCGCTAAACCTACAGGAGCGTTCTTTAATCCTAAACCCGTAGCTGAAAACAGCAAAATGTATGATGGGTTTCGAATGGATGATATAGTAGATTTGATCTACTATCAACACAACAAAATTGTCGAGAGAGAAAAAATCTCTAATCTTGTACAAATAGTTATAGAGGAAGCACCTAAATATGGCAATCCGCCAATACACGTAGTATTAGCTATTATCAATAAAGAAAGTACATGGAATTCCAAAGCACGTTCTGGTAGCTCATATGGGCCTATGCAAGTACATTATCGTGTATGGGGTGACTACTGTAACTTAGATTCAGCAAAGAGTTTATATAACCCCAGAATAGGGGTACGATGTGGCTTAAAAGTCTTTACCTATTATTTGGAAGCAAATAATGGAAATGTAAATAAAACCTTGCAACGATATCGAGGAAGCGATTCTCGAACAAATAATCGATATGCAAGAGACGTTTTGCGAACTGCAAATAAAATAAAAACTGTTTTATCTTCTTAA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00022dbdf6_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4UYf9) rather than this protein.
PDB ID
4UYf9
Method AlphaFoldv2
Resolution 93.87
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50