Protein

Protein accession
Ig48 [EnVhog]
Representative
2GfCO
Source
EnVhog (cluster: phalp2_32811)
Protein name
Ig48
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIVSLVLAASMALAPVAVGTNQPSLETTSKYTLIDSPVSLYQGRMYVKKDNDLRYCIRYRESRHAYSADTGSGKYKGAYQMSREFKDGMSWMIQRDMRETGTPKAEAVRIGEKLRSTPVNKWAPYYQDYAFWMGWDKGKGRSHWDATNYSIKRGC
Physico‐chemical
properties
protein length:155 AA
molecular weight:17749,0 Da
isoelectric point:9,56
hydropathy:-0,66
Representative Protein Details
Accession
2GfCO
Protein name
2GfCO
Sequence length
140 AA
Molecular weight
16589,65290 Da
Isoelectric point
9,69355
Sequence
MIIELAIAASIGLTPVQPVTEPDIGYEVSAYQGKWYSAKWEPVRKCIMQRESRHNYRAKNRSSSAMGAYQFLDSQWRDGLVWNLRKDTAKPHRYKLEKLRDIPINKWPRYYQDQAFWTVWRNGAGRYHWAPTHPNTPNCY
Other Proteins in cluster: phalp2_32811
Total (incl. this protein): 95 Avg length: 152,1 Avg pI: 9,74

Protein ID Length (AA) pI
2GfCO 140 9,69355
15lYo 164 9,73390
1LsMA 148 10,49521
1MHvz 149 10,15546
22URD 168 9,62353
299NB 159 9,74809
29Vr1 146 10,00570
2GcP8 170 9,95877
2GcXn 140 9,86162
2HX2R 146 9,97643
2ICa2 131 9,54521
2Mks 174 9,69555
2Njcl 173 9,61586
2P4zq 151 9,84008
2Tal3 158 9,69219
2ZVIZ 149 9,19849
2e6dh 150 9,82364
2gSL7 187 9,76962
2huOq 173 9,73442
2nxuN 181 8,98497
2x0I4 137 9,29720
2x4s8 143 9,88747
36qQR 161 9,87735
3hxoX 146 10,18583
3iIIY 153 10,53086
49BkS 148 10,49521
4A8aP 175 9,76743
4Aakk 146 10,48973
4Adcg 176 9,82287
4AeHq 153 8,94932
4Gag 146 10,12142
4KLFr 149 10,06108
4WIXE 148 10,13644
4WK7f 175 9,75460
4XEJp 149 10,32476
4ae1 175 9,88360
4bjSf 149 9,57750
4e2dx 168 10,00332
4n2go 146 9,09554
4nCLx 144 9,76388
4nmcM 149 9,27366
4nvDC 149 10,15546
4ojoT 146 10,48973
4vKCt 168 9,93704
4vQLI 138 9,51381
4zOiW 135 10,10047
4zUqF 151 9,57525
50bsz 149 9,09773
56Mu3 149 10,00970
586wo 148 10,49521
58est 145 6,90300
5CEiV 144 9,73268
5CviE 144 9,91661
5JrzC 132 10,95739
5bDac 148 9,12042
5iIdM 145 7,80868
5kSDw 112 9,68942
5lgq2 175 9,67073
5vvvP 148 9,41917
5yDFy 151 9,92370
5yHZP 175 9,75460
5yML0 149 9,51432
6A4wu 144 9,69380
6ATfz 146 9,92686
6ICTO 175 9,64887
6P7pT 153 9,57557
6brb 177 9,73790
6zS1i 145 6,41862
7IiPd 144 9,82100
7Vvzn 144 9,51278
7W5fi 149 10,00970
82nkm 149 9,27250
8BffD 159 10,00583
8CseV 160 9,59497
8CxaH 135 10,00770
8FVq1 146 10,09080
8FfZp 135 10,19453
8eTT 138 10,52841
8eau8 149 10,17532
8hK2t 149 8,90697
8oHip 151 9,95606
8ocJh 141 9,69464
8p4G9 141 9,69509
8x1Hw 146 9,97315
8xEjv 146 9,92686
AUIT 131 9,87677
AzsS 157 10,04400
Ddcg 173 9,77020
Igtx 155 9,61496
X5YI 155 9,92093
kPzw 169 9,90481
uZln 143 10,31393
yGyd 151 9,93156
yQoz 140 9,34252
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_695
1B37v
117 56,8% 125 4.453E-51
2 phalp2_19728
4NMTi
55 36,8% 144 9.778E-29
3 phalp2_18319
5gNaP
1 29,4% 95 5.034E-19
4 phalp2_21967
7HbLZ
2 32,5% 126 2.995E-17
5 phalp2_7713
6P840
47 34,2% 111 1.971E-16
6 phalp2_40252
2M3BD
51 30,5% 95 1.296E-15
7 phalp2_28381
1o9gv
124 35,8% 106 1.593E-14
8 phalp2_26867
4ajOR
1 29,1% 96 4.272E-09

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 2GfCO (140 AA)
Member sequence: Ig48 (155 AA)
1 140 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2GfCO) rather than this protein.
PDB ID
2GfCO
Method AlphaFoldv2
Resolution 86.70
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50