Protein

Protein accession
4C3bx [EnVhog]
Representative
gfrU
Source
EnVhog (cluster: phalp2_16646)
Protein name
4C3bx
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VTIYFPDMSAYQQGISLAGAQAAGIKCTQGLTYASPAYAQQAAAADKAGVFRLAYHFLEHAQGAGQADYFYAHAGKTPAMVDCEPTFDSAGNYASRPTITDVCAFVDRLRGRGGVIWWVYLPHWWWQELGSPDLAPLRSRGLMLWSSVYSTYSDSNAAAGWQEYGGMTPLVWQYSETTPFGGIAEVDFNGFRGSKYAGKQDKASVTACRAEFVSLSKTGKAAPVPAPTVLPPVRDLTVTDVGPSSVRLQWDSPAGPSPFATGWYRMTIRYATGAKADQDLPSYGRPHVPKTANPQNVLFGSLPVKTRLYALVGAVATTGANGSEWERVDFTTTAAKAIPAEVPDE
Physico‐chemical
properties
protein length:345 AA
molecular weight:37118,2 Da
isoelectric point:6,58
hydropathy:-0,20
Representative Protein Details
Accession
gfrU
Protein name
gfrU
Sequence length
310 AA
Molecular weight
32487,15730 Da
Isoelectric point
4,68401
Sequence
MTIFFPDVSGYNGGLRIQPNTVAVGARATLADRVADASYTGFRQQAATLGALFFAYHWLNHGNAAAQAQWCFQHVGGTPLMIDAEDVAGNTGYAGPLTVQDILAFTTAYRALGGVVHLVYLPHWYWQNDMGSPDLRPLAAAGLALVSSNYATYSDTGPGWAPYGAVTPTIWQYTDKLPYGGQAVDFNAFRGSVTELRALIEGGSDDMALSDVVPGTDVDGRTVGAILADLENMRNWLITPAGQTKFGPAVPLAGSPLDLLAKLATAPTTPTVELSDADRADIAAKVASALGGKLDQLLTRLAAAGDALNG
Other Proteins in cluster: phalp2_16646
Total (incl. this protein): 11 Avg length: 317,1 Avg pI: 5,47

Protein ID Length (AA) pI
gfrU 310 4,68401
1e0v0 301 4,65428
1kM9M 296 4,37321
1otfT 296 4,90273
2TeBa 299 4,92978
4E2tE 352 7,73285
4EYta 270 4,90335
5BAh9 360 5,76838
5nHV6 308 5,26570
8pcpf 351 6,42721
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9356
bl0n
11 56,1% 203 3.354E-83
2 phalp2_18920
24XdO
30 56,8% 213 2.204E-82
3 phalp2_1777
2SfqX
42 54,9% 202 4.974E-75
4 phalp2_8437
Ypuv
26 46,3% 231 1.681E-68
5 phalp2_32493
1fUkE
14 50,6% 217 5.562E-62
6 phalp2_23893
1IwiZ
3 45,7% 214 4.412E-60
7 phalp2_20034
1pbFu
55 31,6% 272 1.524E-37
8 phalp2_9984
3dkvN
1 34,6% 202 1.552E-35
9 phalp2_22258
7lGFm
11 30,4% 276 6.222E-34
10 phalp2_36852
7d3iJ
93 30,6% 333 2.127E-33

Domains

Domains
Representative sequence (used for alignment): gfrU (310 AA)
Member sequence: 4C3bx (345 AA)
1 310 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (gfrU) rather than this protein.
PDB ID
gfrU
Method AlphaFoldv2
Resolution 77.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50