Protein

Protein accession
A0A385DVT4 [UniProt]
Representative
7W7I4
Source
UniProt (cluster: phalp2_28555)
Protein name
Glycoside hydrolase/peptidoglycan hydrolase
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MKQRVYNILMLLLIGGLYGLYYIDYQEEHKEPVKVDVLRLEQPEFLLSEAPDDYLMEALEYYNVKHKNIVYAQAILETGHFRSKVCKEYNNLFGLYNSYKKDYYKFDHWSESVVAYLNYIQYRYKPPDDYYQFLIKIGYAEDPQYVEKLKNIVKRYE
Physico‐chemical
properties
protein length:157 AA
molecular weight:19128,6 Da
isoelectric point:5,84
hydropathy:-0,55
Representative Protein Details
Accession
7W7I4
Protein name
7W7I4
Sequence length
272 AA
Molecular weight
31532,16970 Da
Isoelectric point
6,41254
Sequence
MMWVIAILIIILTLGILKDTHVEVYHRYCGPAKLQEEYDVVIPLWMALIIVVLGLLPIANIILFAAFIIYYAIHAGWNPNECEDYTHVFSLRGDNIVTRGLLKVKNLLCKRVFNILISFAVGVLGAIQVHSYLKEDEPPEIKVVLHIDNKEKQPDFFSKSPQEGLMEALEYYGVKHPQIVYAQAVLETGHFKSDLCLNDNNLFGLYNSKKHRYHTFDHWTESVVAYLDYVQYRYKPPNDYYKFLSDIGYAEDPNYINKLKGIVSRNDKRRSE
Other Proteins in cluster: phalp2_28555
Total (incl. this protein): 10 Avg length: 178,6 Avg pI: 6,11

Protein ID Length (AA) pI
7W7I4 272 6,41254
69P7y 273 4,84498
A0A7M1RZ75 156 6,08685
A0A8S5S0H0 164 7,74252
A0AAE7V4P9 161 8,28672
A0AAE9X408 148 5,39120
A0AAF0BA35 148 5,39120
A0AAF0BBU7 148 5,39120
A0AAF0D5F1 159 5,72695
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23870
1CkWM
5 28,9% 190 3.859E-14

Domains

Domains [InterPro]
Unannotated
GLUCO
Representative sequence (used for alignment): 7W7I4 (272 AA)
Member sequence: A0A385DVT4 (157 AA)
1 272 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacteroides phage crAss001 (Bacteroides phage PhiCrAss001)
[NCBI]
2301731 Crassvirales > Steigviridae > Kehishuvirus > Kehishuvirus primarius
Host Bacteroides intestinalis
[NCBI]
329854

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MH675552 [NCBI]
CDS location
range 92569 -> 93042
strand -
CDS
ATGAAACAAAGGGTATATAATATCCTTATGCTCTTACTAATTGGTGGTCTGTATGGTTTATACTATATAGACTATCAAGAGGAGCACAAGGAACCTGTAAAGGTGGATGTGTTGAGATTGGAACAACCAGAGTTCTTACTATCAGAAGCTCCTGATGATTATCTTATGGAGGCTTTAGAGTATTATAATGTTAAACATAAGAACATTGTATATGCTCAGGCTATCCTTGAGACAGGTCATTTCAGGTCTAAGGTCTGCAAAGAGTACAATAATTTGTTTGGACTCTATAATAGTTATAAGAAAGACTATTATAAGTTTGACCATTGGAGTGAGAGTGTGGTTGCCTATCTCAATTACATACAATATAGATACAAACCCCCGGATGATTACTATCAATTTTTGATTAAAATAGGTTATGCGGAAGACCCGCAATATGTAGAAAAACTAAAGAATATAGTAAAGAGATATGAATAG

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7W7I4) rather than this protein.
PDB ID
7W7I4
Method AlphaFoldv2
Resolution 87.74
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50