Protein

Protein accession
A0A8F3EC74 [UniProt]
Representative
1gHkc
Source
UniProt (cluster: phalp2_4061)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MFNRNPAFEAGVIDQALRHVGYRSQPNRQSAMQIKGYAGKPWNGTFVDRVLHDAFGDFAEVRFVSTVAALGFYAARNRLYRKARPGDVVFFNFASNPQEAFEQPHVGIVTEVRKDGSFRTVEGETAPGNPQGSQLADGVFERTRYRSDVLAFVRPEPRTVSKPEGVEPTAVKMSYFESNGETVKRAVEKVQIALNIARPALSFNRGKRDPLFKSGLGLYARESGHVGNRGEITMPVLQTLEDETGLGVQP
Physico‐chemical
properties
protein length:250 AA
molecular weight:27693,9 Da
isoelectric point:9,48
hydropathy:-0,46
Representative Protein Details
Accession
1gHkc
Protein name
1gHkc
Sequence length
256 AA
Molecular weight
27586,44890 Da
Isoelectric point
8,37369
Sequence
MSVEDVLRVAAGEIGYTRWNDPLPGTKYGRWYAQDHGAYYGTSGVPYCAMFASWVYDQCPNDGIPGGYQAYVPYFIDQARRANALVPFNNAQPGDLICFDWDGNGVADHVGIVAARPSGNAISTIEGNTSSGNSGSQSNGGGVYARTRYRASVAAIIRTTHRATPTPTPTPTPTPINQLEVINQMKATHIVFEYGGFTSVADVLAGTWRRFPTGKEYANAMTALKRAGAIVKSWQELGGKTNHVDDPNGAFGKRIL
Other Proteins in cluster: phalp2_4061
Total (incl. this protein): 4 Avg length: 258,3 Avg pI: 9,43

Protein ID Length (AA) pI
1gHkc 256 8,37369
I3NL82 277 10,37408
A0A6G6XI13 250 9,47893
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2684
7236j
6 61,2% 160 6.319E-62
2 phalp2_20813
65GA0
344 50,9% 218 2.217E-61
3 phalp2_552
ESm7
27 57,5% 160 1.752E-53
4 phalp2_5932
67zj5
27 40,7% 179 1.023E-35
5 phalp2_37209
21s1y
4 38,8% 162 4.665E-30
6 phalp2_1568
7XJtL
188 40,2% 189 8.663E-30
7 phalp2_40071
41qFE
139 36,0% 172 4.918E-26
8 phalp2_33264
5Ekx7
97 35,3% 164 3.629E-22
9 phalp2_29160
7s0gR
24 39,2% 181 2.274E-21
10 phalp2_40270
2UpwZ
7 36,2% 174 3.549E-20

Domains

Domains [InterPro]
CHAP
Unannotated
Representative sequence (used for alignment): 1gHkc (256 AA)
Member sequence: A0A8F3EC74 (250 AA)
1 256 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257

Taxonomy

  Name Taxonomy ID Lineage
Phage Microbacterium phage A3Wally
[NCBI]
2836046 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MZ150783 [NCBI]
CDS location
range 1281 -> 2033
strand +
CDS
ATGTTCAACCGTAACCCCGCGTTCGAGGCGGGCGTCATCGATCAGGCTCTCCGCCACGTCGGCTACCGCTCGCAGCCCAACCGTCAGTCCGCGATGCAGATCAAGGGCTACGCGGGCAAGCCGTGGAACGGTACGTTCGTGGACCGTGTGCTGCATGACGCATTCGGTGACTTCGCGGAGGTCCGGTTCGTCTCAACCGTGGCGGCTCTCGGATTCTACGCAGCACGAAACCGTCTGTACCGTAAGGCGCGTCCCGGCGATGTCGTGTTCTTCAACTTCGCGTCCAACCCGCAGGAGGCGTTCGAGCAACCGCACGTCGGGATCGTGACCGAGGTCCGCAAGGACGGCAGCTTCCGAACCGTGGAGGGTGAGACTGCACCGGGCAACCCCCAAGGCTCGCAGCTTGCGGACGGCGTGTTCGAACGGACCCGCTATCGCTCCGATGTGCTGGCGTTCGTGCGGCCCGAACCGCGAACCGTCTCTAAGCCCGAGGGCGTGGAACCGACCGCCGTCAAGATGAGCTATTTCGAGTCGAACGGCGAGACCGTAAAGCGCGCGGTGGAAAAGGTGCAGATCGCGCTCAACATCGCGCGGCCCGCCCTGAGCTTCAACCGTGGCAAGCGTGATCCGCTGTTCAAGAGCGGCCTGGGGCTGTACGCGCGCGAGAGCGGGCACGTCGGCAACCGTGGCGAAATCACGATGCCGGTACTCCAGACGCTGGAGGACGAGACCGGCCTGGGTGTGCAACCGTGA

CDS Source ID
CDS Source
MZ150783 [NCBI]
CDS location
range 180502 -> 181254
strand +
CDS
ATGTTCAACCGTAACCCCGCGTTCGAGGCGGGCGTCATCGATCAGGCTCTCCGCCACGTCGGCTACCGCTCGCAGCCCAACCGTCAGTCCGCGATGCAGATCAAGGGCTACGCGGGCAAGCCGTGGAACGGTACGTTCGTGGACCGTGTGCTGCATGACGCATTCGGTGACTTCGCGGAGGTCCGGTTCGTCTCAACCGTGGCGGCTCTCGGATTCTACGCAGCACGAAACCGTCTGTACCGTAAGGCGCGTCCCGGCGATGTCGTGTTCTTCAACTTCGCGTCCAACCCGCAGGAGGCGTTCGAGCAACCGCACGTCGGGATCGTGACCGAGGTCCGCAAGGACGGCAGCTTCCGAACCGTGGAGGGTGAGACTGCACCGGGCAACCCCCAAGGCTCGCAGCTTGCGGACGGCGTGTTCGAACGGACCCGCTATCGCTCCGATGTGCTGGCGTTCGTGCGGCCCGAACCGCGAACCGTCTCTAAGCCCGAGGGCGTGGAACCGACCGCCGTCAAGATGAGCTATTTCGAGTCGAACGGCGAGACCGTAAAGCGCGCGGTGGAAAAGGTGCAGATCGCGCTCAACATCGCGCGGCCCGCCCTGAGCTTCAACCGTGGCAAGCGTGATCCGCTGTTCAAGAGCGGCCTGGGGCTGTACGCGCGCGAGAGCGGGCACGTCGGCAACCGTGGCGAAATCACGATGCCGGTACTCCAGACGCTGGAGGACGAGACCGGCCTGGGTGTGCAACCGTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gHkc) rather than this protein.
PDB ID
1gHkc
Method AlphaFoldv2
Resolution 79.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50