Protein

Protein accession
40EaA [EnVhog]
Representative
40EaA (this protein)
Source
EnVhog (cluster: phalp2_25237)
Protein name
40EaA
Lysin probability
99%
PhaLP type
endolysin
Probability: 93% (predicted by ML model)
Protein sequence
MAIDAFFPKITLPKGETDIRKGEGLGIKYTSENGKVIVMVVDAFEGGEPTRGLIDWLKSIDVKQIDLAVATHAHGDHFGGFYDIAEAGIKIKDFRCYHIDSIREGNSESRKDSDNLLKLIRWLQARGTRVLFVDHGDVLEFGDISWHIYRKQPKGPAAKDDPNAWSYVNDGSLVLCSPELSGIIFGDGPAAQKEAIAYFQKKFGKTKFLIWVTVSHHGGHFSMSNAQSARNAGCEFAYESCVERRGPGKEEWTEFGARRLIQNGVTVWMQDENIYIHAAAGKITFKQGNKTLTYDIPYQGKESKVTGKWEYGTKGWWYKYSNGGYATGWKQIVYKGEPCWFLFDDDGWMLTGWRKDDGKWYFLDYKTGVMLKGWHKLPHGPNNVEDWYLLNKSGEMLSGWQWSTKDGKTGWYYLDPDNGMRTGWIYDDGEWYCLGNDGRMLTGWVTYKGRKCYLEPLSDKTHVQGVCYRSRTAVIDGKSYKFDKDGYAEEIAVGGGGKAELNGCDVASYQAGIDFAKMTTTDFAIIKGTQGTWYVNPYADIQYSGAKAAGKLLGMYHYAEGKDPIAEAQYFIRKVGSRVGSCILALDWEGHDNSKFNSDAEVAWVLKFAMEVYRLTKVHIFLYMSKSVTRRRNWAEVAKDVRLWCAQYANQEHTDYKSNPWTDNGGFGAWKYDTIRQYSSKGRVTGYGKDLDINRAYMSRAEWLAAAAGKNTVVATAAKPKTQWSAYVSMTTSPVKISNSGSDENGKYKGGKAGDQTGKEWRIRDWYNYPWNCVLRHPLAEVRACLATMAVKAAENDNIGYDQLQRDDYGIELAKNDYDPSKIKKPVESDCSKGVIDNTKGTGHTLGIPELQSIKATYTGNMRAAYKAAGFYVLTESKYLTSSDYLLAGDILLNDKHHTCTVITNGSKSGDTVVMPLVKKGSTGAAVEQLQEMLIAQGYSCGKWGADGDFGNDTLQAVKSYQRDHGLDVDGEVGPMTWGVLFK
Physico‐chemical
properties
protein length:983 AA
molecular weight:110079,7 Da
isoelectric point:7,92
hydropathy:-0,53
Other Proteins in cluster: phalp2_25237
Total (incl. this protein): 18 Avg length: 920,2 Avg pI: 7,47

Protein ID Length (AA) pI
1cLZi 925 8,07139
1cMec 970 6,67144
1cNub 944 8,42661
23FN9 933 8,17983
23qEI 928 7,71711
23vwv 839 6,85446
23ysT 794 7,29019
23zUz 988 6,24828
38GEP 935 7,71353
3TNAy 880 5,83619
3VGNn 929 8,31592
3VLtx 930 7,73626
3ielO 907 7,48032
409eA 841 6,14392
7q1lF 981 8,30754
7q23E 935 7,21266
8130e 921 8,41952
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15776
3WJi7
6 27,8% 1032 2.564E-170
2 phalp2_12980
3ibDa
3 27,2% 801 2.351E-137
3 phalp2_24350
3WE0c
3 25,0% 706 5.705E-86

Domains

Domains
Unannotated
Choline_bind_3
GH25
Unannotated
PG_1
Protein sequence: 40EaA
1 983
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF01471, PF19127

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
40EaA
Method AlphaFoldv2
Resolution 81.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50