Protein

Protein accession
7sMsN [EnVhog]
Representative
1bQAB
Source
EnVhog (cluster: phalp2_38435)
Protein name
7sMsN
Lysin probability
99%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MSVIGEPIGKSVVEVGLNDSKLVSGLTNLNAKMKLADNTWKQSLSTFKQSDRSIEKLSVSVKGMTDKLKVQSQIVEVHKQKVAKLTSEYGETHTKVIKANAELKKQEATFGNLKRSISEVTSEIEQLKKAEQINNSPWGKRSQELQRYSDRLSAVGDKMTSIGQNMSMTVTAPIAAGFGIAVKSSMDFEAQMDRVGAISDTTGSKFNNMTKLAMELGASTTKSASEVAKGMEEMAAKGYNANQIMQAMPGIISAAEASGSDMAQTAEVMASAMNAFGIEAGKSGHVADVLAQTANQSAADITDMQYALKYAAAPAHSLGMSLEETSASIGMMVDAGLKGEQAGTTLRGALLGLLDPSEQNSKTMDKMGIAITDNEGNFVGMSKLIGNLQESMEGMTDTQKAATLSQLVGKEAVSGMLVMMQKSPEQINKMTNALEQSDGASKKAADAMMDNLKGAVEEMKGAFETLGIQVGQDLTPMIKGLADGLQRAATNFSEMPGWARKTAVGIGLVVAATGPVILGLGIVAKSASTAASGLSRLTGTFAKNTVAAEVNAAANLAAGASIEKQSGKLGKVTGLFTNLSKGAIGAAGSVGLLGRAGSIATKGIGLFASGPVGIAIGVVATLATTFKLAYDHIGWFHDGVENTKKLLGEVASTIDFDWVGNLGNGIKDTGKWLADSTGKLARFGFEISPIGMISKNTFKVVGDSVKKATDTVDVFGKGVSKSTKKVLQEYTDLSMKASKKLEDLKINHKTIGDQQYKEVVSIYSKINADVTKKLGERHKRETDGLRKLLVDTKGISNQEKQRIVIEAQSGNAAEVKAAKTINKQIMDIYKKAKTEKRALTRTEENKIANLQKQMDQKVVASLSNSEKEQRIILGRLKSNKKTLSIQAASEVIKASAKERDESIKNARKKRDKTIDEAIYQRDITKNISKEQADKIIKDAERQYSGSKKNAEKQHKSVVDEAKKQNKGVRTEIDSQTGRVLTQWEKTKKNVSLSASFLTSYVNTQFKKSYENTSKWMSETKNSIGKKWSEIKTNVSKFAEDTKKAAVDKFESMYDGATKWVSNIGKFITDSKKGITDKASSMGKSVANGAIGGLNGMIDGINSISSKIMDKNLLSKIPKLSTGTVKDGAIAKPTLAVVGDKGPGNGPGGFKREIIHRANGDMELTPATDTLVHLNKGDKVYNGTQTYSLMQKGLIPRFSIGTAIKNGWENTMEFGSTVKESVVDASKMVGKIAGDVFEYIENPSKLVDIVLGKLGSAFDNVGGITGDLGKSAFTSIKNSLVSKVKEWLEEFSGGDVDGSEILNWPKTTPYSPNSPVPGYPASFNGGRHYGIDLGIPSGTTIHAPTSGTVSQQYNYGGGIVARLISGKIAQYFLHLSKVLKKGPVKQGDAIAKSGNSGAWTTGDHLHYQVENPASSELTNRNTMDPVAFLKSKVSSGKDTAGKSWASEIRRAASQMKVKITDGDVRNISAQINRESSGNQNIVQSSAVWDKNTASGNPAQGLLQYIPQTFRAYAVPGHTNIRSGYDQLLAFFNNSNWRNDNPGGRSGWGPSGVRRFANGGFVKDESYIAGEEYEEAIIPMDPKRRDRANQLLAEANYKVNGPIKLSKGTSNKTHRVKWGDTLWDISRKNRTTVKALQLLNGIKNHLIYPGQIIKLTGSITNLSNNVSKQTKVQSKPKASTSYISRAQSLYNTGKSILNRGKSSNKVTGKDDVNLGTLIMNNTKNLGSLSLEAAQKNIDTIVKKINSMITSSTGKISSLNNKISKSTNKKTIANARNDIKAYKAQIASFKKLKQNEVLKTNYLKNLIKEKSSLTAKLNQRTEEGKALQEEKTNYRSSIASNLQNYAGFGVAKGHTSRDFVSFMKYRLSKMKEYASNVRKLKSMGLDPILLRELLAGGIENSMPRVAALVKGGKGYIGQINTLQKSINAEVNKISSEQANFGYNSDINANNKQIQTLKNQQKKIDKKKVVYLNERKRITKSNVKANPKKPISNTSRTVTTMRTHNIKWGDTLGHIAQRYGTTVNELKKANNLKSDMIYAGRTLKVPTKKVVQLPKTQTALDKSTKYIMDTAKRYQLVNNSSKLNNLQKQLNKIKSDKDKKNDVVITKLEKTLRDLTKKYDKQDDVVKLLQQLVNKNPDILLNGVKLTKEMDKLLATNSKINARRKAR
Physico‐chemical
properties
protein length:2163 AA
molecular weight:234419,1 Da
isoelectric point:9,79
hydropathy:-0,49
Representative Protein Details
Accession
1bQAB
Protein name
1bQAB
Sequence length
2191 AA
Molecular weight
237253,91120 Da
Isoelectric point
9,77587
Sequence
MSVIGEPIGKSVVEVGLDDSKLVKGLSNLNAQMRLADNTWKQSLSTFKQSDRSIAKLSVSVKGMSDKLKAQSQIVEAHKQKVAKLTSEYGETHTKVIKANAELKKQEATFGNLKRSISEVTSEIEQLKKAEQINNSPWGKRSQELQLYSDRLSAVGDKMTSIGQNMSMTVTAPIAAGFGAAVKTSMDFEAQMDRVGAISDTTGSKFNNMTKLAMELGASTTKSASEVAKGMEEMAAKGYNANQIMQAMPGIISAAEASGSDMAQTAEVMASAMNAFGIEAGKSGHVADVLAQTANQSAADITDMQYALKYAAAPAHSLGMSLEETSASIGMMVDAGLKGEQAGTTLRGALLGLLDPSEQNSKTMDKMGIAITDNEGNFVGMSKLIGNLQESMEGMTDTQKAATLSQLVGKEAVSGMLVMMQKSPEQIDKMTNALEQSDGASKKAADAMMDNLKGAVEEMKGAFETLGIQVGQDLTPMIKGLADGLQRAATNFSEMPGWARKTAVGIGLVAGATGPVILGLGIVAKSASTAASGLSRLTGTFAKNTVAAEVNAAANLAAGASIEKQGGKLGKVTGLFTNLSKGATGAADSVGLLGRAGSIATKGIGLFASGPVGIAIGVVATLGTTFKLAYDHIGWFHDGVENTKKLLGEVASTIDFDWVGNLGNGIKDTGKWLADSTGKLARFGFEISPIGMISKNTFKVVGDSVKKATDTVDVFGKGVGKSTKKVLQEYTDLSMKASKKLEDLKISHKTIGDQQYKEVISIYSKINDDVTKKLDERHRRETDGLKKLLADTKGISNQEKKRVLAEAQSGNSAEVKAAQDINKQITNIYKKAKNEKRALTRTEENKIANLQKQMDQKVVASLSNSEKEQRIILGKLKSNKKTLSIQAASEVIKSSAKERDESIKNARKKRDKTIDEAIYQRDITKNISKEQADKIIKDAERQYSGSKKNAEKQHKSVVDEAKKQNKGVRTEIDSQTGRVLSQWEKTKKNVSLSASFLTSYVNTQFKKSYENTSKWMSETKNSIGKKWSEIKTNVSNFAEDTKKAAVDKFESMYDGATKWVSNIGKFITDSKKGITDKASSMGKSVANGAIGGLNGMIDGINKISSGIMDKNLLSKIPTLSTGTVKDGAIAKPTLAVVGDKGPGNGPNGFRQEIIQRSNGDMHLTPAKDTLVHLGKGDRVFSGAETYSMLNGNIPHFSKGTDDNFYSKLKKGAHNVKEHAMDGIGAVKKGVSHKVGQAKDVIGDIMDYIENPKALVDKVLDSMGIDFSGLGATGTLAKSAYNKLKIMLQNKIKDWFESSGGYGFNPFSNWKKTPGRGWAAGGHAGIDYAMPAGTPIPSPITGEVLQSWFSPYKPSGGNEVQIFADGFTHILMHMLNGSRKVKKGDHVTAGQIIGKVGNTGNSFGDHLHWQVNKGRGYMRNEDSIDPELWARKYAKSSSGKGTWTSQIKKAASKMGAKVNNQDISDIMSLINKESSGNETVVQHGYVDRNTGGNEARGLLQYTPGTFAGYKVPGYGNILSGYDQLLAFFNNSNWRGDLSAWQRRIASGSTGWGPSGSRKYSTGAYINEAHNAIVGDKGPGNGPNGFTRELIHRSNGDVQLTPNKDTLVNLGKGDRVLNGSQTYSILSDIFPKFSRGTKQKTHRVKWGDTLWDISRKNGTTVKALQLLNGIKNHLIYPGQIIKLTGSITNLSKNVSKQTKVQSKPKASTSYISRAQALYNTGKSILNRGKSSNKVTGKDDVNLGTLIMNNTKNLGSLSLEAAQKNIDTIVKKINSMITSSTGKISSLNNKISKSTNKKTIANARNDIKAYKAQIASLKKLKQNEVLKTNYLKNLIKEKSSLTAKLNQRTEEGKALQEEKTNYRSSIASNLQNYAGFGVAKGHTSRDFVSFMKYRLSKMKEYASNVRKLKSMGLDPILLRELLAGGIENSMPRVAALVKGGKGYIGQINTLQKSINAEVNKISSEQANFGYNSDINANNKQIQTLKNQQKKIDKKKVVYLNERKRITKSNVKANPKKPISNTSHTVTTMRTHNIKWGDTLGHIAQRYGTTVNELKKANNLKSDMIYAGRTLKVPTKKVVQLPKTQTALDKSTKYIMDTAKRYQLVNNSSKLNNLQKQLNKIKSDKDKKNDVVITKLEKTLRDLTKKYDKQDDVVKLLQQLVNKNPDILLNGVKLTKEMDKLLATNSKINARRKAR
Other Proteins in cluster: phalp2_38435
Total (incl. this protein): 16 Avg length: 2103,4 Avg pI: 9,82

Protein ID Length (AA) pI
1bQAB 2191 9,77587
7qCIo 2017 9,84711
7qCQy 2017 9,86310
7qCWf 2163 9,75337
7qD7X 2024 9,84711
7qD7m 2191 9,76820
7qDcl 2024 9,85298
7qDha 2045 9,85117
7sMm2 2024 9,85298
7sMoh 2163 9,73693
7soE4 2163 9,78477
7soHS 2052 9,86980
7syN6 2024 9,84711
7wA6f 2163 9,74138
7wlzN 2231 9,86316
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8198
7fTRt
93 35,2% 1700 2.285E-270
2 phalp2_3959
7qClx
89 30,8% 1652 9.358E-188
3 phalp2_15249
11Bcx
7 26,8% 1581 1.733E-184
4 phalp2_22501
1enlE
152 25,9% 1673 2.565E-107
5 phalp2_36245
7zsCl
45 23,0% 1740 2.911E-51
6 phalp2_9569
1jLlx
3 21,8% 1764 3.462E-37
7 phalp2_30717
7iNrE
51 23,6% 1777 7.232E-31
8 phalp2_9669
1Z1dN
2 21,6% 1893 1.252E-30
9 phalp2_8045
5HEPh
40 21,5% 1621 3.434E-24
10 phalp2_18582
7iQOB
13 20,0% 1882 1.163E-19

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1bQAB) rather than this protein.
PDB ID
1bQAB
Method AlphaFoldv2
Resolution 60.75
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50