ID   YP_009725295.1          Unreviewed;      4405 AA.
AC   YP_009725295.1;
DT   24-MAR-2020, integrated from genpept.
DE   RecName: Full=orf1a polyprotein
GN   Name=orf1ab
OS   Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
CC   -!- CAUTION: The sequence is derived from genpept 'YP_009725295.1'.
CC   -!- NOTE:
SQ   SEQUENCE 4405 AA;
     MESLVPGFNE KTHVQLSLPV LQVRDVLVRG FGDSVEEVLS EARQHLKDGT CGLVEVEKGV
     LPQLEQPYVF IKRSDARTAP HGHVMVELVA ELEGIQYGRS GETLGVLVPH VGEIPVAYRK
     VLLRKNGNKG AGGHSYGADL KSFDLGDELG TDPYEDFQEN WNTKHSSGVT RELMRELNGG
     AYTRYVDNNF CGPDGYPLEC IKDLLARAGK ASCTLSEQLD FIDTKRGVYC CREHEHEIAW
     YTERSEKSYE LQTPFEIKLA KKFDTFNGEC PNFVFPLNSI IKTIQPRVEK KKLDGFMGRI
     RSVYPVASPN ECNQMCLSTL MKCDHCGETS WQTGDFVKAT CEFCGTENLT KEGATTCGYL
     PQNAVVKIYC PACHNSEVGP EHSLAEYHNE SGLKTILRKG GRTIAFGGCV FSYVGCHNKC
     AYWVPRASAN IGCNHTGVVG EGSEGLNDNL LEILQKEKVN INIVGDFKLN EEIAIILASF
     SASTSAFVET VKGLDYKAFK QIVESCGNFK VTKGKAKKGA WNIGEQKSIL SPLYAFASEA
     ARVVRSIFSR TLETAQNSVR VLQKAAITIL DGISQYSLRL IDAMMFTSDL ATNNLVVMAY
     ITGGVVQLTS QWLTNIFGTV YEKLKPVLDW LEEKFKEGVE FLRDGWEIVK FISTCACEIV
     GGQIVTCAKE IKESVQTFFK LVNKFLALCA DSIIIGGAKL KALNLGETFV THSKGLYRKC
     VKSREETGLL MPLKAPKEII FLEGETLPTE VLTEEVVLKT GDLQPLEQPT SEAVEAPLVG
     TPVCINGLML LEIKDTEKYC ALAPNMMVTN NTFTLKGGAP TKVTFGDDTV IEVQGYKSVN
     ITFELDERID KVLNEKCSAY TVELGTEVNE FACVVADAVI KTLQPVSELL TPLGIDLDEW
     SMATYYLFDE SGEFKLASHM YCSFYPPDED EEEGDCEEEE FEPSTQYEYG TEDDYQGKPL
     EFGATSAALQ PEEEQEEDWL DDDSQQTVGQ QDGSEDNQTT TIQTIVEVQP QLEMELTPVV
     QTIEVNSFSG YLKLTDNVYI KNADIVEEAK KVKPTVVVNA ANVYLKHGGG VAGALNKATN
     NAMQVESDDY IATNGPLKVG GSCVLSGHNL AKHCLHVVGP NVNKGEDIQL LKSAYENFNQ
     HEVLLAPLLS AGIFGADPIH SLRVCVDTVR TNVYLAVFDK NLYDKLVSSF LEMKSEKQVE
     QKIAEIPKEE VKPFITESKP SVEQRKQDDK KIKACVEEVT TTLEETKFLT ENLLLYIDIN
     GNLHPDSATL VSDIDITFLK KDAPYIVGDV VQEGVLTAVV IPTKKAGGTT EMLAKALRKV
     PTDNYITTYP GQGLNGYTVE EAKTVLKKCK SAFYILPSII SNEKQEILGT VSWNLREMLA
     HAEETRKLMP VCVETKAIVS TIQRKYKGIK IQEGVVDYGA RFYFYTSKTT VASLINTLND
     LNETLVTMPL GYVTHGLNLE EAARYMRSLK VPATVSVSSP DAVTAYNGYL TSSSKTPEEH
     FIETISLAGS YKDWSYSGQS TQLGIEFLKR GDKSVYYTSN PTTFHLDGEV ITFDNLKTLL
     SLREVRTIKV FTTVDNINLH TQVVDMSMTY GQQFGPTYLD GADVTKIKPH NSHEGKTFYV
     LPNDDTLRVE AFEYYHTTDP SFLGRYMSAL NHTKKWKYPQ VNGLTSIKWA DNNCYLATAL
     LTLQQIELKF NPPALQDAYY RARAGEAANF CALILAYCNK TVGELGDVRE TMSYLFQHAN
     LDSCKRVLNV VCKTCGQQQT TLKGVEAVMY MGTLSYEQFK KGVQIPCTCG KQATKYLVQQ
     ESPFVMMSAP PAQYELKHGT FTCASEYTGN YQCGHYKHIT SKETLYCIDG ALLTKSSEYK
     GPITDVFYKE NSYTTTIKPV TYKLDGVVCT EIDPKLDNYY KKDNSYFTEQ PIDLVPNQPY
     PNASFDNFKF VCDNIKFADD LNQLTGYKKP ASRELKVTFF PDLNGDVVAI DYKHYTPSFK
     KGAKLLHKPI VWHVNNATNK ATYKPNTWCI RCLWSTKPVE TSNSFDVLKS EDAQGMDNLA
     CEDLKPVSEE VVENPTIQKD VLECNVKTTE VVGDIILKPA NNSLKITEEV GHTDLMAAYV
     DNSSLTIKKP NELSRVLGLK TLATHGLAAV NSVPWDTIAN YAKPFLNKVV STTTNIVTRC
     LNRVCTNYMP YFFTLLLQLC TFTRSTNSRI KASMPTTIAK NTVKSVGKFC LEASFNYLKS
     PNFSKLINII IWFLLLSVCL GSLIYSTAAL GVLMSNLGMP SYCTGYREGY LNSTNVTIAT
     YCTGSIPCSV CLSGLDSLDT YPSLETIQIT ISSFKWDLTA FGLVAEWFLA YILFTRFFYV
     LGLAAIMQLF FSYFAVHFIS NSWLMWLIIN LVQMAPISAM VRMYIFFASF YYVWKSYVHV
     VDGCNSSTCM MCYKRNRATR VECTTIVNGV RRSFYVYANG GKGFCKLHNW NCVNCDTFCA
     GSTFISDEVA RDLSLQFKRP INPTDQSSYI VDSVTVKNGS IHLYFDKAGQ KTYERHSLSH
     FVNLDNLRAN NTKGSLPINV IVFDGKSKCE ESSAKSASVY YSQLMCQPIL LLDQALVSDV
     GDSAEVAVKM FDAYVNTFSS TFNVPMEKLK TLVATAEAEL AKNVSLDNVL STFISAARQG
     FVDSDVETKD VVECLKLSHQ SDIEVTGDSC NNYMLTYNKV ENMTPRDLGA CIDCSARHIN
     AQVAKSHNIA LIWNVKDFMS LSEQLRKQIR SAAKKNNLPF KLTCATTRQV VNVVTTKIAL
     KGGKIVNNWL KQLIKVTLVF LFVAAIFYLI TPVHVMSKHT DFSSEIIGYK AIDGGVTRDI
     ASTDTCFANK HADFDTWFSQ RGGSYTNDKA CPLIAAVITR EVGFVVPGLP GTILRTTNGD
     FLHFLPRVFS AVGNICYTPS KLIEYTDFAT SACVLAAECT IFKDASGKPV PYCYDTNVLE
     GSVAYESLRP DTRYVLMDGS IIQFPNTYLE GSVRVVTTFD SEYCRHGTCE RSEAGVCVST
     SGRWVLNNDY YRSLPGVFCG VDAVNLLTNM FTPLIQPIGA LDISASIVAG GIVAIVVTCL
     AYYFMRFRRA FGEYSHVVAF NTLLFLMSFT VLCLTPVYSF LPGVYSVIYL YLTFYLTNDV
     SFLAHIQWMV MFTPLVPFWI TIAYIICIST KHFYWFFSNY LKRRVVFNGV SFSTFEEAAL
     CTFLLNKEMY LKLRSDVLLP LTQYNRYLAL YNKYKYFSGA MDTTSYREAA CCHLAKALND
     FSNSGSDVLY QPPQTSITSA VLQSGFRKMA FPSGKVEGCM VQVTCGTTTL NGLWLDDVVY
     CPRHVICTSE DMLNPNYEDL LIRKSNHNFL VQAGNVQLRV IGHSMQNCVL KLKVDTANPK
     TPKYKFVRIQ PGQTFSVLAC YNGSPSGVYQ CAMRPNFTIK GSFLNGSCGS VGFNIDYDCV
     SFCYMHHMEL PTGVHAGTDL EGNFYGPFVD RQTAQAAGTD TTITVNVLAW LYAAVINGDR
     WFLNRFTTTL NDFNLVAMKY NYEPLTQDHV DILGPLSAQT GIAVLDMCAS LKELLQNGMN
     GRTILGSALL EDEFTPFDVV RQCSGVTFQS AVKRTIKGTH HWLLLTILTS LLVLVQSTQW
     SLFFFLYENA FLPFAMGIIA MSAFAMMFVK HKHAFLCLFL LPSLATVAYF NMVYMPASWV
     MRIMTWLDMV DTSLSGFKLK DCVMYASAVV LLILMTARTV YDDGARRVWT LMNVLTLVYK
     VYYGNALDQA ISMWALIISV TSNYSGVVTT VMFLARGIVF MCVEYCPIFF ITGNTLQCIM
     LVYCFLGYFC TCYFGLFCLL NRYFRLTLGV YDYLVSTQEF RYMNSQGLLP PKNSIDAFKL
     NIKLLGVGGK PCIKVATVQS KMSDVKCTSV VLLSVLQQLR VESSSKLWAQ CVQLHNDILL
     AKDTTEAFEK MVSLLSVLLS MQGAVDINKL CEEMLDNRAT LQAIASEFSS LPSYAAFATA
     QEAYEQAVAN GDSEVVLKKL KKSLNVAKSE FDRDAAMQRK LEKMADQAMT QMYKQARSED
     KRAKVTSAMQ TMLFTMLRKL DNDALNNIIN NARDGCVPLN IIPLTTAAKL MVVIPDYNTY
     KNTCDGTTFT YASALWEIQQ VVDADSKIVQ LSEISMDNSP NLAWPLIVTA LRANSAVKLQ
     NNELSPVALR QMSCAAGTTQ TACTDDNALA YYNTTKGGRF VLALLSDLQD LKWARFPKSD
     GTGTIYTELE PPCRFVTDTP KGPKVKYLYF IKGLNNLNRG MVLGSLAATV RLQAGNATEV
     PANSTVLSFC AFAVDAAKAY KDYLASGGQP ITNCVKMLCT HTGTGQAITV TPEANMDQES
     FGGASCCLYC RCHIDHPNPK GFCDLKGKYV QIPTTCANDP VGFTLKNTVC TVCGMWKGYG
     CSCDQLREPM LQSADAQSFL NGFAV
//