peptimizer-test / cpp_generator_dataset.txt
willengler-uc's picture
Upload data files
b91d4bf
KLALKLALKALQAALQLA
VSRRRRRRGGRRRR
VKRFKKFFRKLKKKV
KLALKAALKAWKAAAKLA
LIRLWSHLIHIWFQNRRLKWKKKGGC
AVPAENALNNPF
MAARLCCQLDPARDV
MIIYRALIS
GALFLGWLGAAGSTMGAPKSKRKVGGC
KCFMWQEMLNKAGVPKLRCARK
KSTGKANKITITNDKGRLSK
KKAAQIRSQVMTHLRVI
GGGRRRRRRYGRKKRRQRR
RLHRRLHRRLHRLHRRLHRLHRRLHRRLH
CAYGRKKRRQRRR
KLALKLALKALKAALKLA
ACRGSGRGCGRGSGRCG
YTFGLKTSFNVQ
FLGKKFKKYFLQLLK
RHHLRHLRRHLRHLLRHLRHHLRHLRRHLRHLL
KKKKKKKK
AKVKDEPQRRSARLSAKPAPPKPEPKPKKAPAKK
GGAYVTRSSAVRLRSSVPGVRLLQ
ACSGRGRGCGRGRGSCG
LALALALALALALALAKLAKLAKLAKLAKLAKKIK
GLKKLAELFHKLLKLG
KWCFAVCYAGICYAACAGK
NSGTMQSASRAT
GSVSRRRRRRGGRRRR
ACGRGRGRCRGRGRGCG
AGYLLGHINLHHLAHLHHIL
KLIKGRTPIKFGKADCDRPPKHSQNGMGK
KRKRWHW
RRRRRRRQIKILFQNRRMKWKKGGC
GWTLNSAGYLLGKINLKALAALAKKIL
CRWRFKCCKK
KETWWETWWTEWSQPGRKKRRQRRRPPQ
KRIPNKKPKK
MVRRFLVTLRIRRACGPPRVRV
GYGYGYGYGYGYGYGYKKRKKRKKRKKRKQQKQQKRRK
KLIKGRTPIKFGKADCDRPPKHSQNGK
QPIIITSPYLPS
VIRVHFRLPVRTV
KDCRWRWKCCKK
KITLKLAIKAWKLALKAA
NYQWRCKNQN
YGRRRRRRRRR
RQIKIWFQNRRMKW
KKLALHALHLLALLWLHLAHLALKK
RRIRPRPPRLPRPRPRPLPFPRPG
WKCRRQCFRVLHHWN
GLWRALWRALWRSLWKKKRKV
CGRKKRRQRWWRRPPQ
NHQQQNPHQPPMLLIILRRRIRKQAHAHSK
RKKRRQRAR
DAATARGRGRSAASRPTERPRAPARSASRPRRPVD
MLLLTRRRST
KIAAKSIAKIWKSILKIA
WLKLLKKWLKLWKKLLKLW
RRRRRRRRRRRRRRR
RRRRRRRRRRRR
TRRQRTRRARRNRGC
VKRGLKLRHVRPRVTRMDV
AEAEAEAEAKAKAKAK
LLIILRARIRKQAHAHSK
RHHLRHLRRHL
NRARRNRRRVR
ALALALALALALALALKIKKIKKIKKIKKLAKLAKKIK
DITYRFRGPDWL
MIAYRDLIS
GGVCPKILRRCRRDSDCPGACICRGNGWCGSGSD
RRRRRRRRRRRTYADFIASGRTGRRNAI
CGRKKRRQRAARRPPQ
CGGGGYGRKKRRQRRR
GRRHHCRSKAKRSRHH
SNPWDSLLSVST
AKKRRQRRR
LLIILRRRIRKQAHAHSK
CWKKKKKKKKKKKKK
RLWMRWYSPTTRRAG
MIIFRIAAYHKK
RRWRRWWRRWWRRWRR
MIIYRDLIA
CGRKKRLLRQRRRPPQ
CGRKKRRQRRLLRPPQ
RRRRRRRRK
LSTAADMQGVVTDGMASGLDKDYLKPDD
CWKKKKKKKK
RQIKIWFQNRRMKWKKGG
YARKARRAARR
GRCTKSIPPICWPD
ESGGGGSPGRRRRRRRRRRR
CGYGRKKRRQRRRGC
PRPRPRPLPFPRPG
HHHHHHTKRRITPKDVIDVRSVTTEINT
YKRKARRAARR
GLWWKAWWKAWWKSLWWRKRKRKA
SRRHHCRSAAKRSRHH
KMTRAQRRAAARRNRWTARGC
NKPILVFY
RLFMRFYSPTTRRYG
RQIKIWFQNRRMAWKK
RQIKIWAQNRRMKWKK
AAVACRICMRNFSTRQARRNHRRRHRR
AKKAKAAKKAKAAKKAKAAKKAKAAKKAKA
KCFQWQRNMRKVRGPPVSC
FLKLLKKFLKLFKKLLKLF
RKKRRQRRA
GLGSLLKKAGKKLKQPKSKRKV
GLKKLAELAHKLLKLGC
NYRWRCKNQ
WRWKKKKA
HHHHHHRRRRRRRRR
DRRRRGSRPSGAERRRRRAAAA
YGRKKRRQRRRGLFGAIAGFIENGWEGMIDGWYG
KLTRAQRRAAARKNKRNTRGC
RRRRRRRRGC
GRKRKKRT
ACRRSRRGCGRRSRRCG
LGTYTQDFNKFHTFPQTAIGVGAP
RAKRRQRRR
RLWMRWASPTTRRYG
RQIKIWFQNRRMKWKKTYADFIASGRTGRRNAI
RRKLSQQKEKK
ALWMTLLKKVLKAAAKAALNAVLVGANA
RWRWKCCKK
RVREWWYTITLKQES
SRRHHCRAKAKRSRHH
ARRRCSGSGSGCGSGSGSCGRRR
ELALELALEALEAALELA
LLIILARRIRKQAHAHSK
AYLLGKINLKALAALAKKIL
MIIFRIAASHKK
RTLVNEYKNTLKFSK
KKPGKKTTTKPTKKPTIKTTKK
ACSDRFRNCPADEALCGRRRRRRRR
VKLPPPVKLPPP
GLKKLAELAHKLLKLG
HEHEHEHEHEHEHEHEEFGGGGGYGRGRGRGRGRGRG
KFHTFPQTAIGVGAP
RKKRAQRRR
HSDGIFTDSYSRYRKQMAVKKYLAAVLGKRYKQRVKNK
GALFLGFLGAAGSTMGAWSQPKSKRKV
ACRGRGRGCGSGSRSCG
HEHEHEHEHEHEHEHEEFGGGGGYGRRRRRRGGGGGG
HHHHHHHHHHHHRRRRRRRRRRRRRRR
YGRRARRRARR
CRWRWKSSKK
LLRILRRSIRRARRAIRR
KKWALLALALHHLAHLALHLALALKKAHHHHHH
PRPPRLPRPRPRPLPFPRPG
ACSSSPSKHCGGGGRRRRRRRRR
SLGWMLPFSPPF
RKKRRQRRRGGGKLLKLLLKLLLKLLK
KRIPNKKPGKKTTTKPTKKPTIKTTKK
RLLMRLYSPTTRRYG
LNSAGYLLGKALAALAKKIL
INLKALAALAKKIL
NHQQQNPHQPPM
LLETLLKPFQCRICMRNFSTRQARRNHRRRHRR
VHLPPP
KLALKALKAALKLA
RQIKIWFQNRRMKWKKC
RLLRLLLRLWRRLLRLLR
VTPHHVLVDEYTGEWVDSQFK
LLGKINLKALAALAKKIL
DRRRRGSRPSGAERRRR
RIFIGC
WIIFRIAATHKK
MIIFRIAATHKK
RKKRRQRRR
TWLKYH
MVTVLFRRLRIRRACGPPRVRV
RHVYHVLLSQ
GLWRALWRGLRSLWKKKRKV
IKIWFQNRRMKWKK
WRFKAAVALLPAVLLALLAP
YARAAARQARAKALARQLGVAA
GGVCPKILKKCRRDSDCPGACICRGNGWCGSGSD
CSSLDEPGRGGFSSESKV
WRFKKSKRKV
YARAARRAARR
RRRRRRRRRRRRGC
CGRKKRLLRQRLLRLLRPPQ
FTFHFTFHF
KKRRQRRR
GKKTNLFSALIKKKKTA
RRIRPRPPRLPRPRPRP
NYRWRCKNQN
PKKKRKV
GRRRRRRRRR
RLWMRWYSPTTRRYG
GRKKRRQRARPPQC
ACRDRFRRCPADRRLCG
AAVALLPAVLLALLAK
KRIPNKKPGKK
MIIYRIAASHKK
LRHLLRHLLRHLRHL
AAVALLPAVLLALLAP
RARARARARARARARARARARARARARARARA
KKKKKKKKK
LLKTTALLKTTALLKTTA
RRHHCRSKAKRSR
ACRDRFRRCPADERLCG
YGRKKRRQRRRYGRKKRRQRRRYGRKKRRQRRR
CIGAVLKVLTTGLPALISWIKRKRQQ
GRKLKKKKNEKEDKRPRT
CRNGRGPDC
PPHNRIQRRLNM
CWKKKKKKKKKKKKKKKKKK
LLIILRRRARKQAHAHSK
AYRIKPTFRRLKWKYKGKFW
RWRRWWRRW
TRSSRAGLQWPVGRVHRLLRKGGC
ARCSDRFRNCPADEALCGR
KLWMRWWSPTTRRYG
WRRRRRRRR
YKQCHKKGGHCFPKEKICLPPSSDFGKMDCRWRWKCCKKGSG
IYRDLISH
HHHHHHHHRRRRRRRR
GSRHPSLIIPRQ
AIIYRDLIS
YARAAARQARA
GLWWRLWWRLRSWFRLWFRA
RIFIRIGC
TSHTDAPPARSP
VKRFKKFFRKLKKLV
SYIQRTPSTTLP
HYRIKPTFRRLAWKYKGKFW
MIIYRDL
FFKKLALHALHLLALLWLHLAHLALKK
AKKKAAKAAKKKAAKAAKKKAAKA
KRIPNKKPGKKTTTKPTKKPTIK
YEREARRAARR
RGGRLSYSRRRFSTSTGRA
HATKSQNINF
RSVTTEINTLFQTLTSIAEKVDP
SRAHHCRSKAKRSRHH
GYGRKKRRGRRRTHRLPRRRRRR
RQGAARVTSWLGRQLRIAGKRLEGRSK
GWTLNPAGYLLGKINLKALAALAKKIL
KHKALHALHLLALLWLHLAHLAKHK
RLIMRIYAPTTRRYG
EEEAAKKK
KWSFRVSYRGISYRRSRGK
KKKKKKNKKLQQRGD
TAMRAVDKLLLHLKKLFREGQFNRNFESIIICRDRT
GRQLRIAGKRLRGRSK
DCRWRWKCCKK
HEHEHEHEHEHEHEHEHEHEEFGGGGGYGRRRRRRGGGGGG
KMDRWRWKKK
MIIFRAAASHKK
NAKTRRHERRRKLAIERGC
ACRGRGRRCGSGSRSCG
GRKKRRQRRRG
GRKKRRQRRRPPQK
RQAKIWFQNRRMKWKK
ACGRGRGRCGRGRGRCG
RKKRRQRRRPPQCAAVALLPAVLLALLAP
RRRRWWWWRRRR
KETWFETWFTEWSQPKKKRKV
RRLRHLRHHYRRRWHRFR
CGAYDLRRRERQSRLRRRERQSR
FLIFIRVICIVIAKLKANLMCKT
KLALKAAAKAWKAAAKAA
LLIILRRRIRKQAHAHAK
NKRILIRIMTRP
YGRKKKRRQRRR
FFFAAGRKRKKRT
AAVALLPAVLLALLAPSGASGLDKRDYV
AAVALLPAVLLALLAPRRRRRR
KGRTPIKFGKADCDRPPKHSQNGMGK
AGYLLGKINLKALAALAKKIL
YSSYSAPVSSSLSVRRSYSSSSGS
VQAILRRNWNQYKIQ
KTIEAHPPYYAS
YTAIAWVKAFIRKLRK
LLIILRRAIRKQAHAHSK
RRRRWWWW
TARRITPKDVIDVRSVTTEINT
CHAIYPRH
RKKRRRESRKKRRRES
YGRRARRRRRR
CRWRWKCG
MIIYRDAIS
RRRRRRRRRRRRRRRR
RILQQLLFIHFRIGCRH
RLSGMNEVLSFRWL
SRRARRSPRHLGSG
QWQRNMRKVR
RKKRRQRR
RGERLERRELRLERRELRC
MIIFRILISHKK
KTIPSNKPKKK
ARTINAQQAELDSALLAAAGFGNTTADVFDRG
RQIKIWFQNRRMK
LLIILRRRIRAQAHAHSK
RLWMRWYSPWTRRWG
KLAKLAKKLAKLAK
RHIKIWFQNRRMKWKK
RQIKIW
TKRRITPKDVIDV
ACSDRFRNCPADEALCG
GRQLRIAGKRLEGRSK
MRRIRPRPPRLPRPRPRPLPFPRPGGCYPG
CELAGIGILTVKKKKKQKKK
GACTKSIPPICFPD
GGVCPAILKKCRRDSDCPGACICRGNGYCGSGSD
KETWWETWWTEWSQPKKKRKVC
MIIFRDLISH
KIAKLKAKIQKLKQKIAKLK
KKDGKKRKRSRKESYSVYVYKVLKQ
SPMQKTMNLPPM
FFLIPKGRRRRRRRRR
RQLRIAGRRLRGRSR
ACSGSGSGCGSGSGSCG
GSGKKGGKKHCQKY
PARAARRAARR
RRIRPRP
SATGAPWKMWVR
VRLPPPVRLPPP
RILQQLLFIHFRIGCRHSRI
YGRKKRRQRRRGCYGRKKRRQRRRG
RVIRWFQNKRCKDKK
FQNRRMKWKK
RLYMRYYSPTTRRYG
WFQNRRMKWKK
KMDCRPRPKCCKK
WKARRQCFRVLHHWN
IPLVVPLRRRRRRRRC
RRVWRRYRRQRWCRR
ACHGRRWGCGRHRGRCG
KKLFKKILKKL
ARRRRCSGSGSGCGSGSGSCGRRRR
GRKKRRQRRRPPQRKC
SRRARRSPRESGKKRKRKR
CVKRGLKLRHVRPRVTRDV
GRKKRRQRPPQC
CGRKKRRQRRAARPPQ
HHHHHHHHHHHHHHHHHHHHRRRRRRRRRRRRRRR
YGRKKRRQRRTALDWSWLQTE
MAPQRDTVGGRTTPPSWGPAKAQLRNSCA
ALWMRWYSPTTRRYG
GDVYADAAPDLFDFLDSSVTTARTINA
RHNFRFFFNFRTNR
GALFLAFLAAALSLMGLWSQPKKKRRV
DPKGDPKGVTVTVTVTVTGKGDPKPD
KKKKKKGGFLGFWRGENGRKTRSAYERMCILKGK
LRHLLRHLLRHLRHLLRHLRHLLRHLLRH
SRRRRRRRRR
GCGGGYGRKKRRQRRR
RRVTSWLGRQLRIAGKRLEGRSK
VHLPPPVHLPPPVHLPPP
YGRKKRRQRRTALDASALQTE
GRPRESGKKRKRKRLKP
GWTLNSAGYLLGKINLKAPAALAKKIL
MIIYRDLI
MVTVLFKRLRIRRACGPPRVKV
YGDCLPHLKLCKENKDCCSKKCKRRGTNIEKRCR
CRRLRHLRHHYRRRWHRFRC
AAVALLPAVLLALLAPEILLPNNYNAYESYKYPGMFIALSK
VQRKRQKLMP
LLRHLRRHIRRARRHIRR
DRDRDRDRDR
KCFQWQRNMRKVRGPPVSSIKR
RQIKIWFQNRRMKWK
LLIILRRRIRKQAHAASK
GKINLKALAALAKKIL
RLRLRLRLRLRLRLRLKRLKRLKRLKRLKKKKKKKGYK
KSICKTIPSNKPKKK
DFNKFHTFPQTAIGVGAP
KGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQ
KRIIQRILSRNS
IIYRDLISH
YRWRCKNQN
ISFDELLDYYGESGS
CKYGRKKRRQRRR
RVIRVWFQNKRCKDKK
RVIRWFQNKRSKDKK
LIIFRILISHKK
SWAQHLSLPPVL
YRRAARRAARA
KLAKLAKKLAKLAKGGRRRRRRR
FFGRRRRRRRGC
GKKKRKLSNRESAKRSR
KHKHKHKHKHKHKHKHKHKKLFKKILKYL
KLAAALLKKWKKLAAALL
DPVDTPNPTRRKPGK
ACRGRGRRCGSGRRSCG
RRRRRRRW
LLAILRRRIRKQAHAHSK
CGRKKRWWRQRRRPPQ
GPFHFYQFLFPPV
LKTLATALTKLAKTLTTL
RLLRLLRLL
HRHIRRQSLIML
ERKKRRRE
GGGARKKAAKAARKKAAKAARKKAAKAARKKAAKA
PKKKRKVWKLLQQFFGLM
CGRKKRWWRQRWWRWWRPPQ
LIIFAILISHKK
ACRDRFRNCPADERLCG
CRWRWKCCKK
QSPTDFTFPNPL
RRHLRRHLRHLRRHLRRHLRHL
YGRKKRRQRRRGC
AAVALLPAVLLALLAPRKKRRQRRRPPQ
AGYLLGHINLHHLAHLHHILC
PFVYLI
FDPFFWKYSPRD
CGRKKRAARQRRRPPQ
SRWRWKCCKK
CLLIILRRRIRKQAHAHSKNHQQQNPHQPPM
LIRLWSHLIHIWFQNRRLKWKKKC
CGGGYGRKKRRQRRR
KFLNRFWHWLQLKPGQPMY
RKKRRQARR
RWRCKNQN
MIIYRDKKSH
HRLRHALAHLLHKLKHLLHALAHRLRH
CGGGRRRRRRRRRLLLL
KRWRWKCCKK
RQIKIWFQNRRMKWKKDIMGEWGNEIFGAIAGFLG
GRCTKSIPPICFPD
RRRRRRRRR
ACSGRGRGCGSGSGSCG
ACRGRGRGCGRGRGRCG
CSIPPEVKFNKPFVYLI
GLFKALLKLLKSLWKLLLKAGGC
KHHWHHVRLPPPVRLPPPGNHHHHHH
GRKKRRQRRRPWQ
KGKKIFIMK
LKTLTETLKELTKTLTEL
YRQSHRRGGRRGSG
LILILILILILILILIKRKKRKKRKKRKKRAKRAKHSK
GRKKRRQRRRPPQC
FFLIPKGRRRRRRRRGC
RKKRRQR
GRGDGPRRKKKKGPRRKKKKGPRR
IWFQNRRMKWKK
QAASRVENYMHR
TKAARITPKDVIDVRSVTTEINT
RKKRRQRRRGGG
GRKKRRQRRRPPQY
CRWRWKCSKK
CASGQQGLLKLC
ACRGRGRGCGSGSGSCG
KLAKLAKKLAKLAKNYRWRCKNQN
KCRWRWKCCKK
GLWRALWRALWRSLWKLKWKV
KMIFVGIKKK
TCTWLKYH
GLPVCGETCVGGTCNTPGCTCSWPKCTRN
RVRILARFLRTRV
RIFIHFRIGC
RKKWFW
GYGNCRHFKQKPRRD
GRRRRRERNK
NRHFRFFFNFTNR
GGGGRRRRRRRRRLLLL
VKRFKKFFRKLKKSV
RRRRRRHHH
CRQIKIWFQNRRMKWKKKLAKLAKKLAKLAK
HHHHHHHHHHHHHHHHRRRRRRRRRRRRRRR
KMIFVGIKKKEERA
FFFFFFGRRRRRRRRGC
TLPSPLALLTVH
TRQARRNRRRRWRERQR
AGYLLGKLKALAALAKKIL
RQIRIWFQNRRMRWRRC
WWRRRRRRRR
IPSRWKDQFWKRWHY
KCGCRWRWKCGCKK
LLIILRRRIRKQAAAHSK
RQIKIFFQNRRMKFKK
RLRLRLRLRLRLRLRLKNNKNNKNNKNNKKKKKKKGYK
TCTWLKYHS
NYRWRCKN
AGYLLGKINLKALAALAKKILGGC
FFFFGRRRRRRRRGC
GKRKKKGKGLGKKRDPCLRKYK
RLWARWYSPTTRRYG
RQIKIWFQNRRAKWKK
RQIKIWFANRRMKWKK
TFPQTAIGVGAP
NTCTWLKYHS
RQIKIWFPNRRMKWKK
LHHLLHHLLHLLHHLLHHLHHL
RLIMRIYSPTTRRYG
RQIKIWFQNRRMKAKK
YTQDFNKFHTFPQTAIGVGAP
LIIFRILISHK
AYALCLTERQIKIWFANRRMKWKKEN
AAVALLPAVLLALLAPVQRKRQKLMP
YPRAARRAARR
NNNAAGRKRKKRT
EEEEEEEEPLGLAGRRRRRRRRN
HPGSPFPPEHRP
RQIKIWFQNRRMKWAK
WLKLWKKWLKLW
RLVMRVYSPTTRRYG
YGRKKRRQRRRYGRKKRRQRRR
ARRCSDRFRNCPADEALCGRR
MAARLCCQ
GWTLNSAGYLLGKFLPLILRKIVTAL
KLWMRWYSPTTRRYG
CGGKDCERRFSRSDQLKRHQRRHTGVKPFQ
DTWAGVEAIIRILQQLLFIHFR
RRIPNRRPRR
RKKRRQRRRGC
ARRCSGSGSGCGSGSGSCGRR
HEHEHEHEHEHEHEHEHEHEHEHEEFGGGGGYGRRRRRRGGGGGG
CRFRFKCCKK
SRRKRQRSNMRI
MIIYRD
GRKKRRQPPQC
DPATNPGPHFPR
KKICTRKPRFMSAWAQ
KLIKGRTPIKFGKADCDRPPKHSQNGM
LGISYGRKKRRQRRRPPQ
RQIKIWFQARRMKWKK
RRRQRRKKRGYCKCKYGRKKRRQRRR
GRQLRRAGRRLRGRSR
KCPSRRPKR
GKHRHERGHHRDRRER
YGRKKRRQRRRC
MVRRFLVTLRIRRACGPPRVRVFVVHIPRLTGEWAAP
RMKWKK
RLLRLLRRLLRLLRRLLRC
SKKKKTKV
ACRGRGRGCRGRGRGCG
PPRLPRPRPRPLPFPRPG
AGYLLGKTNLKALAALAKKIL
AQIKIWFQNRRMKWKK
RLWMRWYSPTTRRYA
KMDCRWRWKSCKK
RWRWRWRW
KIWFQNRRMKWKK
RRIRPRPPRLPRPRP
LLHILRRSIRRQAHAIRR
CRFRWKCCKK
GKKALKLAAKLLKKC
RQIKIWFQNRRMKWKKK
KKALLALALHHLAHLALHLALALKKA
CKDEPQRRSARLSAKPAPPKPEPKPKKAPAKK
AAAWFW
WEARLARALARALARHLARALARALRACEA
YKALRISRKLAK
KPRSKNPPKKPK
CGRKKRRQRRRPPQ
KLGLKLGLKGLKGGLKLG
GGVCPKILRRCRRDSDCPGACICRGNGYCGSGSR
IWRYSLASQQ
NIENSTLATPLS
KLALKLALKAWKAALKLA
EPDNWSLDFPRR
HEHEHEHEHEHEHEHEHEHEHEHEEFGGGGGYGRKKRRQRRR
TKRRITPDDVIDVRSVTTEINT
ALWKTLLKKVLKA
HEHEHEHEHEHEHEHEHEHEGGGGGKLALKLALKALKAALKLA
QQHLLIAINGYPRYN
QIISRDLISH
SRWRWKSCKK
NYRWRCK
SRRHHCRSKAKRARHH
TSPLNIHNGQKL
RQIKIWFQNRRMKWKKGC
AGYLLGKINLKKLAKLLLIL
KTVLLRKLLKLLVRKI
LAIILRRRIRKQAHAHSK
LAQLLAQLLAQLGGGGRRRRRRRRR
RKKRRQRRRRKKRRQRRR
GLWRALWRALWRSLWKSKRKV
FITKALGISYGRKKRR
ACSSSPSKHCG
GRCTRSIPPKCWPD
KLALKLALKWAKLALKAA
KMDCRWRWKCCKK
GLWRALWRALWRSLWKLKRKV
LNSAGYLLGKLKALAALAK
SRRHHCRSKAKRSAHH
RGGRLSYSRRRFSTSTGR
CVQWSLLRGYQPC
SARHHCRSKAKRSRHH
HQHKPPPLTNNW
MANLGCWMLVLFVATWSDLGLCKKRPKP
KLPCRSNTFLNIFRRKKPG
RKARRQRRR
TKRRITPKDVIDVRSVTTEINT
IYLATALAKWALKQGGRRRRRRR
RQIKIWFQ
KLWMRWYSATTRRYG
HSDAVFTDNYTALRKQMAVKKYLNSILNYGRKKRRQRRR
GRGDSPRRSPRR
SWWTPWHVHSES
KMDSRWRWKCCKK
WKQSHKKGGKKGSG
KLLKLLLKLWKKLLKLLKGGGRRRRRRR
YGRKKRRQRRRPPQG
GEQIAQLIAGYIDIILKKKKSK
GRKKRRQRRRPP
LTRNYEAWVPTP
ALWKTLLKKVLKAPKKKRKV
KRPTMRFRYTWNPMK
YQKQAKIMCS
GRRRRKRLSHRT
SRRHHARSKAKRSRHH
KRARNTEAARRSRARKLQRMKQGC
KMDCRWRWKKK
MIIYRALISHKK
RRWRRWNRFNRRRCR
KMDCRWRWKCSKK
MDCRWRWKCCKK
GRQLRIAGRRLRGRSR
GGVCPKILKACRRDSDCPGACICRGNGYCGSGSD
KLFMALVAFLRFLTIPPTAGILKRWGTI
QIKIWFQNRRMKWKK
LKRWGTIKKSKAINVLRGFRKEIGRMLNILNRRRR
KKWKMRRNQFWIKIQR
MIIYRDLIS
RRRRRRR
YKQCHKKGGKKGSG
MVKSKIGSWILVLFVAMWSDVGLCKKRPKP
KRIPNKKPGKKTTTKPTKK
GRKKRRQRRR
HHHHHHESGGGGSPGRRRRRRRRRRR
QNRRMKWKK
HALAHKLKHLLHRLRHLLHRHLRHALAH
CAYGGQQGGQGGG
RRRRRR
GGVCPKILKKCRRDSDCPGACICRGNGYCGSGSD
KLWMRWYSPWTRRYG
AKKRRQRRRAKKRRQRRR
KMDCRWRWKCKK
RLWMAWYSPTTRRYG
LIIFRIAASHKK
LLRARWRRRRSRRFR
GLWRALWRLLRSLWRLLWKA
RLWMRWYSPTTARYG
SRRAHCRSKAKRSRHH
WWWWRRRRRRRR
RKKARQRRR
GLKKLARLFHKLLKLGC
RQIKIWFQNR
IKIKIKIKIKIKIKIKKLAKLAKLAKLAKLAKLAKKIK
RQIKIWFQNMRRKWKK
AAVALLPAVLLALLAVTDQLGEDFFAVDLEAFLQEFGLLPEKE
RGSRRAVTRAQRRDGRRRRRSRRESYSVYVYRVLRQ
SAETVESCLAKSH
GWTLNPPGYLLGKINLKALAALAKKIL
RRWWRRWRR
NRRMKWKK
PPKKSAQCLRYKKPE
AGYLLGKINLKALAALAKKILTYADFIASGRTGRRNAI
CGRKKRRQRRWWRPPQ
CYGRKKRRQRRR
RLWMRWYAPTTRRYG
DSLKSYWYLQKFSWR
GRKGKHKRKKLP
RQIKIWFQNRRMKWKKRQIKIWFQNRRMKWK
GLWRALWRLLRSLWRLLWSQPKKKRKV
RKKRRARRR
GWTLNSAGYLLGKLKALAALAKKIL
RLWMRWYSPTARRYG
RRRRNRTRRNRRRVRGC
SRRHHCRSKAKASRHH
KETWWETWWTEWSQPKKKRKV
KDCERRFSRSDQLKRHQRRHTGVKPFQK
ACSHSGWGCGHGSWSCGRRRRRRRR
LIIFRILISHHH
MIISRDLISH
RAIKIWFQNRRMKWKK
WEAKLAKALAKALAKHLAKALAKALKACEA
KRIHPRLTRSIR
KSHAHAQKRIRRRLIILL
CRQIKIWFQNRRMKWKK
RQIKIWFQNRRMKWKA
FHFHFRFR
KCFQWQRNMRKVR
RLHLRLHLRHLRHHLRLH
YGRKKRRQRRRQRRRPTAPLSPMSP
RRRRRHHH
RAGLQFPVGRVHRLLRK
GAYDLRRRERQSRLRRRERQSR
GWTLNSAGYLLGPHAVGNHRSFSDKNGLTS
LLKTTELLKTTELLKTTE
MAIYRDLIS
NTCTWLKYH
AYGRKKRRQRRR
GRCTKSIPPICFPA
KGRKKRRQRRRPPQ
MIIYADLIS
AEKVDPVKLNLTLSAAAEALTGLGDK
IYLATALAKWALKQGFGGRRRRRRR
GLLEALAELLEGLRKRLRKFRNKIKEK
ARRARAARRARAARRARAARRARAARRARA
NYQRRCKNQN
RGDRGDRRDLRLDRGDLRC
KALKKLLAKWLAAAKALL
KRPAAIKKAGQAKKKK
MAMPGEPRRANVMAHKLEPASLQLRNSCA
MGLGLHLLVLAAALQGAWSQPKKKRKV
VNADIKATTVFGGKYVSLTTP
KLALKLALKALKAALK
ACSGSGSGCGSGSGSCGRRRRRRRR
GRKKRRQARAPPQC
KMDSRWRWKSSKK
WKCRRQAFRVLHHWN
RRRRRRRRRR
RKKAAA
RLAMRWYSPTTRRYG
LLIALRRRIRKQAHAHSK
GWTLNSAGYLLGKINLKALAALAKKLL
RRLLRRLRR
RRRQKRIVVRRRLIR
AAVALLPAVLLALLAKKNNLKDCGLF
YGRKKRRQRRRGTALDWSWLQTE
KALAALLKKLAKLLAALK
LTMPSDLQPVLW
RKKRRRESRRARRSPRHL
LIIFRILISHR
GLKKLAELFHKLLKLGC
GIGKFLHSAKKWGKAFVGQIMNC
KWFETWFTEWPKKRK
GGRRARRRRRR
MIIYRDLAS
ARRRAARAARRRAARAARRRAARAARRRAARA
CTWLKYH
RFTFHFRFEFTFHFE
CGNKRTR
CRWRWKCGCKK
RRRRRRRRRC
WIIFRIAASHKK
GLFEALLELLESLWELLLEA
ECYPKKGQDP
GNYAHRVGAGAPVWL
RKLTTIFPLNWKYRKALSLG
CGGMVTVLFRRLRIRRASGPPRVRV
WEAALAEALAEALAEHLAEALAEALEALAA
TPWWRLWTKWHHKRRDLPRKPEGC
FITKALGISYGRKKRRQRRRPPQ
GTKMIFVGIKKKEERADLIAYLKKA
LLGDFFRKSKEKIGKEFKRIVQRIKDFLRNLVPRTESC
KLIKGRTPIKFGKADCDRPPKHSGK
MIIYRAEISH
CRQIKIWFPNRRMKWKKC
KKALLAHALHLLALLALHLAHALKKA
PIRRRKKLRRLK
ACSGRGSGCGSGRGSCG
QLALQLALQALQAALQLA
WRFKWRFK
LNSAGYLLGKINLKALAALAKKIL
YTFGLKTSFNVQYTFGLKTSFNVQ
YYYAAGRKRKKRT
SRRHACRSKAKRSRHH
YGRKKRRQRRR
KKKEERADLIAYLKKA
RGPRRQPRRHRRPRR
RQIKIAFQNRRMKWKK
IRQRRRR
GRKKRRQRRRPQ
KLAKLAKKLAKLAKGRKKRRQRRRP
MIIARDLIS
RNRSRHRR
RRQRRTSKLMKR
CELAGIGILTVRKKRRQRRR
PLSSIFSRIGDP
RIMRILRILKLAR
CARSKNKDC
KKKKKKKKKKKKKKKKKKK
ERERERERERERER
RRRQRRKRGGDIMGEWGNEIFGAIAGFLG
RLWMRAYSPTTRRYG
LKKLLKLLKKLLKLAG
CTSTTAKRKKRKLK
KRRQRRR
KWFETWFTEWPKKRKGGC
LAELLAELLAELGGGGRRRRRRRRR
SRWRWKSSKK
RIKAERKRMRNRIAASKSRKRKLERIARGC
RQARRNRRRC
KMDCRWRPKCCKK
GLWRALWRALRSLWKLKRKV
CSKSSDYQC
RQIKIWFQNARMKWKK
RRGRRG
YNNFAYSVFL
RQIKIWFQNRR
RKKRRQRRRHRRKKR
NTGTWLKYHS
GGVCPRILRRCRRDSDCPGACICRGNGYCGSGSK
HYRIKPTARRLKWKYKGKFW
LLKKRKVVRLIKFLLK
RGDGPRRRPRKRRGR
SFHQFARATLAS
HHHRRRRRRRRRHHH
YRFKYRFKYRLFK
QWQRNMRKVRGPPVSCIKR
VRLPPP
LCLRPVG
FTYKNFFWLPEL
RRRRRRRGGIYLATALAKWALKQGF
WIIFRAAASHKK
RLWMRWYSPRTRAYG
RGDRLDRRDLRLDRRDLRC
PRPLPFPRPG
RVRVFVVHIPRLT
ARCSGSGSGCGSGSGSCGR
EARPALLTSRLRFIPK
KWCFRVCYRGICYRRCRGK
LLIIARRRIRKQAHAHSK
RTRRNRRRVR
GLRKRLRKFRNKIKEK
SRRHHCRSKAARSRHH
TPKTMTQTYDFS
PRPRPLPFPRPG
YGRKKRPQRRR
LLIILRRRIRRRARARSR
YKRAARRAARR
GLPRRRRRRRRR
LNVPPSWFLSQR
YAREARRAARR
GSRVQIRCRFRNSTR
YARVRRRGPRR
KMDCRWRWKSSKK
PKKKRKVALWKTLLKKVLKA
FQFNFQFNGGGHRRRRRRR
CRKARYRGRKRQR
RWRRWRRWRRWR
WLRRIKAWLRRIKALNRQLGVAA
SSSIFPPWLSFF
CRRRRRRRR
GLPVCGETCVGGTCNTPGCKCSWPVCTRN
GALFLGFLGAAGSTMGAWSQPKKKRKV
WRWRWRWRWRWRWR
LKKLAELAHKLLKLG
KCCKWRWRCK
MIIFKIAASHKK
GRQLRRAGRRLRRRSR
LIIFAIAASHKK
LIIFRILISHRR
RRLSYSRRRF
CCTGRKKRRQRRR
WRFKWRFKWRFK
MAARLCCQLDPARDVLCLRP
SRWRWKCSKK
AEAEAEAEAKAKAKAKAGGGHRRRRRRR
HHHRRRRRRRR
GRKKRRQRRRPPQGRKKRRQRRRPPQGRKKRRQRRRPPQ
RQIKAWFQNRRMKWKK
NYRRRCKNQN
NYTTYKSHFQDR
KAFAKLAARLYRKALARQLGVAA
VRRFLVTLRIRRA
FRVPLRIRPCVVAPRLVMVRHTFGRIARWVAGPLETR
RFTFHFRFEFTFHFEGGGRRRRRRR
RQIAIWFQNRRMKWKK
RLRLRLRLRLRLRLRLKLLKLLKLLKLLKKKKKKKGYK
RRARRPRRLRPAPGR
KKTTTKPTKK
DAATATRGRSAASRPTQRPRAPARSASRPRRPVE
LRRERQSRLRRERQSR
KLWSAWPSLWSSLWKP
RQIKIWFQNRRMKWKK
RRRRRRRRC
LLIILRRRIRKQAHAHSKNHQQQNPHQPPM
CLLYWFRRRHRHHRRRHRRC
YGRAARRAARR
ARRRCSDRFRNCPADEALCGRRR
GRKKRRQRRRCG
RKKNPNCRRH
RILQQLLFIHF
EEEAAGRKRKKRT
RVRSWLGRQLRIAGKRLEGRSK
FKKFRKF
RQARRNRRRALWKTLLKKVLKA
RRRRRRRRRGPGVTWTPQAWFQWV
ASMWERVKSIIKSSLAAASNI
MIIFAIAASHKK
VSRRRRRRGGRRRRK
CHHHHHRRRRRRRRRHHHHHC
GLKKLARLAHKLLKLGC
GYGRKKRRQRRRG
SHAFTWPTYLQL
GKRKKKGKLGKKRDP
IPLVVPLC
AHALCPPERQIKIWFQNRRMKWKKEN
MDAQTRRRERRAEKQAQWKAANGC
MHKRPTTPSRKM
CRGDKGPDC
LRHHLRHLLRHLRHLLRHLRHHLRHLLRH
RQIKIWFQNRAMKWKK
KLALQLALQALQAALQLA
TKRRITPKDVIDVESVTTEINT
LPHPVLHMGPLR
RRRQRRKKR
CVSRRRRRRGGRRRR
KALAALLKKWAKLLAALK
MGLGLHLLVLAAALQGAKKKRKV
VLGQSGYLMPMR
GRKKRRQRRRP
GGVCPKILRRCRRDSDCPGACICRGNGYCGSGSD
KLIKGRTPIKFGK
HEHEHEHEHEHEHEHEHEHEHEHEEFGGGGGYGRGRGRGRGRGRG
KLLAKAAKKWLLLALKAA
WWWRRRRRRRR
GGVCPKILAACRRDSDCPGACICRGNGYCGSGSD
RLWMRWYSPATRRYG
PSSSSSSRIGDP
MIIYARRAEE
GRKKRRQRRPPQC
VKRKKKPALWKTLLKKVLKA
LLKLLKKLLKLLKKLLKLL
KLLAKAALKWLLKALKAA
GGVCPKILAKCRRDSDCPGACICRGNGYCGSGSD
RQPKIWFPNRRKPWKK
RRRRRRRRRHHH
LIKKALAALAKLNI
FIIFRIAASHKK
LLIILRRRIRKQAHAHSA
RIRMIQNLIKKT
SWLPYPWHVPSS
GIGKFLHSAKKFGKAFVGEIMNSGGKKWKMRRNQFWVKVQRG
TKRRITPKDVIDVRSVTTKINT
VELPPPVELPPPVELPPP
RHHRRHHRRHRRHHRRHHRHHR
YRRRRRRRRRR
KHKLLHLLHLLALLWLHLLHLLKHK
VKLPPP
RGERGERRELRLERGELRC
HARIKPTFRRLKWKYKGKFW
WIIFRIAAYHKK
YSHIATLPFTPT
YRWRCKNQ
ALIILRRRIRKQAHAHSK
RQIKIFFQNRRMKWKK
TRRSKRRSHRKF
VRLPPPVRLPPPVRLPPP
LKKLCKLLKKLCKLAG
WIIFKIAASHKK
LALALALALALALALAKIKKIKKIKKIKKLAKLAKKIK
LDTYSPELFCTIRNFYDADRPDRGAAA
GRRERNKMAAAKCRNRRR
YKQSHKKGGKKGSG
ACRDRFRNCPADEALCG
GKRKKKGKLGKKRPRSR
RRRRRRRHHH
FAPWDTASFMLG
FIRIGC
KLALKLALKALKAA
RLWMRWYSPWTRRYG
YGRGGRRGRRR
AAVALLPAVLLALLAKKNNLKECGLY
GRKKRRQRRRPPQTYADFIASGRTGRRNAI
KWFRVYRGIYRRRGK
GRRRRATAKYRTAH
FQWQRNMRKVRGPPVS
GKRVAKRKLIEQNRERRR
GRGDSPRRKKKKSPRRKKKKSPRR
KLIKGRTPIKFGKARCRRPPKHSGK
HEHEHEHEHEHEHEHEHEHEEFGGGGGYGRGRGRGRGRGRG
MIIFRALISHKK
GALFLAFLAAALSLMGLWSQPKKKRKV
MTPSSLSTLPWP
VSKQPYYMWNGN
EKGKKIFIMK
HYRIKPTFRRLKWKYKGKFA
MIIRRDLISE
CHHRRRRHHC
MANLGYWLLALFVTMWTDVGLCKKRPKP
QTRRRERRAEKQAQW
VHLPPPVHLPPP
ARRRRCSDRFRNCPADEALCGRRRR
DRDDRDDRDDRDDRDDR
WIIFRALISHKK
RLWRALPRVLRRLLRP
LILIGRRRRRRRRGC
APWHLSSQYSRT
KNAWKHSSCHHRHQI
RRRRRRRRRRRRRRRRGC
HFAAWGGWSLVH
LIRLWSHLIHIWFQNRRLKWKKK
AKKKAAKAAKKKAAKAAKKKAAKAAKKKAAKA
RRRRRRRGGIYLATALAKWALKQ
KALAKALAKLWKALAKAA
RRRRRRRRHHH
KMDSRWRWKSCKK
TVDNPASTTNKDKLFAVRK
ACSHSGHGCGHGSHSCGRRRRRRRR
KFFKFFKFFK
GLGDKFGESIVNANTVLDDLNSRMPQSRHDIQQL
LLIILRRRIRKAAHAHSK
GLWRALWRGLRSLWKLKRKV
ARRRAARAARRRAARAARRRAARA
KLALKLALKALKAALKLAGC
RILQQLLFIHFRIGC
RKKRRRESWVHLPPPVHLPPPGGHHHHHH
PSKRLLHNNLRR
GRGDSPRR
GRCTKSIPPICWPK
ADVFDRGGPYLQRGVADLVPTATLLDTYSP
GLFRALLRLLRSLWRLLLRA
LLHILRRSIRKQAHAIRK
RLHHRLHRRLHRLHRRLHRLHHRLHRRLH
GRQLRIAGRRLRRRSR
RRRERRAEK
KWFKIQMQIRRWKNKR
FQPYDHPAEVSY
GWTLNSKINLKALAALAKKIL
KRVSRNKSEKKRR
LALALALALALALALAKKLKKLKKLKKLKKLKKLKYAK
KRRIRRERNKMAAAKSRNRRRELTDTGC
RQRSRRRPLNIR
RQIKIQFQNRRKWKK
AAVALLPAVLLALLAPRKKRRQRRRPPQC
CSIPPEVKFNPFVYLI
GLFKALLKLLKSLWKLLLKA
TRQARRNRRRRWRERQRGC
AHALCLTERQIKIWFQNRRMKWKKEN
RAWMRWYSPTTRRYG
PPRLRKRRQLNM
GALFLGWLGAAGSTMGAPKKKRKV
WEARLARALARALARHLARALARA
GSPWGLQHHPPRT
GRRRRRRRRRPPQ
RLHRRLHRRLHRLHR
RLWMRWYSPTTRAYG
CGRKKRRQRLLRRPPQ
IAWVKAFIRKLRKGPLG
KCFQWQRNMRKVRGPPVSCIKR
KALKLKLALALLAKLKLA
AIPNNQLGFPFK
PQNRLQIRRHSK
RHHLRHLRRHLRHLLRHLRHHL
VQLRRRWC
GRKKRRQRRRMVSAL
RQIKIWFQN
YGRKKRRQRRRAYFNGCSSPTAPLSPMSP
YIVLRRRRKRVNTKRS
CGGGARKKAAKAARKKAAKAARKKAAKAARKKAAKA
CGNKRTRGC
RLPRPRPRPLPFPRPG
FVTRGCPRRLVARLIRVMVPRR
LLYWFRRRHRHHRRRHRR
RRRRRRRGGKLAKLAKKLAKLAK
RKKRRRESRKKRRRESC
CGRKKRAARQRAARAARPPQ
PNTRVRPDVSF
RGGRLAYLRRRWAVLGR
KLLKLLLKLWKKLLKLLK
RVTSWLGRQLRIAGKRLEGRSK
VKLPPPVKLPPPVKLPPP
RRRRRRRR
GKRRRRATAKYRSAH
CTWLKY
CIISRDLISH
KKPTIKTTKK
GRKKRRQRRRC
LNSAGYLLGKLKALAALAKIL
YRDRFAFQPH
KKPGKKTTTKPTKK
ACRGRRRGCGRRRGRCG
LIIFRILISH
HIQLSPFSQSWR
GKKKKKKKKK
GKYVSLTTPKNPTKRRITPKDV
GSGKKGGKKICQKY
RRMKWKK
GKKKKRKREKL
LALALALALALALAKLAKLAKLAKLAKIKKIKKKIK
RLHHRLHRRLHRLHR
GKRARNTEAARRSRARKL
LDITPFLSLTLP
KRIPNKKPGKKTTTKPTKKPTIKTTKKDLKPQTTKPK
MIIYRDLISKK
RLALRLALRALRAALRLA
FKKLALHALHLLALLWLHLAHLALKK
HHHHHHHHRRRRRRRRRRRRRRR
KRIPNKKPGKKT
KRIPNKKPGKKTTTKPTKKPTIKTTKKDLK
RRRRRRRRRRR
SGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGC
YGRRARRAARR
HILPWKWPWWPWRR
RQIKIWFQNRRM
RQIRIWFQNRRMRWRR
SHNWLPLWPLRP
QRIRKSKISRTL
GRKKRRQRRRPPQ
KMDSRWRWKCSKK
TAKTRYKARRAELIAERRGC
YPYDANHTRSPT
CRGDKGDPC
LGLLLRHLRHHSNLLANI
TKRRITPKKVIDVRSVTTEINT
GLWRALWRLLRSLWRLLWRA
RPARPAR
LLIILRRRIARKQAHAHSK
MIIYRDLISH
SMLKRNHSTSNR
SGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKG
GRPRESGKKRKRKRLKP
KKYRGRKRHPR
GRKAARAPGRRKQ
RRRRRRRRRRRR
CRRRRRRRRRRRRC
CRRRRRRCRRRRRR
RRRRRRCRRRRRRC
CRRRRRRCRRRRRRC
CRRRRRRCCRRRRRRC
RXRRBRRXRRBR
CRXRRBRRXRRBRC
RXRRBRCRXRRBRC
RQIKIWFQNRRMKWKK
CQIKIWFCNKRAKIKK
SQIKIWFQCKRAKIKC
CSQIKIWFQNKRAKIKKC
LLIILRRRIRKQAHAHSK
LLIILRRRIRKQAHAHSKRXRRBRRXRRBR
CRLRWRC
IWIAQELRRIGDEFNAYYARR
RRIRPRPPRLPRPRPRPLPFPRPG
TRSSRAGLQWPVGRVHRLLRK
RGGRLSYSRRRFSTSTGR
ALWKTLLKKVLKAPKKKRKV
KLIKGRTPIKFGKADCDRPPKHSQNGMGK
PLSSIFSRIGDP
KLALKALKALKAALKLA
RRWWRRWRR
LKTLTETLKELTKTLTEL
VRLPPPVRLPPPVRLPPP
FKIYDKKVRTRVVKH
KGTYKKKLMRIPLKGT
LYKKGPAKKGRPPLRGWFH
IAWVKAFIRKLRKGPLG
GSPWGLQHHPPRT
RQVTIWSQNRRVKSKK
VSALK
PPRPPRPPR
PPRPPRPPRPPR
RLRWR
GAYDLRRRERQSRLRRRERQSR
RKKRRQRRR
RQIKIWFQNRRMKWKK
RRRRRRRRR
RSVTIWFQSRRVKEKK
KRVKAGYLLGKINLKALAALAKKIL
AGYLLGKINLKALAALAKKILKRVK
PKKKRKVAGYLLGKINLKALAALAKKIL
SDGTLAVPFKA