抓序列出错,请教,谢(正则表达相关?)
chenhao392
|
1#
chenhao392 发表于 2007-10-28 13:27
抓序列出错,请教,谢(正则表达相关?)
#!usr/bin/perl -w
open PPI,"test_ppi_yeast" or die"hell!"; open PROTEIN,"yeast_protein" or die"hell!"; our @PPI=<PI>; our @protein=<ROTEIN>; close PROTEIN,or die"hell"; close PPI or die"hell"; for($q=0;$q<@PPI;$q++){ @temp_array=~split('\t',$PPI[$q]); #print"$temp_array[0]$temp_array[1]"; #die; $name="$q._$temp_array[0]_$temp_array[1]"; open DATA,">$name"or die"hell"; for($e=0;$e<=@protein;$e++){ if($protein[$e]=~/$temp_array[0]/){ print"$protein[$e]"; die; print DATA "$protein[$e]"; for($w=1;$w<=500;$w++){ $protein[$e+$w]=~/^[A-Z]/? print DATA "$protein[$e+$w]": last; } } } for($r=0;$r<=@protein;$r++){ if($protein[$r]=~/$temp_array[1]/){ print DATA "$protein[$r]"; for($w=1;$w<=500;$w++){ if($protein[$r+$w]=~/^[A-Z]/){ print DATA "$protein[$r+$w]"; } elsif($protein[$r+$w]!=/^[A-Z]/){ last; } } } else{ last; } } close DATA or die"hell"; } ppi的文件如同: YMR056C YBR217W YMR056C YNR056C YMR056C YJL124C 实际proteins如同: >YAL001C TFC3 SGDID:S000000001, Chr I from 151168-151099,151008-147596, reverse complement, Verified ORF, "Largest of six subunits of the RNA polymerase III transcription initiation factor complex (TFIIIC); part of the TauB domain of TFIIIC that binds DNA at the BoxB promoter sites of tRNA and similar genes; cooperates with Tfc6p in DNA binding" MVLTIYPDELVQIVSDKIASNKGKITLNQLWDISGKYFDLSDKKVKQFVLSCVILKKDIE VYCDGAITTKNVTDIIGDANHSYSVGITEDSLWTLLTGYTKKESTIGNSAFELLLEVAKS GEKGINTMDLAQVTGQDPRSVTGRIKKINHLLTSSQLIYKGHVVKQLKLKKFSHDGVDSN PYINIRDHLATIVEVVKRSKNGIRQIIDLKRELKFDKEKRLSKAFIAAIAWLDEKEYLKK VLVVSPKNPAIKIRCVKYVKDIPDSKGSPSFEYDSNSADEDSVSDSKAAFEDEDLVEGLD NFNATDLLQNQGLVMEEKEDAVKNEVLLNRFYPLQNQTYDIADKSGLKGISTMDVVNRIT GKEFQRAFTKSSEYYLESVDKQKENTGGYRLFRIYDFEGKKKFFRLFTAQNFQKLTNAED EISVPKGFDELGKSRTDLKTLNEDNFVALNNTVRFTTDSDGQDIFFWHGELKIPPNSKKT PNKNKRKRQVKNSTNASVAGNISNPKRIKLEQHVSTAQEPKSAEDSPSSNGGTVVKGKVV NFGGFSARSLRSLQRQRAILKVMNTIGGVAYLREQFYESVSKYMGSTTTLDKKTVRGDVD LMVESEKLGARTEPVSGRKIIFLPTVGEDAIQRYILKEKDSKKATFTDVIHDTEIYFFDQ TEKNRFHRGKKSVERIRKFQNRQKNAKIKASDDAISKKSTSVNVSDGKIKRRDKKVSAGR TTVVVENTKEDKTVYHAGTKDGVQALIRAVVVTKSIKNEIMWDKITKLFPNNSLDNLKKK WTARRVRMGHSGWRAYVDKWKKMLVLAIKSEKISLRDVEELDLIKLLDIWTSFDEKEIKR PLFLYKNYEENRKKFTLVRDDTLTHSGNDLAMSSMIQREISSLKKTYTRKISASTKDLSK SQSDDYIRTVIRSILIESPSTTRNEIEALKNVGNESIDNVIMDMAKEKQIYLHGSKLECT DTLPDILENRGNYKDFGVAFQYRCKVNELLEAGNAIVINQEPSDISSWVLIDLISGELLN MDVIPMVRNVRPLTYTSRRFEIRTLTPPLIIYANSQTKLNTARKSAVKVPLGKPFSRLWV NGSGSIRPNIWKQVVTMVVNEIIFHPGITLSRLQSRCREVLSLHEISEICKWLLERQVLI TTDFDGYWVNHNWYSIYEST* 。。。。。。 报错如下: Applying pattern match (m//) to @array will act on scalar(@array) at blast_data_getting(yeast).pl line 10. Use of implicit split to @_ is deprecated at blast_data_getting(yeast).pl line 10. Applying pattern match (m//) to @array will act on scalar(@array) at blast_data_getting(yeast).pl line 10. Use of uninitialized value in concatenation (.) or string at blast_data_getting(yeast).pl line 13. Use of uninitialized value in concatenation (.) or string at blast_data_getting(yeast).pl line 13. Use of uninitialized value in regexp compilation at blast_data_getting(yeast).pl line 16. Died at blast_data_getting(yeast).pl line 18. >YAL001C TFC3 SGDID:S000000001, Chr I from 151168-151099,151008-147596, reverse complement, Verified ORF, "Largest of six subunits of the RNA polymerase III transcription initiation factor complex (TFIIIC); part of the TauB domain of TFIIIC that binds DNA at the BoxB promoter sites of tRNA and similar genes; cooperates with Tfc6p in DNA binding" 请教。。。十分感谢 |