“sh”, e.g.“runSTAR.sh”. If the file is made on a Windows computer, you need to make sure to
save the file as a LINUX style text file. From NotePad++, used the "Edit -> EOL Conversion ->
UNIX" option. If you are not sure about this, after uploading the script to Linux, run command
“dos2unit runSTAR.sh” to convert to LINUX text file.
You can use FileZilla(win & mac) to upload the file to your home directory. To make things
easier, both software include a function to directly save edited file to a remote LINUX machine.
Here are the lines in your shell script.
You can also use the shell script that we have prepared for you. It is located in the data
directory with the file name “runSTAR.sh”;
•
In these commands, I set --runThreadN to 2. You would want to increase the number
in real work.
•
You might want to run multiple jobs in parallel. Read the instructions at
https://biohpc.cornell.edu/lab/doc/using_BioHPC_CPUs.pdf for using BioHPC computer
efficiently, or get help form our office hours.
To run the shell script, start “screen”, and in a screen session run these commands:
STAR --quantMode GeneCounts --genomeDir genome --runThreadN 2 --readFilesIn
ERR458493.fastq.gz --readFilesCommand zcat --outFileNamePrefix wt1_ --
outFilterMultimapNmax 1 --outFilterMismatchNmax 2 --outSAMtype BAM SortedByCoordinate
STAR --quantMode GeneCounts --genomeDir genome --runThreadN 2 --readFilesIn
ERR458494.fastq.gz --readFilesCommand zcat --outFileNamePrefix wt2_ --
outFilterMultimapNmax 1 --outFilterMismatchNmax 2 --outSAMtype BAM SortedByCoordinate
STAR --quantMode GeneCounts --genomeDir genome --runThreadN 2 --readFilesIn
ERR458495.fastq.gz --readFilesCommand zcat --outFileNamePrefix wt3_ --
outFilterMultimapNmax 1 --outFilterMismatchNmax 2 --outSAMtype BAM SortedByCoordinate
STAR --quantMode GeneCounts --genomeDir genome --runThreadN 2 --readFilesIn
ERR458500.fastq.gz --readFilesCommand zcat --outFileNamePrefix mu1_ --
outFilterMultimapNmax 1 --outFilterMismatchNmax 2 --outSAMtype BAM SortedByCoordinate
STAR --quantMode GeneCounts --genomeDir genome --runThreadN 2 --readFilesIn
ERR458501.fastq.gz --readFilesCommand zcat --outFileNamePrefix mu2_ --
outFilterMultimapNmax 1 --outFilterMismatchNmax 2 --outSAMtype BAM SortedByCoordinate
STAR --quantMode GeneCounts --genomeDir genome --runThreadN 2 --readFilesIn
ERR458502.fastq.gz --readFilesCommand zcat --outFileNamePrefix mu3_ --
outFilterMultimapNmax 1 --outFilterMismatchNmax 2 --outSAMtype BAM SortedByCoordinate
cd /workdir/<your_User_ID>
export PATH=/programs/STAR:$PATH
sh runSTAR.sh >& log &