BarSeq Analysis

Web supplement to
"BarSeq : Identification and Counting of Short Molecular Barcodes in HT-Seq data"





Script Usage

Two script files are included.
  1. barseq.pl : the main script, which is called with various parameters
  2. barseq.pm : library of functions, loaded into barseq.pl


In general, successive calls to the main script are made, calling the appropriate function and redirecting output to a file, which serves as input for the next step in the pipeline

4 functions can be called, listed in order
  1. barseq.pl count : input is from a qseq or fastq read file, output is a list of trimmed/truncated reads and their frequency within the file after quality filtering
    eg. SH: barseq.pl count het_test_qseq.txt > het_test_readcounts.txt
  2. barseq.pl cluster : input is the readcounts file from 1. Clusters sequences based on mm scoring.
    eg. SH: barseq.pl cluster het_test_readcounts.txt > het_test_clusters.txt
  3. barseq.pl map : input is the cluster file from 2, and a list of expected barcodes. This will map the barcodes to the clusters.
    eg. SH: barseq.pl map het_test_clusters.txt HET.txt > het_test_map.txt
  4. barseq.pl tabulate : input is the map file from 2, and a list of expected barcodes. This will generate a list of barcodes and the counts for each, with indications on how the mapping was assessed.
    eg. SH: barseq.pl tabulate het_test_map.txt HET.txt > het_test_mappedbarcodes.txt


Bucket EvaluationsInquiries can be addressed to guri.giaever@utoronto.ca OR corey.nislow@utoronto.ca