|  | Whole genome sequencing |  | RNA-Seq | ||||
---|---|---|---|---|---|---|---|---|
 |  | 2011 cost | output | 2011 time |  | 2011 cost | output | 2011 time |
Sample collection and experimental design | from blood samples (easy to collect) to brain tissue (hard to collect) | ~$100 onwards | Â | from a few hours to several days | same as for whole genome sequencing | |||
Sequencing | library preparation + running the sequencer (whole dual flow cell) | ~$6500 = ~$500 + ~$6000 | ~380 M reads/lane; 1 individual: ~1140 M total reads (~3 lanes for a 30 × coverage); ~250 Gb (intermediate files) | ~11-12 day | library preparation + running the sequencer (whole dual flow cell) | ~$3300 = ~$300 + ~$3000 | ~380 M reads/lane | ~12-14 day |
 | Data storage, low-level processing |  |  |  |  |  |  |  |
 | Alignment (transfer* and storing raw data + mapping) | ~$40 = ~$33 + ~$7 | 300 Gb (BAM file) | ~1/2 day *** (including transferring 250 Gb FASTQ ~7.5 hrs) | Alignment (transfer* and storing raw data + mapping) | ~$5 = ~$3 + ~$2 | ~30 Gb (BAM); ~22 Gb (MRF) | < 2 hrs *** |
 | (data transfer and storage for 10 days)*; ** | ~$40 |  | ~8.5 hrs | (data transfer and storage for 10 days)*; ** | < $4 |  | < 1 hr |
Data reduction and management | High-level summaries*** | Â | Â | Â | Â | Â | Â | Â |
 | SNP calling (compute + transfer out) | < $5 = ~$4 + ~$0.60 | < 1 Gb | ~3 hrs | Gene and exon expression quantification | < $1 | < 1 M | < 1 hr (1 CPU) |
 | Indel calling (compute + transfer out) | < $35 = ~$32 + ~$0.60 | < 1 Gb | ~1 day | Isoform quantification | ~$6 | < 1 M | ~4 h |
 | SV calling (compute + transfer out) | < $35 = ~$32 + ~$0.60 | < 1 Gb | ~1 day |  |  |  |  |
Downstream analyses | Â | > $100 K | ~310 Gb | months | Â | > $100 K | ~30 Gb | months |
Total of sequencing, data management and reduction | ~$6500 | ~310 Gb | ~15 days | Â | ~3500 | ~30 Gb | ~12-14 days |