Skip to the content.

Course description

Programming skills and software tools for building automated bioinformatics pipelines and computational biology analyses. Emphasis on UNIX tools and R libraries for distilling raw sequencing data into interpretable results. This course is aimed at students familiar with UNIX and with some programming experience in python, R, or C/C++.

Instructional staff

Please click on the links above for email addresses.

Meeting times and locations

Classes:

Monday and Wednesday, 9:00-10:20 am, Foege S110 (http://www.washington.edu/home/maps/southcentral.html?gnom).

Class Slack:

We will use Slack during class and outside of class to communicate, share code snippets, ask and answer questions. The class slack is here:

You will receive an invitation to join prior to the first class.

Office hours:

Prerequisites

Course requirements

Setting up your computer

Examinations

There will be no examinations.

Course grade

Grades will come 50% from the programming projects and 50% from class participation.

Course materials

We will read from several online resources and tutorials. I strongly encourage you to read all of the material in the following:

Specific, selected readings for the course will be listed in the course schedule below.

Helpful software

Class schedule

Date Topic Reading
3/31 Course overview, student setup, and version control pdf Git Basics;
4/2 Intro to bioinformatics pipelines, automation pdf Cao et al; Packer et al
4/9 Read alignment pdf SAM format; bedtools; STAR; STARsolo
4/14 Workflow automation pdf Essential UNIX; BASH basics (sections 1-7); Snakemake
4/21 Exploratory data analysis pdf R for Data Science (Chapter 13 especially)
4/23 Electronic lab notebooks with R Markdown pdf R for Data Science (Chapter 27); R Markdown (chapter 3)

Example files