This minicourse in databases and SQL is part of Computational Statistics and Operations Research, a course held in the spring semester of 2019 at Aalborg University for 2nd semester Operations Research master students.

The goal of the course is not to be a comprehensive course in SQL, but rather to teach the student enough of the basics to enable them to use SQL databases as part of their data analysis pipeline, in particular via the programming language R.

Contents

Each lecture consists of a slideshow presentation of new concepts, some live examples, and individual exercises. You should therefore bring a laptop to each lecture; see below for programs to install beforehand.

Click through below for detailed descriptions of and material for each session.

  1. Introduction to databases
  2. Aggregation and JOINs
  3. Databases in R
  4. Management and performance
  5. Self study session

For more info on how to use the slides, check out the slides overview page.

Evaluation

To pass the minicourse, you must hand in a satisfactory solution to the self study exercise.

Prerequisites and preparation

Some experience with the programming language R is required.

Before the first lecture:

  1. Install the following on your laptop:
  2. Make sure your postgres server is running and that you can connect to it in pgAdmin

For a quick guide to installing and setting up PostgreSQL Server and pgAdmin 4, see the installation and setup guide.

If you have problems, please contact me at janus@math.aau.dk.

Literature and resources

The following resources will be used and referenced throughout the lectures:

[D&W]
Joshua D. Drake and John C. Worsley. Practical PostgreSQL. O’Reilly Media, Inc. ISBN 978-1-56592-846-6. Available at http://proquest.safaribooksonline.com/9781449309770
[Docs]
Official PostgreSQL documentation. https://www.postgresql.org/docs/current/index.html

For extra notes, guides, and links, check out the Resources page.