STAD29

Statistics for the Life and Social Sciences

Ken Butler

bad picture

Welcome to the home page for STA 1007 / STAD29. This is the place to look for things course-related (notes, code, old exams etc., linked above) except for lecture videos, assignment hand-ins and marks, which will be on Quercus.

News (newest first)

  • 2024-02-26 11:30: Monday update:

    • Midterm: location and date is on an announcement on Quercus labelled “Midterm”. My reply to that announcement tells you which room you need to be in.
    • Assignment 5 was due last night. Assignment 6 opens tonight; you once again have 2 weeks to complete it, so that the midterm and an assignment are not due the same weekend. You may find it useful to start Assignment 6, however, as part of your midterm preparation (the material on it is on the exam).
    • Tutorial this afternoon. I will find something from the lecture before reading week.
    • Lecture this week: analysis of covariance (short), multivariate analysis of variance (longer), maybe some of repeated measures.
    • If you have questions as you prepare for the exam, post them on the Quercus discussion board or catch me after lecture.
  • 2024-02-22 11:50:

    • As I think I mentioned at the end of the last lecture, coverage for the midterm includes the last lecture before reading week (the “ANOVA revisited” stuff, as much as we saw in class).
    • The midterm has gone for printing. It has five questions, with a total of 23 parts worth 53 points altogether. Expect to be doing more explanation than coding (since the focus of this course is the understanding of the statistics).
    • The exam is open book, same rules as STAC32, and with the same expectation that you will organize your materials before the exam (or else you can expect to run out of time).
    • Suggestion: bring a calculator to the exam (an actual calculator, not your phone). You might find it helpful in a couple of places.
    • There will be a lecture next week before the midterm. That material will not be on the midterm, but you can count on it being on the final exam (and it will help you understand what follows in the course).
    • re Assignment 2, I have asked the grader to add some explanation to the grading so that you know where you didn’t get full marks. In the meanwhile, compare your answers with my solutions. The same applies to Assignment 3 and Assignment 4 that you don’t have back yet.
  • 2024-02-20 14:00: Assignment 2 is graded, and I am about to post the marks. Appeals by the usual procedure between Feb 23 and Mar 1.

  • 2024-02-18 22:15: A Sunday night “Monday update”, to remind you that the upcoming week is Reading Week, and there is therefore no tutorial on Monday or lecture on Wednesday. (I will be spending the time sorting out your midterm.) We resume on the 26th.

  • 2024-02-14 13:30: a couple of things:

    • My solutions to Assignment 4
    • The lecture notes for ANOVA Revisited are rather long, and I won’t be talking about all of it. The beginning is review; today, I plan to start with the Rats and Vitamin B example.
  • 2024-02-12 11:25: Monday update:

    • Assignment 4 was due last night; Assignment 5 opens tonight. You will have two weeks to do Assignment 5, since it is due on Sunday at the end of reading week.
    • Tutorial this afternoon on Zoom as usual. I will find a survival analysis example from PASIAS to talk about.
    • Lecture: there is a little more material at the end of the Survival Analysis slides that you would probably do well to look through, but this week I’ll be moving on to ANOVA Revisited. This is followed (probably after reading week) by some other ideas that are based on ANOVA: MANOVA, repeated measures, discriminant analysis and so on.
  • 2024-02-08 10:50: my solutions to assignment 3.

  • 2024-02-07 20:10: I updated the lecture notes to reflect what I added for class today (the slides “behind the scenes” and the one after that). This now seems to be working for both the .html and the .pdf versions of the slides.

  • 2024-02-05 15:15: assignment planning: there will be no assignments due during reading week or the weekend of the midterm. That means you’ll get two weeks to do each of Assignment 5 (due the Sunday night at the end of reading week) and Assignment 6 (due March 10). The eighth and final assignment will be due on March 24, so you’ll get a break at the end of the course.

  • 2024-02-05 11:30: Monday update:

    • Assignments: #1 is marked, appeals by usual procedure between Feb 8 and 15; #2 is being marked; #3 was due yesterday; #4 opens tonight.
    • Tutorial on zoom this afternoon as usual (usual link)
    • Lecture this week: survival analysis. (I’ll decide whether I want to do any more of the dates and times stuff, but I think we have all we need of that.) Content warning on the survival analysis: often the data is on people that will die of something (often cancer), and we will be investigating treatments that will help them live longer, but some of them will die of whatever-it-is. I think that will take up our two hours this week.
  • 2024-02-01 10:30: My solutions to Assignment 2.

  • 2024-01-29 13:20: Monday update:

    • Tutorial on zoom this afternoon at 4 (link in Quercus announcement)
    • Assignments: #2 was due yesterday; #3 opens tonight. I will let you know when #1 is graded.
    • Lecture this week:
      • logistic regression with multi-category but unordered response
      • dates and times (in which you learn just how fiddly dates and times are to handle, and how useful it is to have packages that handle them for us)
    • extra practice problems on this week’s material: PASIAS chapters 29 and 21 (respectively).
  • 2024-01-25 20:30: After Wednesday’s class, I decided that you might like some extra practice on the stuff we did in the first half of class (the log-odds stuff), so I added a short question on that to the next assignment. If that material is still confusing you, we can talk about that on Monday. (It is probably not giving much away to say that the second question on that assignment is on the stuff we did after half-time in class this week.)

  • 2024-01-25 12:20: my solutions to Assignment 1.

  • 2024-01-24 11:25: We have a midterm date. See the announcement on Quercus for date, time, and place (not until early March).

  • 2024-01-22 13:30: upcoming, this week:

    • tutorial today at 4 on zoom, same coordinates as last week. As I write, Accuweather is telling me “periods of heavy snow for at least 60 minutes”, so I am quite happy to not be travelling today! I plan to pick a problem from PASIAS to talk about, and of course bring other questions if you have them.
    • in lecture this week, some or all of:
      • log-odds and odds ratios and relative risk (partly in response to last week’s question about what those slope and intercept numbers actually mean)
      • logistic regression with a multi-category response when the response categories are ordered (the coal miners lung disease example)
      • logistic regression with a multi-category response when the response categories are not ordered (the brand preference example)
    • extra examples: chapters 28 and 29 of PASIAS.
    • Assignment 2, on the logistic regression stuff from last week’s lecture, opens tonight and is due next Sunday.
  • 2024-01-17 13:30: practice problems for this week’s material: PASIAS chapter 26 (edited 2024-01-18: we’ll get to chapter 27 next week).

  • 2024-01-15 11:15: on the agenda this week:

    • tutorial today at 4:00pm, on zoom (link in Quercus announcement), on the regression stuff we did in lecture last week.
    • The stuff in the slides that I didn’t talk about is regression review. Read through those if you feel you need more review.
    • Assignment 1 opens tonight, is due next Sunday night (the 21st), on the stuff we did in lecture last week.
    • This week’s lecture is on logistic regression. The thing that distinguishes this from regular regression is that the response variable is categorical rather than quantitative. There are three parts (that we won’t do all of this week): when the outcome is a success/failure, when the outcome is several categories but ordered, when the outcome is several categories that are not ordered. There are also variations in how the data come to us (and therefore how we have to deal with it).
    • Next week’s tutorial (the 22nd) is on the stuff in this week’s lecture, and Assignment 2 (opens on the 22nd and is due on the 28th) is on the same material.
  • 2024-01-11 20:45: we have a Zoom meeting set up for Monday’s tutorial. The coordinates are in a Quercus announcement (I am not announcing it publicly).

  • 2024-01-11 11:45: Here is a worksheet on the material we looked at in class yesterday. I’m not promising a worksheet every week, but I will point you at some practice problems for each section of the course. Look out for information about Monday’s session tomorrow (once I have sorted that out).

  • 2024-01-08 12:45: Our course begins this week:

    • expect things to be structured a lot like they were in C32.
    • one two-hour lecture a week, on Wednesdays (I will give you a break in the middle). Your ACORN has the location.
    • one hour of tutorial-slash-office hour on Mondays, starting next week, probably on zoom. You can also catch me after lecture, or post in the Quercus discussions.
    • weekly assignments, with the first one going out on January 15 (out Monday night, due the following Sunday night), in the same style as for C32.
    • D29 is more about the statistics and less about the coding than C32 was; you will be learning some statistical methods that are definitely new to you, and I will assume that you are keeping up with the coding part.
  • 2024-01-03 11:00: lecture 1 is a week away:

    • Quercus page is up (for assignments, discussion board etc)
    • be ready to learn some new statistics (there will be stuff you haven’t seen before)
    • I will assume that you are familiar with the R stuff, and the statistical ideas, that you learned in C32. (If you are not, be prepared to do some catching up.)
    • we begin with some new ideas in regression.
    • I don’t think there is anyone taking the course as STA 1007 (graduate course) this year; if you are, let me know.
  • 2023-12-12 18:30: a bit more detail:

    • lectures are once a week on Wednesdays for 2 hours
    • there is a “practical” on Mondays at 4, starting in week 2, probably on zoom. This is a sort of combined office hour / tutorial to which you can bring questions about the previous week’s material.
    • weekly assignments (that you are used to from C32), opening Monday night, due the following Sunday night.
    • a 2-hour midterm and a 3-hour final as usual, on dates to be announced.
  • 2023-12-08 14:30: here is this year’s version of the site. Class meets once a week for two hours, starting on Wed January 10, 2024.