STAC32

Applications of Statistical Methods

Ken Butler

bad picture

Welcome to the home page for STAC32. This is the place to look for all things course-related (notes, assignments, announcements etc., linked above) except for assignment hand-ins and marks, which will be on Quercus.

This is an applied course. Expect to be describing the process by which you got your answers, and explaining what the answers mean in the context of the data you are working with: that is to say, using your language skills as well as your statistical skills. Be prepared to show your understanding and insight; this course is about a lot more than “getting the answer”.

In real life, people do Statistics to make decisions or inform actions, and you will be expected to play your full part in that process, both in this course and in your statistical future.

News (most recent at the top):

  • 2024-12-18 15:00: The final exams are marked, coursework appeals are dealt with, grades are submitted. You should be receiving them shortly.

    • Reminders:
      • Read section 5, Grades in the detailed course policies.
      • I am not allowed to discuss or negotiate grades. Do not attempt to do so.
      • If you are unhappy with your grade, the only avenues of appeal you have are through the Registrar’s office.
      • Course grades are based only on the work you have done in a course, and not on any external circumstances. If you want a certain grade, it is up to you to do the work needed to obtain it.
    • Happy holidays to everyone, in whatever way you celebrate. I hope to see you in D29!
  • 2024-12-17 11:45: Assignment 7 is marked, grades to be posted shortly. We are on a tight timeline, so any appeals (by the usual procedure) need to reach me by 12 noon tomorrow, Dec 18.

  • 2024-12-16 22:15: now 65% done.

  • 2024-12-14 21:10: tonight’s report: 43% done, including (I think) all of mine.

  • 2024-12-13 21:15: We are up to 26% done. Expect your TAs and me to devote our weekend to marking your exam. My solutions, with edits from the questions I’ve been marking. The figures are at the end.

  • 2024-12-12 21:30: final exam marking is under way. We are now 11% done (which is actually all me so far). Your TAs have been briefed about the marking of their questions, and they will get to it when they have a chance.

  • 2024-12-10 14:15:

    • we haven’t forgotten about Assignment 7. This will be marked and included in your course grade.
    • final exam tomorrow. See Quercus announcement for where and when.
  • 2024-12-04 18:55: weather permitting, I’ll be in my office tomorrow (Thursday), sometime around early afternoon. (If the door is open, but I’m not there, I’m somewhere around and will be back soon.)

  • 2024-12-04 12:45:

    • Assignment 8 is now marked, and the marks will be released in a moment. Appeals by the usual procedure between Dec 5 and Dec 10 (so that I can get to them before the final exam). Assignment 7 will be coming your way soon.
    • in response to a question on the discussion board, I rewrote some of my solutions to worksheet 11 to fit better with the way we have been doing things.
  • 2024-12-03 11:30: final exam:

    • the final exam is comprehensive: you are responsible for anything in the course.
    • specifically: expect to see most of the material from after our midterm, and a selection of material from before.
  • 2024-12-02 14:45: updates:

    • I think I am happy with the final exam. It has 37 questions worth a total of 91 points; in that regard it is not much bigger than the midterm, but I think you can expect to have to work harder for each point here than you did on the midterm.
    • I will have pre-exam office hours on Monday Dec 9, 1:00-3:00pm.
    • I may also be in the office this week, depending on the forecast for snow. I was thinking of Wed or Thursday, but as I write, the forecast is for snow both days, so we’ll see.
  • 2024-11-28 13:05: final exam time and location is on Quercus.

  • 2024-11-28 11:15: Some solutions of mine:

  • 2024-11-25 11:30: the final Monday update:

    • Assignments:
      • Assignment 7: my solutions
      • Assignment 8 (the last one) due tonight.
    • Lectures:
      • Tuesday: functions part 1
      • Thursday: functions part 2. Once we are done this, we are done.
      • no lectures after that (Dec 2 and Dec 3 are makeup Mondays).
    • Worksheet 11 for tutorial on Wednesday.
      • Tutorial at 9:00am on Wednesday is cancelled. If you were going to that one, please go to one of the others (12 noon, 3:00pm, 4:00pm in the same room).
    • The final exam is under construction. I will let you know the number of pages and questions when it is finished.
  • 2024-11-21 22:05: My solutions to worksheet 10. Sorry for the delay, and thanks to the student who reminded me.

  • 2024-11-21 10:35: Assignment 6 has also been marked, and I am about to post the marks. Appeals by usual procedure between Nov 24 and Dec 1 inclusive.

  • 2024-11-19 10:50: Assignment 5 has been marked, and I am about to post the marks. Appeals by usual procedure between Nov 22 and Nov 29 inclusive.

  • 2024-11-18 11:20: Monday update (the second-to-last one):

    • Assignments:
      • Assignment 5 appears to be marked. When the grader confirms that it is all done, I will post the marks.
      • Assignment 6 marking is under way.
      • Assignment 7 is due tonight.
      • Assignment 8 (the last one) opens on Wednesday and is due on Monday of next week (the 25th).
    • Lectures:
      • Tuesday: asphalt case study
      • Thursday: regression with categorical (explanatory) variables
      • remaining: writing functions
    • Worksheet 10 for tutorial on Wednesday. There is one more tutorial and one more worksheet after this.
  • 2024-11-14 10:30: solutions to things:

  • 2024-11-11 15:15: I am about ready to post the midterm marks. You have received an email from Crowdmark with instructions for retrieving your marked exam.

    • Stats: median 57.25 (69%), Q1 45.75 (55%), Q3 70.25 (85%).
    • Appeals by the same procedure as for assignments, between Nov 14 and Nov 21 inclusive. Appeals will only be considered for grading errors, that you can demonstrate by reference to my solutions. In addition, the graders have been as consistent as possible across students, so I will not consider appeals for partial credit. Hence the only thing worth appealing is an answer where you can make the case for full credit for your answer, with reference to my solutions. To do this, you will need to demonstrate that the grader didn’t see something you wrote, or graded inconsistently with my solutions.
    • If you are disappointed by your exam mark:
      • Your first task is to go through your marked exam carefully with my solutions and understand where you earned and did not earn points.
      • If, despite your best efforts, you do not understand what you were missing, talk to me in office hours. These conversations are only about your understanding, not about your marks.
  • 2024-11-11 14:45: a (late) Monday update:

    • the midterm is marked! After I have finished checking it over, I will upload the marks to Quercus, and send you your marked exam (you will get an email from Crowdmark with your marked exam attached).
    • Assignment 6 is due tonight at the usual time.
    • Lectures:
      • Tuesday: the last little bit of tidying data, then the windmill case study (regression).
      • Thursday: the rest of the windmill case study, and maybe start on the asphalt case study (multiple regression).
    • Worksheet 9 for tutorial on Wednesday.
    • There are two more assignments after #6 and two more worksheets after #9.
  • 2024-11-10 21:10: now at 90% marked. Remaining questions seem to be 5, 6, 12, and 24. Results tomorrow, maybe.

  • 2024-11-09 20:50: we are now up to 68% of the midterm graded. I am done my share, and we are waiting on the TAs to finish their parts.

  • 2024-11-08 21:55: looks as if it’s been a productive day of marking; we are up to 53% done. I am approaching the end of my share of the marking. My updated solutions.

  • 2024-11-07 23:45: Midterm marking now at 38%. I did question 33 between lectures today and question 25 tonight, which was probably further than I expected to get today. (I’m working kinda backwards.) Tomorrow will be a (hopefully) productive day.

  • 2024-11-07 11:45: my midterm solutions as they stand now (will be updated as marking continues).

  • 2024-11-06 22:00: update:

    • my worksheet 8 solutions. I messed up getting these to the TAs in time, so if your TA seemed underinformed in tutorial today, that was my fault.
    • exam marking progress report: we are 28% done as of right now. I got through most of the ANOVA question today. I aim to do some more in between the two lectures tomorrow. It’s always a challenge to mark midterms while simultaneously keeping track of everything else in the course. Hence, you might need to remind me in lecture tomorrow (Thursday) of where we got to on Tuesday.
  • 2024-11-03 16:45: an early “Monday update”:

    • Midterm on Monday night! See Quercus announcement for time and place(s).
    • There are no assignments due this week. Assignment 6 will be due on Mon Nov 11, and there will be two more assignments after that.
    • Lectures:
      • Tuesday: the first part of tidying data
      • Thursday: the second part of tidying data
    • Worksheet 8 for tutorial on Wednesday.
  • 2024-10-30 13:15: course grade procedures:

    • if you miss the midterm for any reason, the weight is automatically transferred to the final exam (you don’t have to ask me or provide documentation).
    • if you write the midterm, but your final exam is better, your final exam automatically counts instead of your midterm.
    • therefore, it is in your best interest to write the midterm if you can: it is “marks in the bank” if you do well, and is ignored if you do better on the final.
    • on the other hand, if you are sick on exam day (particularly if you have something that is contagious or will disturb the person sitting next to you), stay home.
  • 2024-10-29 23:40: reminder of pre-exam office hours Wed 2:00-4:00pm in my office (see course outline for where that is).

  • 2024-10-25 17:00: when I was a bit younger than you are now (that is to say, a long time ago now), I was given some tips about writing exams, which I pass on below. This exam has 33 questions grouped together under a number of “scenarios” with a title at the top (as on the assignments and worksheets); you will probably want to attempt the questions for a particular scenario together:

    • budget your time. For my exams (and, I suspect, for most exams), the number of points is a reasonable proxy for the amount of time that it will take for a well-prepared student to answer the question. You know how many total points the exam is worth (see yesterday’s note), and you know how many minutes you have, so you know how long things should be taking. If you can, allow yourself a few minutes at the end for checking your answers.
    • before you answer any of the questions, read (or at least skim) through the whole exam, and note anything that you know how to answer.
    • start with the questions that look easiest, and make sure you get the points for those.
    • if a question is taking you too long (given your time budget), leave it and do another one. Your job is not to do all the questions, but to get as many points as possible. If you spend too long on a question, you are stopping yourself from getting points elsewhere. The worst thing is to not even attempt questions that you know how to do.
    • if you follow the principle of doing the easiest remaining questions at any point, anything you don’t have time for will be something that you weren’t going to get many points on anyway. (Note how different this is from the strategy of starting at the beginning and continuing until you run out of time.)
  • 2024-10-25 16:40:

    • I added last year’s exams to the old exam page. In contrast to this year, last year’s midterm was early, so there is a lot of material that can be on this year’s midterm that was not on last year’s. Midterms vary in how far through the semester they are, so be careful about using previous midterms as a guide to what will appear on this year’s.
    • I used to teach SAS in this course as well as R, so if you go back far enough, you will find questions on SAS in the exams. You can ignore those, or, perhaps better, think about how you would accomplish the same task with R.
    • Also, if you go back far enough, you may in addition find R things done differently from how they are now. This is the nature of a course using developing software. Your aim is to do these tasks as they are done now, meaning as I am teaching them this year. There are, I think, some notes in my solutions indicating how things have changed, but I doubt I have caught everything. I plan to take another look at those during reading week and see what I can fix up.
  • 2024-10-24 15:50: I think I am happy with the midterm. It has 33 numbered questions worth a total of 83 points, some of which are writing code and some of which are explaining things.

  • 2024-10-24 11:30: a quick Thursday update:

    • reminder that next week is Reading Week: no lectures or tutorials.
    • I forgot to tell you where one of the data files was on Assignment 6! This is now fixed (thank you to the student who noticed it).
    • My solutions to Assignment 5.
    • My solutions to Worksheet 7.
    • I intend to hold pre-exam office hours on about Wednesday of next week (during reading week). I’m anticipating 2:00-4:00pm in my office.
  • 2024-10-21 12:30: Monday update:

    • Assignment 5 is due tonight. If you are having difficulty installing smmr, make sure you have read the lecture notes and see the discussion on Quercus, or see the smmr homepage for another way.
    • Assignment 4 marking is done. My solutions. I will post the marks shortly. Appeals by the usual process between Oct 24 and Oct 31 inclusive. Common errors (according to the TA): missing the alternative in question 5 (and hence getting a different answer in question 6); not considering sample size in question 8).
    • Assignment 6 will open on Wed night. This is rather long (sorry!), but it is not due until Nov 11, so you have almost three weeks to do it.
    • Lectures:
      • Tuesday: the last bit of matched pairs, then the start of analysis of variance.
      • Thursday: The rest of analysis of variance.
    • Worksheet 7 for tutorial on Wednesday. This is again rather long, because I wanted to give you practice on everything that might appear on the midterm. Questions 1-10 should be doable after Tuesday’s lecture, but you might want to wait until after Thursday’s lecture before tackling the rest of it. I will post all my solutions on Wednesday as usual, but (see note 2024-10-20 16:00) you should not look at my solutions to question 11 and on until after you have made an honest attempt at solving them yourself.
  • 2024-10-20 16:00:

    • One of the most powerful ways to find out whether you know something is to solve problems on it, and when you have given the problems your best effort, then you look at the solutions and grade yourself. It then becomes very clear where any gaps in your knowledge are, and you know what to go back and study again. This is why I give you my worksheets first, and then later give you the solutions, and why I post my detailed solutions to the assignments after you have had a chance to work on them. (I will post my solutions to the midterm after you have written it, for the same reason. You will be able to see your graded exam, so you should go through that with my solutions.)
    • This means that the best way you have of studying for the midterm is, after reading through your lecture notes:
      • go to tutorial and do the worksheets yourself, then look at my solutions and grade yourself.
      • do the assignments yourself, and afterwards look at my solutions and see what you were missing.
      • if you want more practice, look at the problems in PASIAS. Each problem appears twice in its chapter, the first time without answers, the second time with. Attempt the problem as it appears without answers, then read the same problem as it appears with answers and grade yourself.
      • after you have graded yourself, go back to your lecture notes and re-read the things you are weak on (and, ideally, when you are ready, do another practice problem to see whether you understand it now).
      • There are old midterms. Use these in the same way: write the ones without answers first, under exam conditions with the notes you intend to bring to the actual exam, and then grade yourself using my solutions. Be mindful that in some of the older exams I do things differently from how I do them now. If you are confused by any of those, ask on the Quercus discussion board. See also Brian Harrington’s reddit post on using old exams.
      • If you don’t do these things (for example, you read through things with answers instead of taking the time to solve these problems yourself), you are setting yourself up to struggle in my exams, because solving my problems without knowing what the answers are is literally what you will need to do on the exams.
    • All this applies in any STEM-like course where later material depends on earlier material. You learn by solving problems (yourself) like the ones you will see on the exam, and preferably enough of them that you are ready for anything that might appear on the exam. There is no shortcut to success apart from this.
  • 2024-10-20 15:55: I had a couple of questions about read.csv last week after class. The key thing for you to know is that I did not teach this as a way of reading in a CSV file, so that in this course it is wrong and only read_csv with an underscore is correct. To have any hope of getting credit for using it, you need to say in your assignment where you got it from, and to say why it is better for the purpose than what I taught you. (It is too late to put that in an appeal.) It is also rather a signal that you got help from outside the course, and I warned you in the syllabus that this is unlikely to help you in the long run.

  • 2024-10-16 23:30: A late-night check-in:

    • Assignment 3 is graded. Appeals by the same procedure as for Assignments 1 and 2, between Oct 19 and Oct 26 inclusive. Added 2024-10-21 12:45: Common errors: questions 1 and 5 use of read.csv (see note 2024-10-20 15:55); question 4 and 9 failure to mention sample size and overcoming the skewness; question 7 and 8 make sure to use greater.
    • My solutions to Worksheet 6.
  • 2024-10-11 13:15: An early “Monday update”, since Monday is a holiday:

    • Assignment 4 is due on Tuesday night.
    • Assignment 3 marking is under way. My solutions.
    • Assignment 5 will open on Wed night.
    • Lectures next week, approximately:
      • Tuesday: the last little bit of sign test, then normal quantile plots.
      • Thursday: Mood’s median test
    • Worksheet 6 for tutorial on Wednesday. You should be able to do the first question now, and the second one after (next) Tuesday’s lecture.
  • 2024-10-09 22:30: my worksheet 5 solutions.

  • 2024-10-09 15:30: I have taken a look at the appeals for Assignment 1. You should have a comment from me on Quercus, together with a possibly adjusted mark.

  • 2024-10-09 12:30: about the midterm:

    • The date, time, and place of the midterm is in an announcement on Quercus, edited today. I am not announcing these to people outside the class.
    • The midterm is open-book. You can bring what you wish, such as your lecture notes, assignments, my slides, assignment solutions, etc, printed (no computers or other devices at the exam). You will need to organize whatever you bring, so that you can quickly find what you are looking for (and so there is an upper limit on what there is any point in bringing). If you are not well prepared, you can expect to run out of time; you will not have time to look everything up or go searching for things.
    • The exam coverage is everything that is done in lecture up to and including Thursday Oct 24. This is currently scheduled to include up to Analysis of Variance, but that may change. If I haven’t talked about it in class by the end of that day’s lecture, it’s not on the exam.
    • Expect to be writing code and explanations (for example, I might ask for code to do a task, or give you code or output and ask questions about it).
    • At the exam, you will get an exam paper with spaces to write your answers, and you also get a booklet with numbered Figures to refer to during the exam. The exam will say things like “In Figure 10, what is…” and you will need to find Figure 10 in the other booklet.
    • This is a Crowdmark exam, so it is best to use a pen or a sharp pencil, otherwise we may have trouble reading your answers. (Your exam will be scanned and then marked online, so if we cannot read the scan of your writing, we cannot give it points.) Only the front of the pages will be scanned, so do not write anything on the back of the pages, since no-one will see it. I add a blank page at the end of the exam which you can use for writing answers if you run out of space elsewhere.
    • The exam will have some number of numbered questions worth 2 or 3 or 4 or so points each, as on the assignments. I will tell you later how many questions there are.
    • if you have an exam accommodation, you will be writing with Accessability, and should make arrangements with them.
    • Any questions about exam procedures should go in Quercus discussions.
  • 2024-10-07 11:00: Monday update:

    • Assignment 3 is due tonight.
    • Assignment 2 has been marked, and I will release the marks shortly. Appeals between Oct 10 and 17 inclusive by the procedure described at 2024-09-27 11:25.
    • Assignment 4 will open on Wed night.
    • Lectures this week, approximately:
      • Tuesday: the rest of power of hypothesis tests
      • Thursday: the sign test. After that is normal quantile plots.
    • Worksheet 5 for tutorial on Wednesday (which should be doable after Tuesday’s lecture, and will prepare you for Assignment 4).
  • 2024-10-02 18:45: dinner is on the way, and I fixed my solutions.

  • 2024-10-02 18:10: a temporary version of my solutions to worksheet 4. I realized that I put the wrong plot in for question 6 (we haven’t seen the normal quantile plot in lecture yet), so will fix this, but I need to eat dinner first.

  • 2024-09-30 11:10: Monday update:

    • Assignment 2 is due tonight.
    • Lectures:
      • Tuesday: bootstrap sampling distribution of the sample mean (this will help us answer the question of “is our sample big enough for the Central Limit Theorem to help?”)
      • Thursday: power of hypothesis tests (this may spill into next week)
    • Tutorial: Worksheet 4
    • Assignment 3, on the same material as Worksheet 4, will open on Wednesday night.
  • 2024-09-27 11:25: Assignment 1 is now graded. I will release the marks shortly:

    • My solutions
    • If you wish to appeal your mark, you will need to follow the procedures given here precisely, in particular items 18–22 in section 3. In short, you need to find an error in the marking, and you need to explain clearly what that error is and how your work is in fact a correct answer to the question. Grader’s judgement is not an error, and not liking your mark is in no way grounds for an appeal. The time window for appeals on Assignment 1 is September 30 - October 7 inclusive. Appeals sent outside that window will not be considered.
  • 2024-09-25 20:30: My solutions to worksheet 3. Assignment 2 should now be open.

  • 2024-09-24 12:15: We have an official second section of STAD29, which will run Monday 1-3pm (one two-hour lecture a week, same as the other section). It should be big enough to accommodate all the students currently on the waitlist. If you are in section 1 currently and want to move to the new second section, that should be possible (which would allow space for students currently on the waitlist to join section 1, if that suits them better).

  • 2024-09-23 11:50: Monday update:

    • Assignment 1 due tonight.
    • Lectures: I got to the end of “choosing things” last Thursday, so we have for this week:
      • Tuesday: one-sample \(t\)
      • Thursday: two-sample \(t\) and start of bootstrap for sample mean
    • Tutorial: Worksheet 3 questions. Questions 18 and 19 relate to Tuesday’s lecture this week.
    • Assignment 2 will open on Wednesday night.
  • 2024-09-22 15:00: If you are having trouble accessing the assignment, your first port of call should be the student computer helpdesk to find out why that is (you are probably also going to have trouble handing in the assignment if you do not get this sorted out). For this week only, here are the questions.

  • 2024-09-19 15:30: I just took a look through the Assignment 0’s. If you got a 1 and a “yep”, you are all good. Most people got both of those. If you handed in something and didn’t get a 1, or got a 1 and a comment, you need to figure out what happened. The commonest things missing were the embed-resources: true thing that meant that there were no graphs, and the df-print: paged thing that meant that there was no nice display of dataframes. See question 13 in worksheet 1; these go in the “YAML block” at the very top of your document. If you got a 0, you are welcome to try again and I will endeavour to take a look at your revised version. (I was pretty relaxed about what you included; if you had some code, some output and a graph, ideally with some text of your own, I was good with it.)

  • 2024-09-19 10:35: two things:

    • Assignment 1 is now open, due next Monday.
    • September 16 was the last day to register yourself in a tutorial. If you haven’t done so yet, you will now need to go through the Registrar’s office to do this.
  • 2024-09-19 00:15: before I go to bed, my solutions to Worksheet 2.

  • 2024-09-18 12:00: We have a date for the midterm. I don’t want to share the time and place in public, so I am about to post an announcement on Quercus so that you can make plans.

  • 2024-09-16 12:55: Assignment 1 is now on Quercus, and will open for you on Wed night. Note that it is available until Wed of next week, but it is due on Monday night, and there is a late penalty (1% per hour) if you submit it after 11:59pm Monday. If you have an official Accessibility accommodation that grants you an extension on assignments, it is up to you to claim it (by emailing me) before the official due date each time you need it. The maximum such extension is two days, since I will be publishing solutions for everyone after that. There are no other extensions or make-ups, as per the syllabus. You may hand in your assignment up to two days late, but the late penalty will apply. (You will recall from your reading of the syllabus that the worst two assignments are dropped; if life circumstances intervene and you cannot hand in an assignment before it closes, that will automatically count as one of your dropped assignments.)

  • 2024-09-16 12:15: Monday update:

    • lectures: the rest of numerical summaries (Tuesday), choosing things from dataframes (both days), one-sample \(t\) (start Thursday).
    • tutorial: worksheet 2. Note that this week (and from here forward) this only has the questions. The idea is that by learning how to do the worksheet yourself (using your lecture notes and your brain), you are also learning how to do the assignment that depends on this worksheet (and thereby preparing yourself for any midterm questions on the material in this worksheet). After you have made an honest effort on this worksheet yourself, feel free to ask your tutorial TA for guidance if you get stuck. I will post my solutions on Wed evening.
    • assignment 1 will be posted on Wednesday evening, after all the tutorials are done and I have posted my worksheet solutions, and will be due the following Monday night (at the traditional time of 11:59pm). The idea is that you do the assignment yourself, based on what you have learned from doing the worksheet (and from the material in your lecture notes and by using your brain); expect minimal help from me. You have practiced the submission procedure on worksheet 1 (“assignment 0”), so you should be able to figure that out for Assignment 1.
  • 2024-09-11 18:00: to answer the question from lecture: if you’re reading in data from an Excel sheet, can you get hold of the names of the sheets in R, so that you can figure out which one contains the data you want? The answer turns out to be yes. To use the test2.xlsx from lecture:

library(readxl)
book <- "test2.xlsx"
excel_sheets(book)
[1] "data"  "notes"

and hence

xy <- read_excel(book, "data")
xy
# A tibble: 6 × 4
  id        x     y group
  <chr> <dbl> <dbl> <chr>
1 p1       10    21 upper
2 p2       11    20 lower
3 p3       13    25 upper
4 p4       15    27 lower
5 p5       16    30 upper
6 p6       17    31 lower

Coding tip: I knew I was going to be referring to the spreadsheet file twice (once to get the names of the sheets in it, and once to access the sheet I wanted), so I saved the filename (in book, short for “workbook”) to avoid typing the filename twice. (I also gave the dataframe a name that says at least something about what it contains, even though we don’t know what x and y actually are.)

  • 2024-09-09 13:20: on the table for this week:

    • lectures:
    • tutorials: start this week, on Wednesday. Your chance to work through (or finish working through) Worksheet 1 with a TA there to help, as well as handing in the practice Assignment 0 (checking that your graphs successfully made it).
  • 2024-09-06 13:00: I thought I would write out my plans for the course and share them with you (Google sheet):

    • I went about as far as reading week, because I am usually too ambitious. Expect there to be edits material-wise.
    • There is a midterm to be scheduled; I try not to have an assignment and a midterm the same week (eg. if the midterm is on a Friday, there won’t be an assignment due the Monday after).
    • This means that the last assignment will probably end up being due on Nov 25 and not as shown. Expect that the material in assignment \(n\) will be the same as on worksheet \(n+1\) (the questions will of course be different), at least up until the midterm when things might get further out of sync.
    • Depending on when the midterm is scheduled, the last tutorial before it might be a chance for you to bring exam-related questions to the TA.
    • I made assignment 4 due the day after Thanksgiving rather than Thanksgiving itself.
  • 2024-09-05 10:20: Today’s class: reading in data. Files for today’s class (added after: it looks as if they all work now):

  • 2024-09-04 17:30: I think I forgot to mention exams in my go-through of the course outline on Tuesday. The information you need is on the Assessment page of the course outline, at the bottom (you might need to scroll down).

  • 2024-09-03 12:50: Worksheet 1, for tutorial next week, in case you want to start on it now.

  • 2024-09-03 11:30: There were three fourth-year students on the waitlist this morning, and I have asked for them to be added to the course. Otherwise, the course is full because the classrooms for both lecture sections are full.

  • 2024-08-29 14:30: there is indeed a fourth tutorial:

    • Since there has been some last-minute reorganization, tutorials start in week 2 this year. This means that some of the early worksheets (and therefore assignments) will be a bit behind what appears in the lectures, but we will catch up as we go. The first worksheet should actually be doable after the very first lecture, so I will make sure it’s up in case you want to start on it right away. It looks as if there will be eight assignments this year (the first one will be due after week 3), of which the best six will count.
    • If you are still on the waitlist, it’s up to you to register yourself for the new LEC02. Also, if you are in LEC01 but would rather be in LEC02, you should be able to move to that section.
  • 2024-08-28 14:20: there will soon be a fourth tutorial (on Wednesday in BV 498, like the others, is the plan). It doesn’t matter which lecture section you are in; the tutorials are all good for both sections.

  • 2024-08-27 11:40: we have a second lecture section, and there should be space in it for everyone on the waitlist as of right now:

    • the lectures for LEC02 (taught by me as well) are Tu 15:00-16:00 and Th 17:00-18:00
    • if you are currently on the waitlist, you can sign up for LEC02 right away
    • there will be tutorial space for everyone on Wed (it doesn’t matter which tutorial you are in, because the two lecture sections will run in parallel). Expect a fourth tutorial to be created soon. You have (I think) the option to switch tutorials, up to a certain date. Since things are in a chaotic state right now, I may switch the tutorials back to starting in week 2 (and not what I said on August 12).
    • Acorn will tell you the locations of everything.
    • The times for LEC01 are unchanged.
    • if you are currently in LEC01 and want to switch to LEC02, you can do that, which would allow people on the waitlist/in LEC02 to switch to LEC01 if that suits them better.
  • 2024-08-23 15:45: breaking news: we are seeing whether we can open a second section of this course. This is contingent on being able to find a classroom to hold it in at a time that those of you on the waitlist (and I) can manage, which (as is usually the case at UTSC) is the biggest part of the problem. Expect more news early next week. (If need be, we will also hold a second section of STAD29, so there will be room in any case in that course for everyone that makes it through STAC32.)

  • 2024-08-12 12:00:

    • A reminder that the course is full because the lecture room is full, and so I cannot admit any more students even if I wanted to.
    • The course outline, so that you can get a sense of what to expect.
    • Highlights:
      • lectures on Tuesday and Thursday (1 hour each)
      • tutorial on Wednesday in a computer lab, starting in week 1. In tutorial, you get a worksheet to work through using R with TA help if you get stuck. The first worksheet takes you through things from the very beginning and comes with answers, including a practice assignment hand-in. Later worksheets don’t come with answers (so you have to work it out yourself), but I will publish answers after each week’s tutorials are done.
      • tutorial worksheets prepare you for the assignments.
      • assignments are due weekly on Mondays and open the previous Wednesday night (the first one opens in week 2 and is due the Monday of week 3), on the same material that the previous worksheet was on. Your assignment answers will need to show code, output, and explanation of the results. (The practice assignment shows you what is needed.) I publish complete solutions after each assignment has been handed in.
      • if you do your own work on the worksheets, this will help you for the assignments; if you do your own work on the assignments, this will help you on the exams and will thus help your course grade.
      • there will be one out-of-class midterm on a date to be announced, around halfway through the course.
      • there will be a final exam, also on a date to be announced, during the final exam period.
      • exams are handwritten with short-answer questions, some of which may ask you to write code. Exams are open-book, but bear in mind that you need to organize what you bring, or else you will run out of time trying to find things.
      • the course materials (and your brain) will contain everything you need to succeed in this course, except for a few places where I ask you to find something out.
  • 2024-07-29 10:45:

    • I have found space for a few more students in the course, and have asked for these students to be added. Once this is done, the course will be full, since we will be at maximum capacity for our lecture room and I am not allowed to enlarge the class any further.
    • In the next couple of days, you will see either that you have been enrolled in the course or that you have moved up the waitlist. A warning that the waitlist in this class historically moves very slowly, so if you are still on the waitlist after this, I advise you to make backup plans for your courses.
    • The waitlist is long, so I had to make some choices about who to admit. Since STAC32 and STAD29 are two of the last courses in the Statistics Applied Minor program, I prioritized those students in the program who are closest to graduating, on the basis that this may be their last year, whereas students who are further from graduating can take these courses next year. This is based on the number of credits that each student will have completed by the end of the summer (assuming they pass any summer courses), with the students on the waitlist ranked from greatest to least. If you are in your 3rd year of study (as determined by UTSC), you need to be very close to the 14.0 credits that will make you 4th year to get in this time (if you don’t, you will have a higher priority next year).
    • You can find your total number of credits in your Degree Explorer: under Degree and Completed Credits you will see something like 17.00/20.00 (includes 7.00 In Progress). This student will have 17.0 credits once they have completed the courses they are now enrolled in, and thus they have \(17-7 = 10\) credits completed now and are in their 3rd year of study.
    • My decision of students to enrol is final and is not open to appeal, and, as I say, the course is full once these students are enrolled because the lecture room is full.
  • 2024-06-28 19:00: time to open things up for the fall 2024 edition of this course:

    • I’m aware that the course is filling up fast. After the 3rd years have had a chance to enrol, I will look more carefully at who is on the waitlist, with the aim of making sure that those who really need the course are able to take it, subject to space being available. My priority order is: students in the Statistics Applied Minor program, and within those, students in the fourth year of study (or beyond) as calculated by UTSC, then students in earlier years of study in the program, and then students in other programs. Appealing directly to me will not change that.
    • There are (COVID-era old) recordings of the lecture material. Things have changed since they were made, but they will be available in case you must miss an occasional lecture (for example, you are sick). They are not a substitute for routinely missing lecture; if you do that, you risk missing important material and thereby hurting your grade.
    • I will be checking prerequisites. To take STAC32, you must have completed (by the time STAC32 starts in September) either STAB27 or PSYC08 or MGEB12. If you have not completed one of these courses by then, you will be removed from STAC32.
    • Looking ahead to next winter’s STAD29: my aim is to make sure that there will be enough space in that course for everyone that wants to take it (and has completed STAC32).
    • Program FAQ.