2024-12-18 15:00: The final exams are marked, coursework appeals are dealt with, grades are submitted. You should be receiving them shortly.
- Reminders:
- Read section 5, Grades in the detailed course policies.
- I am not allowed to discuss or negotiate grades. Do not attempt to do so.
- If you are unhappy with your grade, the only avenues of appeal you have are through the Registrar’s office.
- Course grades are based only on the work you have done in a course, and not on any external circumstances. If you want a certain grade, it is up to you to do the work needed to obtain it.
- Happy holidays to everyone, in whatever way you celebrate. I hope to see you in D29!
2024-12-17 11:45: Assignment 7 is marked, grades to be posted shortly. We are on a tight timeline, so any appeals (by the usual procedure) need to reach me by 12 noon tomorrow, Dec 18.
2024-12-16 22:15: now 65% done.
2024-12-14 21:10: tonight’s report: 43% done, including (I think) all of mine.
2024-12-13 21:15: We are up to 26% done. Expect your TAs and me to devote our weekend to marking your exam. My solutions, with edits from the questions I’ve been marking. The figures are at the end.
2024-12-12 21:30: final exam marking is under way. We are now 11% done (which is actually all me so far). Your TAs have been briefed about the marking of their questions, and they will get to it when they have a chance.
2024-12-10 14:15:
- we haven’t forgotten about Assignment 7. This will be marked and included in your course grade.
- final exam tomorrow. See Quercus announcement for where and when.
2024-12-04 18:55: weather permitting, I’ll be in my office tomorrow (Thursday), sometime around early afternoon. (If the door is open, but I’m not there, I’m somewhere around and will be back soon.)
2024-12-04 12:45:
- Assignment 8 is now marked, and the marks will be released in a moment. Appeals by the usual procedure between Dec 5 and Dec 10 (so that I can get to them before the final exam). Assignment 7 will be coming your way soon.
- in response to a question on the discussion board, I rewrote some of my solutions to worksheet 11 to fit better with the way we have been doing things.
2024-12-03 11:30: final exam:
- the final exam is comprehensive: you are responsible for anything in the course.
- specifically: expect to see most of the material from after our midterm, and a selection of material from before.
2024-12-02 14:45: updates:
- I think I am happy with the final exam. It has 37 questions worth a total of 91 points; in that regard it is not much bigger than the midterm, but I think you can expect to have to work harder for each point here than you did on the midterm.
- I will have pre-exam office hours on Monday Dec 9, 1:00-3:00pm.
- I may also be in the office this week, depending on the forecast for snow. I was thinking of Wed or Thursday, but as I write, the forecast is for snow both days, so we’ll see.
2024-11-28 13:05: final exam time and location are on Quercus.
2024-11-28 11:15: Some solutions of mine:
2024-11-25 11:30: the final Monday update:
- Assignments:
- Assignment 7: my solutions
- Assignment 8 (the last one) due tonight.
- Lectures:
- Tuesday: functions part 1
- Thursday: functions part 2. Once we are done this, we are done.
- no lectures after that (Dec 2 and Dec 3 are makeup Mondays).
- Worksheet 11 for tutorial on Wednesday.
- Tutorial at 9:00am on Wednesday is cancelled. If you were going to that one, please go to one of the others (12 noon, 3:00pm, 4:00pm in the same room).
- The final exam is under construction. I will let you know the number of pages and questions when it is finished.
2024-11-21 22:05: My solutions to worksheet 10. Sorry for the delay, and thanks to the student who reminded me.
2024-11-21 10:35: Assignment 6 has also been marked, and I am about to post the marks. Appeals by usual procedure between Nov 24 and Dec 1 inclusive.
2024-11-19 10:50: Assignment 5 has been marked, and I am about to post the marks. Appeals by usual procedure between Nov 22 and Nov 29 inclusive.
2024-11-18 11:20: Monday update (the second-to-last one):
- Assignments:
- Assignment 5 appears to be marked. When the grader confirms that it is all done, I will post the marks.
- Assignment 6 marking is under way.
- Assignment 7 is due tonight.
- Assignment 8 (the last one) opens on Wednesday and is due on Monday of next week (the 25th).
- Lectures:
- Tuesday: asphalt case study
- Thursday: regression with categorical (explanatory) variables
- remaining: writing functions
- Worksheet 10 for tutorial on Wednesday. There is one more tutorial and one more worksheet after this.
2024-11-14 10:30: solutions to things:
2024-11-11 15:15: I am about ready to post the midterm marks. You have received an email from Crowdmark with instructions for retrieving your marked exam.
- Stats: median 57.25 (69%), Q1 45.75 (55%), Q3 70.25 (85%).
- Appeals by the same procedure as for assignments, between Nov 14 and Nov 21 inclusive. Appeals will only be considered for grading errors that you can demonstrate by reference to my solutions. In addition, the graders have been as consistent as possible across students, so I will not consider appeals for partial credit. Hence the only thing worth appealing is an answer where you can make the case for full credit for your answer, with reference to my solutions. To do this, you will need to demonstrate that the grader didn’t see something you wrote, or graded inconsistently with my solutions.
- If you are disappointed by your exam mark:
- Your first task is to go through your marked exam carefully with my solutions and understand where you earned and did not earn points.
- If, despite your best efforts, you do not understand what you were missing, talk to me in office hours. These conversations are only about your understanding, not about your marks.
2024-11-11 14:45: a (late) Monday update:
- the midterm is marked! After I have finished checking it over, I will upload the marks to Quercus, and send you your marked exam (you will get an email from Crowdmark with your marked exam attached).
- Assignment 6 is due tonight at the usual time.
- Lectures:
- Tuesday: the last little bit of tidying data, then the windmill case study (regression).
- Thursday: the rest of the windmill case study, and maybe start on the asphalt case study (multiple regression).
- Worksheet 9 for tutorial on Wednesday.
- There are two more assignments after #6 and two more worksheets after #9.
2024-11-10 21:10: now at 90% marked. Remaining questions seem to be 5, 6, 12, and 24. Results tomorrow, maybe.
2024-11-09 20:50: we are now up to 68% of the midterm graded. I am done my share, and we are waiting on the TAs to finish their parts.
2024-11-08 21:55: looks as if it’s been a productive day of marking; we are up to 53% done. I am approaching the end of my share of the marking. My updated solutions.
2024-11-07 23:45: Midterm marking now at 38%. I did question 33 between lectures today and question 25 tonight, which was probably further than I expected to get today. (I’m working kinda backwards.) Tomorrow will be a (hopefully) productive day.
2024-11-07 11:45: my midterm solutions as they stand now (will be updated as marking continues).
2024-11-06 22:00: update:
- my worksheet 8 solutions. I messed up getting these to the TAs in time, so if your TA seemed underinformed in tutorial today, that was my fault.
- exam marking progress report: we are 28% done as of right now. I got through most of the ANOVA question today. I aim to do some more in between the two lectures tomorrow. It’s always a challenge to mark midterms while simultaneously keeping track of everything else in the course. Hence, you might need to remind me in lecture tomorrow (Thursday) of where we got to on Tuesday.
2024-11-03 16:45: an early “Monday update”:
- Midterm on Monday night! See Quercus announcement for time and place(s).
- There are no assignments due this week. Assignment 6 will be due on Mon Nov 11, and there will be two more assignments after that.
- Lectures:
- Tuesday: the first part of tidying data
- Thursday: the second part of tidying data
- Worksheet 8 for tutorial on Wednesday.
2024-10-30 13:15: course grade procedures:
- if you miss the midterm for any reason, the weight is automatically transferred to the final exam (you don’t have to ask me or provide documentation).
- if you write the midterm, but your final exam is better, your final exam automatically counts instead of your midterm.
- therefore, it is in your best interest to write the midterm if you can: it is “marks in the bank” if you do well, and is ignored if you do better on the final.
- on the other hand, if you are sick on exam day (particularly if you have something that is contagious or will disturb the person sitting next to you), stay home.
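The first two rules above amount to taking the better of two weightings. A minimal sketch in R, assuming made-up weights (the real ones are in the course outline, not here) and marks given as percentages; exam_component is a hypothetical name used only for illustration:

```r
# Hypothetical weights, for illustration only; see the course outline for the real ones
w_midterm <- 0.3
w_final <- 0.5

# Combined exam contribution to the course grade, with marks as percentages.
# midterm = NA means that the midterm was missed.
exam_component <- function(midterm, final) {
  # Option 1: the final exam carries the midterm's weight as well as its own
  final_only <- (w_midterm + w_final) * final
  if (is.na(midterm)) {
    return(final_only)
  }
  # Option 2: each exam counts at its own weight
  both <- w_midterm * midterm + w_final * final
  # The final replaces the midterm exactly when the final is the better mark
  max(both, final_only)
}

exam_component(80, 70) # midterm better: both exams count
exam_component(50, 70) # final better: the final counts for everything
exam_component(NA, 70) # midterm missed: same result as the line above
```

Under these assumed weights, a midterm can only raise the exam contribution, never lower it, which is the "marks in the bank" point.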
2024-10-29 23:40: reminder of pre-exam office hours Wed 2:00-4:00pm in my office (see course outline for where that is).
2024-10-25 17:00: when I was a bit younger than you are now (that is to say, a long time ago now), I was given some tips about writing exams, which I pass on below. This exam has 33 questions grouped together under a number of “scenarios” with a title at the top (as on the assignments and worksheets); you will probably want to attempt the questions for a particular scenario together:
- budget your time. For my exams (and, I suspect, for most exams), the number of points is a reasonable proxy for the amount of time that it will take for a well-prepared student to answer the question. You know how many total points the exam is worth (see yesterday’s note), and you know how many minutes you have, so you know how long things should be taking. If you can, allow yourself a few minutes at the end for checking your answers.
- before you answer any of the questions, read (or at least skim) through the whole exam, and note anything that you know how to answer.
- start with the questions that look easiest, and make sure you get the points for those.
- if a question is taking you too long (given your time budget), leave it and do another one. Your job is not to do all the questions, but to get as many points as possible. If you spend too long on a question, you are stopping yourself from getting points elsewhere. The worst thing is to not even attempt questions that you know how to do.
- if you follow the principle of doing the easiest remaining questions at any point, anything you don’t have time for will be something that you weren’t going to get many points on anyway. (Note how different this is from the strategy of starting at the beginning and continuing until you run out of time.)
2024-10-25 16:40:
- I added last year’s exams to the old exam page. In contrast to this year, last year’s midterm was early, so there is a lot of material that can be on this year’s midterm that was not on last year’s. Midterms vary in how far through the semester they are, so be careful about using previous midterms as a guide to what will appear on this year’s.
- I used to teach SAS in this course as well as R, so if you go back far enough, you will find questions on SAS in the exams. You can ignore those, or, perhaps better, think about how you would accomplish the same task with R.
- Also, if you go back far enough, you may in addition find R things done differently from how they are now. This is the nature of a course using developing software. Your aim is to do these tasks as they are done now, meaning as I am teaching them this year. There are, I think, some notes in my solutions indicating how things have changed, but I doubt I have caught everything. I plan to take another look at those during reading week and see what I can fix up.
2024-10-24 15:50: I think I am happy with the midterm. It has 33 numbered questions worth a total of 83 points, some of which are writing code and some of which are explaining things.
2024-10-24 11:30: a quick Thursday update:
- reminder that next week is Reading Week: no lectures or tutorials.
- I forgot to tell you where one of the data files was on Assignment 6! This is now fixed (thank you to the student who noticed it).
- My solutions to Assignment 5.
- My solutions to Worksheet 7.
- I intend to hold pre-exam office hours on about Wednesday of next week (during reading week). I’m anticipating 2:00-4:00pm in my office.
2024-10-21 12:30: Monday update:
- Assignment 5 is due tonight. If you are having difficulty installing smmr, make sure you have read the lecture notes and see the discussion on Quercus, or see the smmr homepage for another way.
- Assignment 4 marking is done. My solutions. I will post the marks shortly. Appeals by the usual process between Oct 24 and Oct 31 inclusive. Common errors (according to the TA): missing the alternative in question 5 (and hence getting a different answer in question 6); not considering sample size in question 8.
- Assignment 6 will open on Wed night. This is rather long (sorry!), but it is not due until Nov 11, so you have almost three weeks to do it.
- Lectures:
- Tuesday: the last bit of matched pairs, then the start of analysis of variance.
- Thursday: The rest of analysis of variance.
- Worksheet 7 for tutorial on Wednesday. This is again rather long, because I wanted to give you practice on everything that might appear on the midterm. Questions 1-10 should be doable after Tuesday’s lecture, but you might want to wait until after Thursday’s lecture before tackling the rest of it. I will post all my solutions on Wednesday as usual, but (see note 2024-10-20 16:00) you should not look at my solutions to question 11 and on until after you have made an honest attempt at solving them yourself.
2024-10-20 16:00:
- One of the most powerful ways to find out whether you know something is to solve problems on it, and when you have given the problems your best effort, then you look at the solutions and grade yourself. It then becomes very clear where any gaps in your knowledge are, and you know what to go back and study again. This is why I give you my worksheets first, and then later give you the solutions, and why I post my detailed solutions to the assignments after you have had a chance to work on them. (I will post my solutions to the midterm after you have written it, for the same reason. You will be able to see your graded exam, so you should go through that with my solutions.)
- This means that the best way you have of studying for the midterm is, after reading through your lecture notes:
- go to tutorial and do the worksheets yourself, then look at my solutions and grade yourself.
- do the assignments yourself, and afterwards look at my solutions and see what you were missing.
- if you want more practice, look at the problems in PASIAS. Each problem appears twice in its chapter, the first time without answers, the second time with. Attempt the problem as it appears without answers, then read the same problem as it appears with answers and grade yourself.
- after you have graded yourself, go back to your lecture notes and re-read the things you are weak on (and, ideally, when you are ready, do another practice problem to see whether you understand it now).
- There are old midterms. Use these in the same way: write the ones without answers first, under exam conditions with the notes you intend to bring to the actual exam, and then grade yourself using my solutions. Be mindful that in some of the older exams I do things differently from how I do them now. If you are confused by any of those, ask on the Quercus discussion board. See also Brian Harrington’s reddit post on using old exams.
- If you don’t do these things (for example, you read through things with answers instead of taking the time to solve these problems yourself), you are setting yourself up to struggle in my exams, because solving my problems without knowing what the answers are is literally what you will need to do on the exams.
- All this applies in any STEM-like course where later material depends on earlier material. You learn by solving problems (yourself) like the ones you will see on the exam, and preferably enough of them that you are ready for anything that might appear on the exam. There is no shortcut to success apart from this.
2024-10-20 15:55: I had a couple of questions about read.csv last week after class. The key thing for you to know is that I did not teach this as a way of reading in a CSV file, so that in this course it is wrong and only read_csv with an underscore is correct. To have any hope of getting credit for using it, you need to say in your assignment where you got it from, and to say why it is better for the purpose than what I taught you. (It is too late to put that in an appeal.) It is also rather a signal that you got help from outside the course, and I warned you in the syllabus that this is unlikely to help you in the long run.
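For reference, a minimal sketch of the version with the underscore, assuming the tidyverse is installed. The little CSV file is made up on the spot so that the code runs by itself; my_file and d are names chosen only for illustration:

```r
library(tidyverse)

# Make a small CSV file to read (made-up data, for illustration only)
my_file <- tempfile(fileext = ".csv")
writeLines(c("x,y", "1,a", "2,b", "3,c"), my_file)

# read_csv, with an underscore, comes from the tidyverse and returns a tibble
d <- read_csv(my_file)
d
```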
2024-10-16 23:30: A late-night check-in:
- Assignment 3 is graded. Appeals by the same procedure as for Assignments 1 and 2, between Oct 19 and Oct 26 inclusive. Added 2024-10-21 12:45: Common errors: questions 1 and 5, use of read.csv (see note 2024-10-20 15:55); questions 4 and 9, failure to mention sample size and overcoming the skewness; questions 7 and 8, make sure to use greater.
- My solutions to Worksheet 6.
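On the "greater" point in the common errors above: a minimal sketch of how a one-sided alternative is specified in t.test, using made-up data and a made-up null mean of 10 (these are not the assignment's data or questions):

```r
# Made-up data, for illustration only
x <- c(12, 15, 11, 14, 13, 16, 12, 15)

# Default: two-sided alternative (the mean is different from 10)
two_sided <- t.test(x, mu = 10)

# One-sided: the alternative is that the mean is greater than 10
one_sided <- t.test(x, mu = 10, alternative = "greater")

two_sided$p.value
one_sided$p.value
```

Because the sample mean here is above 10, the one-sided P-value is half the two-sided one, so leaving out alternative changes the answer.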
2024-10-11 13:15: An early “Monday update”, since Monday is a holiday:
- Assignment 4 is due on Tuesday night.
- Assignment 3 marking is under way. My solutions.
- Assignment 5 will open on Wed night.
- Lectures next week, approximately:
- Tuesday: the last little bit of sign test, then normal quantile plots.
- Thursday: Mood’s median test
- Worksheet 6 for tutorial on Wednesday. You should be able to do the first question now, and the second one after (next) Tuesday’s lecture.
2024-10-09 22:30: my worksheet 5 solutions.
2024-10-09 15:30: I have taken a look at the appeals for Assignment 1. You should have a comment from me on Quercus, together with a possibly adjusted mark.
2024-10-09 12:30: about the midterm:
- The date, time, and place of the midterm are in an announcement on Quercus, edited today. I am not announcing these to people outside the class.
- The midterm is open-book. You can bring what you wish, such as your lecture notes, assignments, my slides, assignment solutions, etc, printed (no computers or other devices at the exam). You will need to organize whatever you bring, so that you can quickly find what you are looking for (and so there is an upper limit on what there is any point in bringing). If you are not well prepared, you can expect to run out of time; you will not have time to look everything up or go searching for things.
- The exam coverage is everything that is done in lecture up to and including Thursday Oct 24. This is currently scheduled to include up to Analysis of Variance, but that may change. If I haven’t talked about it in class by the end of that day’s lecture, it’s not on the exam.
- Expect to be writing code and explanations (for example, I might ask for code to do a task, or give you code or output and ask questions about it).
- At the exam, you will get an exam paper with spaces to write your answers, and you also get a booklet with numbered Figures to refer to during the exam. The exam will say things like “In Figure 10, what is…” and you will need to find Figure 10 in the other booklet.
- This is a Crowdmark exam, so it is best to use a pen or a sharp pencil, otherwise we may have trouble reading your answers. (Your exam will be scanned and then marked online, so if we cannot read the scan of your writing, we cannot give it points.) Only the front of the pages will be scanned, so do not write anything on the back of the pages, since no-one will see it. I add a blank page at the end of the exam which you can use for writing answers if you run out of space elsewhere.
- The exam will have some number of numbered questions worth 2 or 3 or 4 or so points each, as on the assignments. I will tell you later how many questions there are.
- If you have an exam accommodation, you will be writing with Accessibility, and should make arrangements with them.
- Any questions about exam procedures should go in Quercus discussions.
2024-10-07 11:00: Monday update:
- Assignment 3 is due tonight.
- Assignment 2 has been marked, and I will release the marks shortly. Appeals between Oct 10 and 17 inclusive by the procedure described at 2024-09-27 11:25.
- Assignment 4 will open on Wed night.
- Lectures this week, approximately:
- Tuesday: the rest of power of hypothesis tests
- Thursday: the sign test. After that is normal quantile plots.
- Worksheet 5 for tutorial on Wednesday (which should be doable after Tuesday’s lecture, and will prepare you for Assignment 4).
2024-10-02 18:45: dinner is on the way, and I fixed my solutions.
2024-10-02 18:10: a temporary version of my solutions to worksheet 4. I realized that I put the wrong plot in for question 6 (we haven’t seen the normal quantile plot in lecture yet), so will fix this, but I need to eat dinner first.
2024-09-30 11:10: Monday update:
- Assignment 2 is due tonight.
- Lectures:
- Tuesday: bootstrap sampling distribution of the sample mean (this will help us answer the question of “is our sample big enough for the Central Limit Theorem to help?”)
- Thursday: power of hypothesis tests (this may spill into next week)
- Tutorial: Worksheet 4
- Assignment 3, on the same material as Worksheet 4, will open on Wednesday night.
2024-09-27 11:25: Assignment 1 is now graded. I will release the marks shortly:
- My solutions
- If you wish to appeal your mark, you will need to follow the procedures given here precisely, in particular items 18–22 in section 3. In short, you need to find an error in the marking, and you need to explain clearly what that error is and how your work is in fact a correct answer to the question. Grader’s judgement is not an error, and not liking your mark is in no way grounds for an appeal. The time window for appeals on Assignment 1 is September 30 - October 7 inclusive. Appeals sent outside that window will not be considered.
2024-09-25 20:30: My solutions to worksheet 3. Assignment 2 should now be open.
2024-09-24 12:15: We have an official second section of STAD29, which will run Monday 1-3pm (one two-hour lecture a week, same as the other section). It should be big enough to accommodate all the students currently on the waitlist. If you are in section 1 currently and want to move to the new second section, that should be possible (which would allow space for students currently on the waitlist to join section 1, if that suits them better).
2024-09-23 11:50: Monday update:
- Assignment 1 due tonight.
- Lectures: I got to the end of “choosing things” last Thursday, so we have for this week:
- Tuesday: one-sample \(t\)
- Thursday: two-sample \(t\) and start of bootstrap for sample mean
- Tutorial: Worksheet 3 questions. Questions 18 and 19 relate to Tuesday’s lecture this week.
- Assignment 2 will open on Wednesday night.
2024-09-22 15:00: If you are having trouble accessing the assignment, your first port of call should be the student computer helpdesk to find out why that is (you are probably also going to have trouble handing in the assignment if you do not get this sorted out). For this week only, here are the questions.
2024-09-19 15:30: I just took a look through the Assignment 0’s. If you got a 1 and a “yep”, you are all good. Most people got both of those. If you handed in something and didn’t get a 1, or got a 1 and a comment, you need to figure out what happened. The commonest things missing were the embed-resources: true thing that meant that there were no graphs, and the df-print: paged thing that meant that there was no nice display of dataframes. See question 13 in worksheet 1; these go in the “YAML block” at the very top of your document. If you got a 0, you are welcome to try again and I will endeavour to take a look at your revised version. (I was pretty relaxed about what you included; if you had some code, some output and a graph, ideally with some text of your own, I was good with it.)
2024-09-19 10:35: two things:
- Assignment 1 is now open, due next Monday.
- September 16 was the last day to register yourself in a tutorial. If you haven’t done so yet, you will now need to go through the Registrar’s office to do this.
2024-09-19 00:15: before I go to bed, my solutions to Worksheet 2.
2024-09-18 12:00: We have a date for the midterm. I don’t want to share the time and place in public, so I am about to post an announcement on Quercus so that you can make plans.
2024-09-16 12:55: Assignment 1 is now on Quercus, and will open for you on Wed night. Note that it is available until Wed of next week, but it is due on Monday night, and there is a late penalty (1% per hour) if you submit it after 11:59pm Monday. If you have an official Accessibility accommodation that grants you an extension on assignments, it is up to you to claim it (by emailing me) before the official due date each time you need it. The maximum such extension is two days, since I will be publishing solutions for everyone after that. There are no other extensions or make-ups, as per the syllabus. You may hand in your assignment up to two days late, but the late penalty will apply. (You will recall from your reading of the syllabus that the worst two assignments are dropped; if life circumstances intervene and you cannot hand in an assignment before it closes, that will automatically count as one of your dropped assignments.)
2024-09-16 12:15: Monday update:
- lectures: the rest of numerical summaries (Tuesday), choosing things from dataframes (both days), one-sample \(t\) (start Thursday).
- tutorial: worksheet 2. Note that this week (and from here forward) this only has the questions. The idea is that by learning how to do the worksheet yourself (using your lecture notes and your brain), you are also learning how to do the assignment that depends on this worksheet (and thereby preparing yourself for any midterm questions on the material in this worksheet). After you have made an honest effort on this worksheet yourself, feel free to ask your tutorial TA for guidance if you get stuck. I will post my solutions on Wed evening.
- assignment 1 will be posted on Wednesday evening, after all the tutorials are done and I have posted my worksheet solutions, and will be due the following Monday night (at the traditional time of 11:59pm). The idea is that you do the assignment yourself, based on what you have learned from doing the worksheet (and from the material in your lecture notes and by using your brain); expect minimal help from me. You have practiced the submission procedure on worksheet 1 (“assignment 0”), so you should be able to figure that out for Assignment 1.
2024-09-11 18:00: to answer the question from lecture: if you’re reading in data from an Excel sheet, can you get hold of the names of the sheets in R, so that you can figure out which one contains the data you want? The answer turns out to be yes. To use the test2.xlsx from lecture:
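A minimal sketch of how this might look, assuming the readxl package. It uses excel_sheets to get the sheet names and read_excel to read one of them. The dataframe name my_data and the choice of the first sheet are placeholders for illustration, and the fallback to an example workbook that ships with readxl is there only so that the code runs even without test2.xlsx:

```r
library(readxl)

# Save the filename once, since it is used twice below.
# Assumption: test2.xlsx is in the current folder; if it is not, fall back
# to an example workbook that ships with readxl so that the code still runs.
book <- "test2.xlsx"
if (!file.exists(book)) {
  book <- readxl_example("datasets.xlsx")
}

# Names of the sheets in the workbook
sheets <- excel_sheets(book)
sheets

# Read one sheet by name; here the first, but use whichever name you want
my_data <- read_excel(book, sheet = sheets[1])
my_data
```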
Coding tip: I knew I was going to be referring to the spreadsheet file twice (once to get the names of the sheets in it, and once to access the sheet I wanted), so I saved the filename (in book, short for “workbook”) to avoid typing the filename twice. (I also gave the dataframe a name that says at least something about what it contains, even though we don’t know what x and y actually are.)