Eck Sports Lab

Logo

View My GitHub Profile

Course Syllabus

Greetings

I am Daniel J. Eck and I will be the instructor for STAT 430: Baseball Analytics. Baseball research is a primary interest of mine. I am interested in developing useful statistical methodology for performance evaluation and am interested in the statistical history of baseball. Several of the examples and reading materials in this course will come from projects that I have worked on or that I find interesting and useful.

Email: dje13@illinois.edu

<BR CLEAR=”left” /> <BR CLEAR=”left” />

Course time: Wednesdays from 11 am to 1:50 pm

Course location: 140 Henry Admin Building

Office hours: Instructor office hours are on Thursdays, 10 - 11 am.

TA: Shen Yan (shenyan3@illinois.edu). TA office hours are on Fridays, 10am - 12pm.

Course website: https://github.com/stat430fa22/stat430materials

My course website is a GitHub repository. You can click here to access this website. Note that this Syllabus is a Markdown document. Open the .md file to view the basic Markdown syntax used to build your Syllabus.


Course Materials

The course materials will largely consist of instructor notes, papers, articles, and software. The course will also use the textbook Analyzing Baseball Data with R (second edition)

Papers, Articles, and Instructor Notes

Instructor notes will be posted and updated on GitHub. Relevant papers and articles will either be provided on GitHub or be referenced in course notes.

Software Used in the Course (all free)


Course Information

This is a reading, seminar, and project based course on the intersection of baseball and statistics. In this course you will learn how to conduct relevant data analyses with a focus on how to quantify aspects of baseball play associated with winning games. You will also learn about the statistical history of baseball with an emphasis on comparing players across eras. Founding principles as well as advanced statistical methods for both directions will be discussed. The analyses that you conduct will also develop your critical thinking skills as a statistician. Furthermore, practical advantages, limitations, and comparisons of methods will be discussed. If you are interested in quantifying how good Mike Trout is or in ranking the careers of Barry Bonds, Willie Mays, and Babe Ruth, then this is the course for you.

Prerequisites: STAT 385 or equivalent experience with R, STAT 425 or equivalent experience. Familiarity with Git and GitHub is helpful, but not required.

Student Learning Outcomes

Upon successful completion of this course students will be able to conduct methodologically strong data analyses that can answer questions of scientific interest, specifically the students will:

Getting Help in STAT 430

You have many options to get help for this class:


Grading Breakdown

category notes points
Attendance 10 points each course; two free misses 130 points
Labs 100 points each 500 points
Final Project 50 points for approved project; 70 points for presentation (recorded summary); 250 for project materials and writeup 370 points
total   1000 points

I will be using a +/- grading schema. The grade distributions will be:

Lower bound Upper bound Letter grade
980 1000 A+
933 979 A
900 932 A-
867 899 B+
833 866 B
800 832 B-
and so on    

Attendance and Course Content

Attendance will be a course requirement. There will be 15 lectures and you will receive 10 points for each lecture you attend, up to 130 points. Thus you are allowed to miss up to 2 lectures with no penalty. There will be a signup sheet that is distributed at the beginning of class, you need to sign your name to verify your presence in class. Let your instructor know in advance if you cannot attend any class.

Labs

There are 5 labs total. When completing the assignment, read it carefully, and follow the directions. These labs can be completed in groups of 2-3 students. Every group member will submit their own lab report. Make sure to list your collaborators in your lab report.

For each lab assignment, you will submit two files - .Rmd and .html (or .pdf) - saving your files with your name and lab assignment number. The lab naming convention is netid_lab#. For a student with netid abc123 that is submitting lab4 files, their files would be saved as abc123_lab4.Rmd and abc123_lab4.html (or abc123_lab4.pdf). Failure to adopt this lab naming convention will result in point deductions and headaches. Make sure your lab is professional and reproducible containing only relevant derivations, code, results, and explanations. Questions about the grading should be directed to the TA.

Labs should be stored in a lab# directory in your GitHub repo that is a sub directory of a labs directory in your GitHub repo. For example, your fourth homework assignment should be saved in the directory

labs/lab4/

within your personal GitHub repo. Failure to do this will result in point deductions.

Late lab submissions will be accepted with a penalty. There will be a 20 point deduction if a lab is submitted 48 hours after the deadline. Labs submitted later than 48 hours after the deadline will not be considered.

Students must use GitHub to start, finish, and submit their labs. More details will be discussed in class.

Labs will be worth 100 points unless otherwise noted.

Project

This course will have a final project instead of an exam. This will be a group project comprised of 2-3 students. Final projects for this course will be fairly open-ended and not subject to a one-size-fits all standard. That being said, all projects need instructor approval.

Details and project rubrics are TBD.

Possible projects include:

Exams

There will be no exams!


Schedule (subject to change)