Overview of Topics and Course Objectives

This course builds on the R/tidyverse programming skills developed in DATA-412/612 for the collection, organization, analysis, interpretation, and presentation of data. Topics include version control, web scraping, date manipulation, vectorized operations, web application development with R Shiny, R package development, and introductions to SQL and python.

Learning Objectives:

Materials

Course Material

Almost all course material will be posted and accessed through GitHub. git is a widely used version control system for collaborating, deploying, and managing data science projects. git works by saving snapshots, or “commits”, of your projects. These snapshots are always retrievable, even if you make further changes to your code/notes/reports. You can also write small descriptions of what you have accomplished since the last time you committed. GitHub is a website that hosts git repositories. We will spend the first week or two covering git and the command line. Following that, all assignments will be accessed and submitted through git.

Graded Work

Assignments

  • Most homework assignments will be posted and returned on GitHub.
    • The best practice for a version control system is to commit frequently with informative commit messages. Thus, this will be formal part of your homework grade.
    • I expect frequent commits. At the minimum, I expect you to commit after you have completed each question.
    • I expect informative messages for each commit.
    • Example good message: “Completes Question 1.2 that required tidying the college scorecard data.”
    • Example bad message: “More stuff”
    • Lack of frequent and informative commits will result in up to a 20% reduction in an assignment grade.
  • Because git allows me to view your progress on an assignment, I will sometimes accept a late assignment if I see some progress and a consistent commit history in that assignment. This should only be used under extraordinary circumstances. If I do not see any progress in an assignment, I will not accept a late submission.
  • To account for life circumstances that pop up, I will also drop the lowest homework assignment grade.

Quizzes

  • We will have four closed book, closed computer, closed notes, closed calculator quizzes for a total of 40% of the grade (about 10% each).

  • My intention is to make these straightforward (no trick questions). But you will still have to know the material to do well on them.

  • I will provide practice questions to study from.

  • I plan on each quiz taking about 30 minutes.

Group Project

All students will prepare a final project using the tools learned in the class. This project will be completed in groups of 2-3 students. Work with me to get your project topic approved. Tentatively, your project will involve creating an R package.

Assignments and Grading

Usual grade cutoffs will be used:

Grade Lower Upper
A 93 100
A- 90 92
B+ 88 89
B 83 87
B- 80 82
C+ 78 79
C 73 77
C- 70 72
D 60 69
F 0 59

Individual assignments will not be curved. However, at the discretion of the instructor, the overall course grade at the end of the semester may be curved.

Late Work Policy

All assignments must be submitted, in class, on the day they are due. You will be penalized 15% every day an assignment is late. I will not accept assignments submitted over three days after the deadline. If you become ill or the victim of an emergency, please let me know within 48 hours.

Students requiring a temporary leave of absence for medical or mental health reasons must provide documentation to the Office of the Dean of Students (), which will verify with the academic unit that the documentation is appropriate and supports the leave. Students with an ASAC-approved accommodation for disability reasons should, to the greatest extent possible, make arrangements in advance of the due date or deadline.

Generative AI Policy

Official Policy:

  • Generative AI will be allowed for homeworks and the final project.

  • If you use generative AI you must include an appendix with your prompts and resulting output. You can alternatively use a share link (e.g., ChatGPT has this feature).

  • This can either be by printing the webpage to PDF, or by copying and pasting the prompts and output to a Word file that you then convert to a PDF.

  • If you use generative AI, then failure to include a generative AI appendix or share link will be considered a violation of the academic integrity code.

  • I still expect you to be able to explain your work on assignments/projects and your rationale. Based on your explanation (or lack thereof), I may modify your grade.

  • I mostly want to see how students use generative AI and see whether I need to adapt my teaching. So please take this seriously.

My Rant

  • Generative AI (like ChatGPT) gives the most accurate solutions for simple problems — that is, problems you do in college.

  • Generative AI gives the least accurate solutions for hard problems — that is, problems you do in work.

  • But you need to know the simple stuff (stuff in college) in order to be able to learn the hard stuff (stuff in work).

  • So if you use generative AI for college, I think you are cheating yourself.

  • Data Science and Statistics are also special fields in that a degree will only get you an interview, it won’t get you a job.

  • Companies want to make sure you know the simple stuff (stuff in college) so that they can be sure you can learn the hard stuff (stuff in work).

  • So if you rely on generative AI, you won’t learn the foundational skills and you will have trouble finding a job.

  • Thus, I will just have more in-class assessment and lower the weight of homeworks.

  • We will have three quizzes that will be closed notes/books/computer/calculator. No reference sheet will be allowed.

  • If you rely too much on generative AI, you will likely fail those quizzes. Homeworks are the best practice for exams.

Academic Integrity Code

Important Dates

* Tentative, subject to change.

Incomplete Policy

At the discretion of the faculty member and before the end of the semester, the grade of I (Incomplete) may be given to a student who, because of extenuating circumstances, is unable to complete the course during the semester. The grade of Incomplete may be given only if the student is receiving a passing grade for the coursework completed. Students on academic probation may not receive an Incomplete. The instructor must provide in writing to the student the conditions, which are described below, for satisfying the Incomplete and must enter those same conditions when posting the grades for the course. The student is responsible for verifying that the conditions were entered correctly.

Conditions for satisfying the Incomplete must include what work needs to be completed, by when the work must be completed, and what the course grade will be if the student fails to complete that work. At the latest, any outstanding coursework must be completed before the end of the following semester, absent an agreement to the contrary. Instructors will submit the grade of I and the aforementioned conditions to the Office of the University Registrar when submitting all other final grades for the course. If the student does not meet the conditions, the Office of the University Registrar will assign the default grade automatically.

The Associate Dean of the Academic Unit, with the concurrence of the instructor, may grant an extension beyond the agreed deadline, but only in extraordinary circumstances. Incomplete courses may not be retroactively dropped. An Incomplete may not stand as a permanent grade and must be resolved before a degree can be awarded.

More information on AU Regulations and Policies.

Sharing Course Content:

Students are not permitted to make visual or audio recordings (including livestreams) of lectures or any class-related content or use any type of recording device unless prior permission from the instructor is obtained and there are no objections from any student in the class. If permission is granted, only students registered in the course may use or share recordings and any electronic copies of course materials (e.g., PowerPoints, formulas, lecture notes, and any discussions – online or otherwise). Use is limited to educational purposes even after the end of the course. Exceptions will be made for students who present a signed Letter of Accommodation from the Academic Support and Access Center. Further details are available from the ASAC website.

Syllabus Change Policy

This syllabus is a guide for the course and is subject to change with advanced notice. These changes may come via email. Make sure to check your university-supplied email regularly. You are accountable for all such communications.