Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Harvard University

Data Science: Productivity Tools

Harvard University via edX


A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging.

Part of our Professional Certificate Program in Data Science, this course explains how to use Unix/Linux as a tool for managing files and directories on your computer and how to keep the file system organized. You will be introduced to the version control systems git, a powerful tool for keeping track of changes in your scripts and reports. We also introduce you to GitHub and demonstrate how you can use this service to keep your work in a repository that facilitates collaborations.

Finally, you will learn to write reports in R markdown which permits you to incorporate text and code into a document. We'll put it all together using the powerful integrated desktop environment RStudio.

Taught by

Rafael Irizarry


5.0 rating, based on 1 Class Central review

4.3 rating at edX based on 21 ratings

Start your review of Data Science: Productivity Tools

  • Profile image for Luiz Cunha
    Luiz Cunha
    Very easy but usefull review of the Data Science Software Tools: Rstudio, Git, GitHub.
    I found the Git/GitHub sections very useful: after having struggled a long time using this tool due to rather unclear tutorials, this is the first concise but very clear explanations of the workings of Git/GitHub and how to properly use it.

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.