1 What is GitHub?

GitHub is a git repository hosting service and web-based, collaborative version-control platform for developers. While GitHub offers a suite of features, in this course we will primarily use GitHub for cloud-based hosting of our coding projects. For this exercise, we will learn how to:

  • Create a Github account
  • Create a blank repository
  • Integrate with RStudio
  • Create and commit changes through the RStudio Git tab
  • View Changes on GitHub
  • Create a project webpage in GitHub

GitHub is generally used for version control, working on collaborative projects, creating websites, developing and sharing teaching tools, software development, and code development, but is generally not good for storing or manipulating large datasets.

1.1 Create GitHub Account

To begin, you will need to create an account on GitHub.com. This will allow you to create, fork, share, collaborate, etc. on projects.

github signup

1.2 Create Repository

Once you have created an account you can navigate to repositories and click on the green button for a new repository.

repository bar

Create a name for the repository, for this example we will use GitHub-Basics, provide a description, be sure to make the repository public, uncheck the “Initialize this repository with a README” (for this example; it might be necessary to leave this checked in the future), add a .gitignore file using the R filter in the drop-down menu, and click create repository.

2 Integration with RStudio

Rstudio has integrated version control and allows you to clone a project from any Git repository. For this example we will use the link obtained in the above step to integrate local changes with the GitHub repository we created.

While the following steps will use the GUI interface in RStudio, the majority of these steps can also be completed at the command line.

2.1 New Project

To begin we will create a new project using the File > New Project command. In previous examples the New Directory option was used, but in this exercise we will instead use the Version Control option.

RStudio Version Control

2.1.1 Version Control

This option will provide us the ability to choose between Git and Apache Subversion. We will follow the option for using Git as we learned in the previous exercise.

Version Control

2.1.2 Git for Versioning in RStudio

The version control option will allow you to obtain a project from a version control repository such as GitHub.

If you want to bring an already developed repository into RStudio in a blank project, from the terminal you can use the git command git clone <GitHub Link> in the folder you wish to store the repository and then save as a new project.

2.1.3 Create Project

Before you finalize your project, you will be prompted to including the following information:

Clone Repository

  • Repository URL
    • Here you will paste the link you copied from GitHub: http://github.com/username/repository
  • Project Directory Name
    • This is the folder that will be created to organize the project.
      • Tip: Making this the same as the repository name will help keep you organized between the local and online environment
  • Subdirectory
    • This is the local root directory where the project folder will reside. For example, if you want the project folder to be housed within the Ex-8 folder on the Z drive you would browse to: z:/students/name/Ex-8 and then click create project.

2.2 View Project in RStudio

This will open the typical RStudio Project window you are used to seeing including the project information in the upper-right corner, project files in the files tile, etc. but you will also now see a new Git tab alongside the “Environment-History-Connections” section. It is in this tab that we will control the integration for our local project and online repository.

Project Window

2.2.1 Git Tab

In the Git tab you will notice several options including:

  • Diff which details the difference between versions of a file.
  • Commit which allows us to initial a commit similar to the git commit command in the previous exercise.
  • Blue down arrow for a pull command
  • Green up arrow for a push command
  • Clock to view a history of the commits
  • Gear that allows you to revert to previous stages
  • New branch icon
  • Branch information showing which branch is currently being edited
  • Refresh icon

2.2.1.1 Staged Files

There are two ways to stage files in RStudio. The first is directly from the Git tab. By click on the button in the “Staged” column, you will see the icon in the “Status” column change. The second is by clicking the “commit button” and staging the files from there. Because of the simple interface, files can easily be staged/unstaged from either location.

You could also complete many of these steps via command line from the terminal window.

2.2.1.2 Color-coded Icons

In RStudio as you stage files their icon will change to match their current status. You are most likely to see the following information:

  • Yellow box for newly created/untracked files
  • Blue box for files that have been modified
  • Green box for files that have been added/staged and are ready for a commit
  • Red box for files that have been deleted

Once you have the files appropriately staged, you are ready to commit them to your repository.

2.2.2 Commit and Push to GitHub

In the commit window you can stage files, view differences in the changed files, add a commit message, and push your changes to the remote repository.

It is important to always include a commit message each time you commit files to your repository. As we have seen in the previous exercise, this helps to keep track of each stage of development and informs collaborators of changes made in each update. Because RStudio provides a text box for commits, there is ample room for an appropriate commit message. Check out these suggestions about composing better commit messages: https://chris.beams.io/posts/git-commit/

After staging your files and writing a useful commit message your next step will be to commit that information. Remember even though you have your local and GitHub repositories linked, you are only making this commit to the local repository. In order to sync the two repositories you must Push the changes to GitHub.

2.3 View Changes on GitHub

Once you have completed these steps you can navigate back to your GitHub repository and view the information you have just pushed to the repository.

Updated GitHub Page

2.4 Create a GitHub.io webpage

One excellent feature of GitHub is the ability to create webpages out of content within your repository. This is an extremely easy process and at this point each of the exercises you have completed in the course can be turned into a webpage with very little effort. In the setting menu of your repository, scroll down to the section for GitHub Pages and in the “Source” portion, select master branch as the source.

github settings

This will enable a webpage for your repository that utilizes a consistent url theme:

http://username.github.io/repository name

In order to create a webpage using GitHub Pages, simply create a copy of your *.nb.html and call it index.html. Once the index.html file is created, push it to your master repository and the website will automatically be created.

It may take several minutes before the site appears on your github.io page for the first time.

3 Your Turn!

While there are a number of features an options available in GitHub, here we covered only the basics of creating and maintaining a repository and enabling GitHub Pages. At the end of the semester your final project will be to develop materials based on your thesis research. So for this exercise I want you to create a repository and GitHub webpage for your thesis work. You should create a README.md file for your project that includes:

  • Project Title
  • Project Abstract
  • Link to the Github.io page
  • Repository Description
  • References

This repository and webpage should be periodically updated throughout the remainder of the semester culminating in your final project. You should take a look at the various options and features (such as themes, notifications, social previews, wikis, projects, etc.) available in GitHub Octocat to create a useful repository for sharing and maintaining your project.

3.1 Additional Help

Colt Steele again came through with a new video after presenting this exercise in class. This GitHub tutorial, called Learn Github in 20 Minutes, walks you through many of the steps we discussed in class and covers topics that are slightly more advanced that you might need in the future.

---
title: GitHub Basics <br><small>Advanced Data Analytics</small></br>
author: BIOL 5700, Fall 2019
output:
  html_notebook:
    df_print: paged
    rows.print: 10
    theme: cosmo
    highlight: breezedark
    number_sections: yes
    toc: true
    toc_float:
      collapsed: false
      smooth_scroll: true
  pdf_document: default
editor_options: 
  chunk_output_type: inline
---
<style type="text/css">

h1.title {
  font-size: 40px;
  font-family: "Times New Roman", Times, serif;
  color: DarkBlue;
  text-align: center;
}
h4.author { /* Header 4 - and the author and data headers use this too  */
  font-size: 20px;
  font-family: "Times New Roman", Times, serif;
  color: DarkBlue;
  text-align: center;
}
</style>

# What is GitHub?
**GitHub** is a git repository hosting service and web-based, collaborative version-control platform for developers. While **GitHub** offers a suite of features, in this course we will primarily use **GitHub** for cloud-based hosting of our coding projects. For this exercise, we will learn how to:

- Create a <a href="https://github.com/join" target="_blank"> Github</a> account
- Create a blank repository
- Integrate with RStudio
- Create and commit changes through the RStudio Git tab
- View Changes on GitHub
- Create a project webpage in GitHub

<p>GitHub is generally used for version control, working on collaborative projects, creating websites, developing and sharing teaching tools, software development, and code development, but is generally not good for storing or manipulating large datasets.</p>

## Create GitHub Account
To begin, you will need to create an account on <a href="https://github.com/join" target="_blank"> GitHub.com</a>. This will allow you to create, fork, share, collaborate, etc. on projects.

<p align="center">

![github signup](./Images/GitHub-SignUp.PNG "GitHub Sign-Up Window")
</p>

## Create Repository
Once you have created an account you can navigate to repositories and click on the green button for a <a href="https://github.com/new" target="_blank"> new repository</a>.

<p align="center">

![repository bar](./Images/repo-bar.PNG "GitHub Repository Bar")
</p>

Create a name for the repository, for this example we will use *GitHub-Basics*, provide a description, be sure to make the repository public, uncheck the "Initialize this repository with a README" (for this example; it might be necessary to leave this checked in the future), add a *.gitignore* file using the **R** filter in the drop-down menu, and click *create repository*.

### Copy Repository Link
Once you create your new repository you should be taken to the home page for the repo. 

<p align="center">

![Copy Link and Clone Option](./Images/copy-link.PNG "Clone and Download Link")
</p>

This link will be used in **RStudio** to connect your project to the *GitHub* repository. At this point you can move to **RStudio** but keep the **GitHub** window open so you can refer back to see changes you have made during this example. 

# Integration with RStudio

**Rstudio** has integrated version control and allows you to clone a project from any *Git* repository. For this example we will use the link obtained in the above step to integrate local changes with the **GitHub** repository we created. 

> While the following steps will use the GUI interface in **RStudio**, the majority of these steps can also be completed at the command line.

## New Project
To begin we will create a new project using the *File > New Project* command. In previous examples the *New Directory* option was used, but in this exercise we will instead use the *Version Control* option. 

<p align="center">

![RStudio Version Control](./Images/RS-New-Project.png "New Project Window"){width=45%}
</p>

### Version Control
This option will provide us the ability to choose between **Git** and *Apache Subversion*. We will follow the option for using **Git** as we learned in the previous exercise.

<p align="center">

![Version Control](./Images/RS-Version-Control.png "Version Control Window"){width=45%}
</p>

### Git for Versioning in RStudio
The *version control* option will allow you to obtain a project from a version control repository such as **GitHub**.

> If you want to bring an already developed repository into **RStudio** in a blank project, from the terminal you can use the git command ```git clone <GitHub Link>``` in the folder you wish to store the repository and then save as a new project.

### Create Project
Before you finalize your project, you will be prompted to including the following information:

<p align="center">

![Clone Repository](./Images/RS-Clone-Repo.png "Clone Repository Window"){width=45%}
</p>

- Repository URL
  - Here you will paste the link you copied from **GitHub**: ```http://github.com/username/repository```
- Project Directory Name
  - This is the folder that will be created to organize the project.
    - Tip: Making this the same as the repository name will help keep you organized between the local and online environment
- Subdirectory
  - This is the local root directory where the project folder will reside. For example, if you want the project folder to be housed within the *Ex-8* folder on the Z drive you would browse to: z:/students/name/Ex-8 and then click *create project*.

## View Project in RStudio
This will open the typical RStudio Project window you are used to seeing including the project information in the upper-right corner, project files in the files tile, etc. but you will also now see a new **Git** tab alongside the "Environment-History-Connections" section. It is in this tab that we will control the integration for our local project and online repository.

<p align="center">

![Project Window](./Images/RS-Project-Window.png "Project Window")
</p>

### Git Tab
In the *Git* tab you will notice several options including:

- Diff <img src="./Images/diff_button.png" title="Diff Selected File(s)"/> which details the difference between versions of a file. 
- Commit <img src="./Images/commit_button.png" title="Commit pending changes"/> which allows us to initial a commit similar to the ```git commit``` command in the previous exercise.
- Blue down arrow <img src="./Images/pull_button.png"/> for a pull command
- Green up arrow <img src="./Images/push_button.png"/> for a push command
- Clock <img src="https://github.githubassets.com/images/icons/emoji/unicode/1f552.png" title="View history of previous commits" width=15 height=15/> to view a history of the commits
- Gear <img src="https://github.githubassets.com/images/icons/emoji/unicode/2699.png" width=15 height=15/> that allows you to revert to previous stages
- New branch icon <img src="./Images/new_branch.png"/>
- Branch information <img src="./Images/branch_select.png" title="switch branch"/> showing which branch is currently being edited
- Refresh icon <img src="./Images/refresh_button.png"/>

#### Staged Files
There are two ways to stage files in **RStudio**. The first is directly from the *Git* tab. By click on the button in the "Staged" column, you will see the icon in the "Status" column change. The second is by clicking the "commit button" and staging the files from there. Because of the simple interface, files can easily be staged/unstaged from either location.

> You could also complete many of these steps via command line from the terminal window.

#### Color-coded Icons
In **RStudio** as you stage files their icon will change to match their current status. You are most likely to see the following information:

- Yellow box <img src="./Images/new.png" width=15 height=15/> for newly created/untracked files
- Blue box  <img src="./Images/modified.png" width=15 height=15/> for files that have been modified
- Green box <img src="./Images/added.png" width=15 height=15/> for files that have been added/staged and are ready for a commit
- Red box <img src="./Images/deleted.png" width=15 height=15/> for files that have been deleted

Once you have the files appropriately staged, you are ready to commit them to your repository.

### Commit and Push to GitHub
In the commit window you can stage files, view differences in the changed files, add a commit message, and push your changes to the remote repository.

It is important to always include a commit message each time you commit files to your repository. As we have seen in the previous exercise, this helps to keep track of each stage of development and informs collaborators of changes made in each update. Because **RStudio** provides a text box for commits, there is ample room for an appropriate commit message. Check out these suggestions about composing better commit messages: https://chris.beams.io/posts/git-commit/

After staging your files and writing a useful commit message your next step will be to commit that information. Remember even though you have your local and **GitHub** repositories linked, you are only making this commit to the local repository. In order to sync the two repositories you must <img src="./Images/push_button.png"/> Push the changes to **GitHub**.

## View Changes on GitHub
Once you have completed these steps you can navigate back to your **GitHub** repository and view the information you have just pushed to the repository.

<p align="center">

![Updated GitHub Page](./Images/GitHub-updates.png "Viewing Updates on GitHub")
</p>

## Create a GitHub.io webpage
One excellent feature of **GitHub** is the ability to create webpages out of content within your repository. This is an extremely easy process and at this point each of the exercises you have completed in the course can be turned into a webpage with very little effort. In the setting menu of your repository, scroll down to the section for *GitHub Pages* and in the "Source" portion, select master branch as the source. 

<p align="center">

![github settings](./Images/github-settings-io.png "GitHub Settings for Pages"){width=65%}
</p>

This will enable a webpage for your repository that utilizes a consistent url theme: 

http://**username**.github.io/**repository name** 

In order to create a webpage using *GitHub Pages*, simply create a copy of your  \*.nb.html and call it **index.html**. Once the index.html file is created, push it to your master repository and the website will automatically be created.

> It may take several minutes before the site appears on your *github.io* page for the first time. 

# Your Turn! 

While there are a number of features an options available in **GitHub**, here we covered only the basics of creating and maintaining a repository and enabling *GitHub Pages*. At the end of the semester your final project will be to develop materials based on your thesis research. So for this exercise I want you to create a repository and GitHub webpage for your thesis work. You should create a README.md file for your project that includes:

- Project Title
- Project Abstract
- Link to the Github.io page
- Repository Description
- References

This repository and webpage should be periodically updated throughout the remainder of the semester culminating in your final project. You should take a look at the various options and features (such as themes, notifications, social previews, wikis, projects, etc.) available in <img src="https://github.githubassets.com/images/icons/emoji/octocat.png" alt="GitHub Octocat" title="GitHub Octocat" width=15 heigh=15/> to create a useful repository for sharing and maintaining your project. 

## Additional Help

[Colt Steele](https://www.youtube.com/channel/UCrqAGUPPMOdo0jfQ6grikZw) again came through with a new video after presenting this exercise in class. This **GitHub** tutorial, called [Learn Github in 20 Minutes](https://www.youtube.com/watch?v=nhNq2kIvi9s), walks you through many of the steps we discussed in class and covers topics that are slightly more advanced that you might need in the future.