Instructor Guidance for Datahub and the Data Science and Machine Learning Platform


Core Service Functionality


The Data Science and Machine Learning Platform (DSMLP) provides a standard set of features for instructional use, including web and command-line access to Jupyter notebook servers, GPU access, student home directories and storage, and access to large datasets. Learn more about DSMLP.

For more information about the default course containers available for this platform, see "An overview of Standard Datahub/DSMLP Containers maintained by Educational Technology Services."

Scope of Support

Critical Dates

Service Level Objectives and Roles/Responsibilities


IT Services' Responsibilities

Instructor/TA Responsibilities

Service Timeline


How-To's for Instructors and TAs


Add Custom Packages to a Standard Course Container

Add TAs or Observers to a Datahub/DSMLP Course

  1. Add Teaching Assistants (TAs) and Other Users to a Canvas Course
  2. If the user is not a UC San Diego student (for example, departmental staff or a co-instructor), please email datahub@ucsd.edu (or follow up in your support.ucsd.edu course ticket) to request that ITS staff add them to the Datahub course roster.

Add Groups to a Datahub/DSMLP Course

Student Disk Space Quotas

Large Datasets Shared By Multiple Courses

Course Datasets and Files

The public directory is intended to share datasets and files between all users of the course. All users can read from this directory. The grader has write permission to this directory.

To put files into the public directory, log in to datahub.ucsd.edu as the grader. Then select the public directory and upload files via the UI.

After the files are uploaded, check their permissions to make sure they are writable only by the grader. To fix incorrect permissions, open the terminal and run the chmod command. This removes the write (w) flag for group and other users and adds the execute (X) flag to directories so they are visible.

chmod -R u+rwX,go+rXs,go-w ~/public
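
To confirm the result, a check along the following lines may help. This is a minimal sketch, not an ITS-provided tool: the function name list_group_other_writable is hypothetical, and it assumes a POSIX-style find is available in the terminal. It prints every file or directory under the given path (by default ~/public) that group or other users can still write to.

```shell
# Hypothetical helper (not part of Datahub/DSMLP): list anything under a
# directory that group or other users can write to.
# Usage: list_group_other_writable [DIRECTORY]   (defaults to ~/public)
list_group_other_writable() {
    dir="${1:-$HOME/public}"
    # Skip symbolic links, whose own mode bits are not meaningful here.
    # -perm -020 matches group-writable; -perm -002 matches other-writable.
    find "$dir" ! -type l \( -perm -020 -o -perm -002 \) -print
}
```

Running list_group_other_writable with no arguments after the chmod command should print nothing if only the grader retains write permission.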

Naming assignments

Removing old "nbgrader" courses from "Assignments" dropdown

  1. Open a new Terminal
    1. New → Terminal
  2. Navigate to the "nbgrader_cache" directory
    1. Enter command: cd ~/.local/share/jupyter/nbgrader_cache
  3. List all files in the directory
    1. Enter command: ls -al
  4. Delete the old directories
    1. Enter command: rm -r directory_name
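
The steps above can also be wrapped in a small shell function. This is only a sketch under stated assumptions: the function name remove_cached_course is hypothetical, the default cache path is the one given in the steps above, and the optional second argument exists only so the function can be tried against a different directory. It refuses empty names and names containing a slash so that a typo cannot delete something outside the cache.

```shell
# Hypothetical helper (not part of nbgrader or Datahub): remove one cached
# course directory from the nbgrader cache.
# Usage: remove_cached_course DIRECTORY_NAME [CACHE_DIR]
remove_cached_course() {
    name="$1"
    cache="${2:-$HOME/.local/share/jupyter/nbgrader_cache}"
    # Reject empty names, ".", "..", and anything containing a slash.
    case "$name" in
        ""|.|..|*/*)
            echo "refusing to remove '$name'" >&2
            return 1
            ;;
    esac
    if [ -d "$cache/$name" ]; then
        rm -r -- "$cache/$name"
    else
        echo "no such directory: $cache/$name" >&2
        return 1
    fi
}
```

For example, after listing the cache with ls -al, calling remove_cached_course directory_name would delete that one directory and leave the others in place.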

Service Resource Limitations


Resource exhaustion (insufficient GPUs/CPUs) can occur during periods of heavy usage. If eviction of a user's server is required, they will be notified 10 minutes before termination and instructed to save their data.

Scheduled Weekly Patching and Potential Downtime


Datahub/DSMLP Weekly Maintenance Window: Tuesdays, 6 a.m. - 8 a.m. (Pacific)

The worldwide increase in ransomware and similar cybersecurity attacks has prompted UC San Diego to adopt strict policies regarding updating/patching of campus servers. To minimize interruption to your students' work, we've selected Tuesdays from 6 a.m. - 8 a.m. as a weekly maintenance period.

We recognize the impact of any disruption during 10th week and Finals week, and will attempt to defer patching during these critical instructional periods unless the nature of the vulnerability requires immediate action.

Finally, because campus policy requires that "Critical" severity threats be addressed within 24 hours of discovery, we may be forced to patch outside of the Tuesday 6 a.m. - 8 a.m. window. In such cases, we will provide advance warning to instructors and TAs to the extent possible.

Troubleshooting


If you have further questions or concerns, submit a ticket or email us at datahub@ucsd.edu.