Intro to ARC series: “Introduction to DASK”

This course is part of SHARCNET’s ongoing “Introduction to Advanced Research Computing” series of online courses for 2021-2022. A Compute Canada account is required to enroll. To register for any of the courses:

• Follow this link:
• Click the Log in link at the top right-hand side
• Log in with your Compute Canada login and password
• Click Site Home in the left-hand side menu
• Click 2021-2022 Introduction to Advanced Research Computing (ARC)
• Browse the list of (currently available) courses and enroll in the ones you are interested in
• To enroll in a course, click the course name and then click that course’s enroll button

Course Syllabus:

Common Python libraries for data analytics, such as NumPy, pandas, and scikit-learn, usually work well when the dataset fits into the RAM of a single machine. With larger datasets, however, working around such memory constraints can be a significant challenge. This is where Dask can help. Dask provides a framework and libraries that can handle large datasets on a single multi-core machine or on a cluster.

This course provides an introduction to Dask.
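
As a rough illustration of the idea described in the syllabus, the sketch below uses Dask's array interface to split one array into chunks and reduce it chunk by chunk. It is not taken from the course material; the array shape and chunk size are arbitrary values chosen for illustration, and it assumes the dask package (with NumPy) is installed.

```python
import dask.array as da

# Build a lazy 4000 x 4000 array of ones, split into 1000 x 1000 chunks.
# The sizes are illustrative assumptions, not values from the course.
x = da.ones((4000, 4000), chunks=(1000, 1000))

# This only records the work to be done as a task graph;
# no chunk has been created or summed yet.
total = x.sum()

# compute() executes the graph, processing chunks (possibly in parallel)
# and combining the partial sums into a single number.
result = total.compute()
print(result)
```

Because each chunk is processed separately, the full array never has to be held in memory as a single block, which is what makes the same pattern usable for data larger than RAM.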
