{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "## N Ways to GPU Programming - MD\n", "\n", "## Learning objectives\n", "With the release of CUDA in 2007, different approaches to programming GPUs have evolved. Each approach has its own advantages and disadvantages. By the end of this bootcamp session, students will have a broader perspective on GPU programming approaches to help them select a programming model that better fits their applications' needs and constraints. The bootcamp will teach how to accelerate a popular algorithm of Radial Distribution Function (RDF) using the following methods:\n", "* C++ Standard: pSTL\n", "* Directives: OpenACC, OpenMP\n", "* Frameworks: Kokkos\n", "* Lower level C Construct: CUDA C\n", "\n", "Let's start with testing the CUDA Driver and GPU you are running the code on in this lab:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!nvidia-smi" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "**IMPORTANT**: Before we start please download the input file needed for this application from the [Google drive](https://drive.google.com/drive/folders/1aQ_MFyrjBIDMhCczse0S2GQ36MlR6Q_s?usp=sharing) and upload it to the input folder. From the top menu, click on *File*, and *Open* and navigate to `C/source_code/input` directory and copy paste the downloaded input file (`alk.traj.dcd`).\n", "\n", "\n", "### Tutorial Outline\n", "\n", "We will be working on porting a radial distribution function (RDF) to GPUs. Please read the [RDF Overview](C/jupyter_notebook/serial/rdf_overview.ipynb) to get familiar with how this application works.\n", "\n", "Below is the list of GPU programming approaches we will be covering during this course, click on the link below to start exploring:\n", "\n", "- [Introduction to Profiling](../../profiler/English/jupyter_notebook/profiling-c.ipynb)\n", " - Overview of Nsight profiler tools\n", " - Introduction to Nsight Systems\n", " - How to use NVTX APIs\n", " - Introduction to Nsight Compute\n", " - Optimization Steps to parallel programming \n", "1. [OpenACC](C/jupyter_notebook/openacc/nways_openacc.ipynb) , [OpenACC Advanced](C/jupyter_notebook/openacc/nways_openacc_opt.ipynb)\n", "2. [Kokkos](C/jupyter_notebook/kokkos/nways_kokkos.ipynb)\n", "3. [stdpar](C/jupyter_notebook/stdpar/nways_stdpar.ipynb)\n", "4. [OpenMP](C/jupyter_notebook/openmp/nways_openmp.ipynb) \n", "5. [CUDA C](C/jupyter_notebook/cudac/nways_cuda.ipynb) " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "To finish the lab let us go through some final [remarks](C/jupyter_notebook/Final_Remarks.ipynb)\n", "\n", "### Tutorial Duration\n", "The lab material will be presented in a 4hr session. Link to material is available for download at the end of the lab.\n", "\n", "### Content Level\n", "Beginner, Intermediate\n", "\n", "### Target Audience and Prerequisites\n", "The target audience for this lab is ..\n", "\n", "\n", "-----\n", "\n", "#
[HOME](../../nways_start.ipynb)
\n", "-----\n", "\n", "\n", "## Licensing \n", "\n", "This material is released by NVIDIA Corporation under the Creative Commons Attribution 4.0 International (CC BY 4.0). " ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.4" } }, "nbformat": 4, "nbformat_minor": 4 }