Bladeren bron

Chnaged to latest version of RAPIDS and CUDA. Fixed minor bugs

bharatk-parallel 4 jaren geleden
bovenliggende
commit
e63f62f12f

+ 2 - 1
ai/RAPIDS/Dockerfile

@@ -1,5 +1,6 @@
 # Select Base Image 
-FROM rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04-py3.7
+#FROM rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04-py3.7
+FROM rapidsai/rapidsai:cuda11.0-runtime-ubuntu18.04-py3.7
 
 # Update the repo
 RUN apt-get update -y

+ 111 - 0
ai/RAPIDS/English/Python/START_HERE.ipynb

@@ -0,0 +1,111 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# GPU BOOTCAMP FOR RAPIDS"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "This repository consists of gpu bootcamp material for RAPIDS AI. The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. In this series you can access RAPIDS learning resources in the form of labs. The modules covered in this Bootcamp are CuDF, CuML, Dask and Challenge. To access each module individually, you can refer to the respective folders in this repository. "
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Contents\n",
+    "\n",
+    "This RAPIDS Bootcamp will be covering some interesting libraries that are easily integratable with the daily data science pipeline and accelerate computations for faster execution. Each module has a backup exercise notebook that you can refer to incase you wish to recover some cells in a lab that you deleted by mistake. Here are the modules in this tutorial:\n",
+    "\n",
+    "- Introduction\n",
+    "    - [Introduction to RAPIDS](jupyter_notebook/Introduction_To_Rapids.ipynb)\n",
+    "    - [Acknowledgement and References](jupyter_notebook/References.ipynb)\n",
+    "    \n",
+    "\n",
+    "- CuDF\n",
+    "    - [Introduction To CuDF](jupyter_notebook/CuDF/01-Intro_to_cuDF.ipynb)\n",
+    "    - [Introduction To UDFs](jupyter_notebook/CuDF/02-Intro_to_cuDF_UDFs.ipynb)\n",
+    "    - [Exercise](jupyter_notebook/CuDF/03-Cudf_Exercise.ipynb)\n",
+    "    - [Backup](jupyter_notebook/CuDF/Backup.ipynb)\n",
+    "    \n",
+    "    \n",
+    "    \n",
+    "- CuML\n",
+    "    - [Linear Regression and Hyperparameter Tuning](jupyter_notebook/CuML/01-LinearRegression-Hyperparam.ipynb)\n",
+    "    - [Exercise](jupyter_notebook/CuML/03_CuML_Exercise.ipynb)\n",
+    "    - [Backup](jupyter_notebook/CuML/Backup.ipynb)\n",
+    "    - [Bonus Lab- Logistic Regression and CuPy](jupyter_notebook/CuML/Bonus_Lab-LogisticRegression.ipynb)\n",
+    "    \n",
+    "    \n",
+    "    \n",
+    "- Dask\n",
+    "    - [Introduction to Dask](jupyter_notebook/Dask/01-Intro_to_Dask.ipynb)\n",
+    "    - [Dask-cuDF](jupyter_notebook/Dask/02-CuDF_and_Dask.ipynb)\n",
+    "    - [Dask-cuML](jupyter_notebook/Dask/03-CuML_and_Dask.ipynb)\n",
+    "    - [Exercise](jupyter_notebook/Dask/04-Challenge.ipynb)\n",
+    "    - [Backup](jupyter_notebook/Dask/Backup.ipynb)\n",
+    "    \n",
+    "    \n",
+    "\n",
+    "\n",
+    "- Challenge\n",
+    "\n",
+    "    - Gene Expression Classification\n",
+    "        - [Challenge](jupyter_notebook/Challenge/Gene-Expression-Classification/Challenge.ipynb)\n",
+    "        - [Backup](jupyter_notebook/Challenge/Gene-Expression-Classification/Backup.ipynb)\n",
+    "                     \n",
+    "    - Bike Rental Prediction\n",
+    "        - [Challenge](jupyter_notebook/Challenge/Bike-Rental-Prediction/Challenge.ipynb)\n",
+    "        - [Backup](jupyter_notebook/Challenge/Bike-Rental-Prediction/Backup.ipynb)\n",
+    "\n",
+    "\n",
+    "\n",
+    "Note: The challenge is an extra module that you can try after learning the individual modules to test your knowledge.\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "For more information on RAPIDS, feel free to explore the official documentation here:\n",
+    "- https://rapids.ai/about.html\n",
+    "- https://github.com/rapidsai"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Licensing\n",
+    "  \n",
+    "This material is released by NVIDIA Corporation under the Creative Commons Attribution 4.0 International (CC BY 4.0)."
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.6.2"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}

+ 2 - 2
ai/RAPIDS/English/Python/jupyter_notebook/CuML/01-LinearRegression-Hyperparam.ipynb

@@ -365,7 +365,7 @@
     "plt.scatter(x, y_noisy, label='empirical data points')\n",
     "plt.plot(x, y, color='black', label='true relationship')\n",
     "plt.plot(inputs, outputs, color='red', label='predicted relationship (cpu)')\n",
-    "plt.plot(inputs, gpu_outputs.to_array(), color='green', label='predicted relationship (gpu)')\n",
+    "plt.plot(inputs, gpu_outputs.get(), color='green', label='predicted relationship (gpu)')\n",
     "plt.legend()\n",
     "\n",
     "</pre>\n",
@@ -758,7 +758,7 @@
     "&emsp;&emsp;&emsp;&emsp;&emsp;\n",
     "&emsp;&emsp;&emsp;&emsp;&emsp;\n",
     "&emsp;&emsp;&emsp;&emsp;&emsp;\n",
-    "[Next Notebook](03_CuML_Exercise.ipynbb)\n",
+    "[Next Notebook](03_CuML_Exercise.ipynb)\n",
     "\n",
     "\n",
     "&emsp;&emsp;&emsp;&emsp;&emsp;\n",

+ 5 - 11
ai/RAPIDS/English/Python/jupyter_notebook/Introduction_To_Rapids.ipynb

@@ -32,17 +32,11 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "#### Why? \n",
-    "\n",
-    "- With GPU Acceleration RAPIDS help by reducing data preparation, train and inference time \n",
-    "- RAPIDS is designed to have API and interface similar to the traditional familiar tools and maintain accuracy\n",
-    "#### How? \n",
-    "RAPIDS is a open source project and can be donwloaded from [here](https://github.com/rapidsai) . Also RAPIDS is presnet as a container in NGC and can be downloaded from [here](https://ngc.nvidia.com/catalog/containers/nvidia:rapidsai:rapidsai)\n",
-    "#### Who?\n",
-    "RAPIDS is primarily targeted for Data Scientist who are looking at improving performance of their experiments without losing out on accuracy\n",
-    "#### When? \n",
-    "Same day RAPIDS posted to Github\n",
-    "\n"
+    "#### Key Features? \n",
+    "- Hassle-Free Integration: Accelerate your Python data science toolchain with minimal code changes and no new tools to learn \n",
+    "- Top Model Accuracy: Increase machine learning model accuracy by iterating on models faster and deploying them more frequently\n",
+    "- Reduced Training Time: RAPIDS is primarily targeted for Data Scientist who are looking at improving performance of their experiments without losing out on accuracy. Drastically improve your productivity with near-interactive data science.\n",
+    "- Open Source: Customizable, extensible, interoperable - the open-source software is supported by NVIDIA and built on Apache Arrow. It can be donwloaded from [here](https://github.com/rapidsai) . Also RAPIDS is presnet as a container in NGC and can be downloaded from [here](https://ngc.nvidia.com/catalog/containers/nvidia:rapidsai:rapidsai)\n"
    ]
   },
   {

+ 3 - 1
ai/RAPIDS/English/Python/jupyter_notebook/References.ipynb

@@ -17,7 +17,9 @@
     "- https://rapids.ai/about.html\n",
     "- https://distributed.dask.org/en/latest/worker.html#\n",
     "- https://elitedatascience.com/imbalanced-classes\n",
-    "- https://machinelearningmastery.com/train-test-split-for-evaluating-machine-learning-algorithms/"
+    "- https://machinelearningmastery.com/train-test-split-for-evaluating-machine-learning-algorithms/\n",
+    "- https://github.com/zronaghi/nasa-ml-workshop\n",
+    "-  https://github.com/rapidsai/notebooks"
    ]
   },
   {

+ 1 - 1
ai/RAPIDS/README.MD

@@ -2,7 +2,7 @@
 
 ## GPU Bootcamp for RAPIDS AI
 
-This repository consists of gpu bootcamp material for RAPIDS AI. The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. In this series you can access RAPIDS learning resources in the form of labs. The modules covered in this Bootcamp are CuDF, CuML, Dask and Challenge. 
+This repository consists of gpu bootcamp material for RAPIDS AI. The RAPIDS suite of open source software libraries gives you the freedom to execute end-to-end data science and analytics pipelines entirely on GPUs. In this series you can access RAPIDS learning resources in the form of labs. The modules covered in this Bootcamp are CuDF, CuML, Dask and Challenge.
 
 ## Prerequisites
 To run this tutorial you will need a machine with NVIDIA GPU.

+ 2 - 2
ai/RAPIDS/Singularity

@@ -1,7 +1,7 @@
 # Copyright (c) 2020 NVIDIA Corporation.  All rights reserved.
 
 Bootstrap: docker
-FROM: rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04-py3.7
+FROM: FROM rapidsai/rapidsai:cuda11.0-runtime-ubuntu18.04-py3.7
 
 %environment
 %post
@@ -19,4 +19,4 @@ FROM: rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04-py3.7
     "$@"
 
 %labels
-    AUTHOR Infernolia
+    AUTHOR Infernolia