SysAdminConfigMod.xml 23 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650
  1. <?xml version="1.0" encoding="UTF-8"?>
  2. <!DOCTYPE book PUBLIC "-//OASIS//DTD DocBook XML V4.5//EN"
  3. "http://www.oasis-open.org/docbook/xml/4.5/docbookx.dtd">
  4. <book xml:base="../">
  5. <chapter id="Usin-HPCC-ConfigMgr">
  6. <title>Using Configuration Manager</title>
  7. <para id="cfgmgr_introP0">Configuration Manager is the utility with which
  8. we configure the HPCC platform. The HPCC platform's configuration is
  9. stored in an XML file named <emphasis
  10. role="bold">environment.xml</emphasis>. Once you generate an environment
  11. (xml) file, it gets saved into a source directory (default is <emphasis
  12. role="bold">/etc/HPCCSystems/source</emphasis>). You then need to stop the
  13. system to copy it into the active HPCC directory, then distribute it into
  14. place on to each node and restart the HPCC system. At no time during
  15. configuration do you work on the live environment file.</para>
  16. <para id="cfgmgr_introP1">When you install the HPCC system package, a
  17. default single-node environment.xml file is generated. After that, you can
  18. use the Configuration Manager to modify it and/or create a different
  19. environment file to configure components, or add nodes. There is a
  20. Configuration Manager wizard to help create an environment file. Give any
  21. environment file you create a descriptive name that would indicate what it
  22. is for in the source. For example, you might create an environment without
  23. a Roxie, you could call that file
  24. <emphasis>environmentNoRoxie.xml</emphasis>.</para>
  25. <para id="cfgmgr_p1b">You would then copy the new configuration file you
  26. generate from the source directory to the <emphasis
  27. role="bold">/etc/HPCCSystems</emphasis> directory. Rename the file to
  28. environment.xml, and restart the system in order to reconfigure your
  29. system.</para>
  30. <para id="cfgmgr_introP2">Configuration Manager also offers an <emphasis
  31. role="bold">Advanced View</emphasis> which allows more granularity for you
  32. to add instances of components or change the default settings of
  33. components for more advanced users. Even if you plan to use the Advanced
  34. View, it is a good idea to start with a wizard generated configuration
  35. file and use Advanced View to edit it.</para>
  36. <para id="cfgmgr_introP3">More information and specific details for each
  37. Configuration Manager component and attributes of those components is
  38. detailed in <emphasis>Using Configuration Manager</emphasis>.</para>
  39. <para>The following sections will provide the details for configuring an
  40. HPCC environment using the Configuration Manager.</para>
  41. <sect1 id="configuring-a-multi-node-system">
  42. <title>Running the Configuration Manager</title>
  43. <para>This section will guide you through configuring an HPCC
  44. environment using the Configuration Manager.</para>
  45. <para>The HPCC package should already be installed on ALL nodes.</para>
  46. <para>You can use any tool or shell script you choose.</para>
  47. <orderedlist>
  48. <listitem>
  49. <para>SSH to a node in your environment and login as a user with
  50. sudo privileges. We would suggest that it would be the first node,
  51. and that it is a support node, however that is up to your
  52. discretion.</para>
  53. </listitem>
  54. <listitem>
  55. <para>Start the Configuration Manager service on the node (again we
  56. would suggest that it should be on a support node, and further that
  57. you use the same node to start the Configuration Manager every time,
  58. but this is also entirely up to you).</para>
  59. <programlisting>sudo /opt/HPCCSystems/sbin/configmgr</programlisting>
  60. <para><graphic
  61. fileref="../../images/gs_img_configmgrStart.jpg" /></para>
  62. </listitem>
  63. <listitem>
  64. <para>Using a Web browser, go to the Configuration Manager's
  65. interface:</para>
  66. <programlisting>http://&lt;<emphasis>ip of installed system</emphasis>&gt;:8015</programlisting>
  67. <para>The Configuration Manager startup wizard displays.</para>
  68. </listitem>
  69. </orderedlist>
  70. <para>There are different ways to configure your HPCC system. You can
  71. use the <emphasis role="bold">Generate environment wizard</emphasis> and
  72. use that environment or experienced users can then use the <emphasis
  73. role="bold">Advanced View</emphasis> for more specific customization.
  74. There is also the option of using <emphasis role="bold">Create blank
  75. environment</emphasis> to generate an empty environment that you could
  76. then go in and add only the components you would want.</para>
  77. <sect2 id="Env_Wizard">
  78. <title>Environment Wizard</title>
  79. <orderedlist>
  80. <listitem>
  81. <?dbfo keep-together="always"?>
  82. <para>To use the wizard select the <emphasis role="bold">Generate
  83. new environment using wizard</emphasis> button.</para>
  84. <para><graphic fileref="../../images/GS_ConfigMgrWizStart.jpg"
  85. vendor="configmgrSS" /></para>
  86. </listitem>
  87. <listitem>
  88. <para>Provide a name for the environment file.</para>
  89. <para>This will then be the name of the configuration XML file.
  90. For example, we will name our environment
  91. <emphasis>NewEnvironment</emphasis> and this will produce a
  92. configuration XML file named
  93. <emphasis>NewEnvironment.xml</emphasis> that we will
  94. use.<emphasis> </emphasis></para>
  95. </listitem>
  96. <listitem>
  97. <para>Press the Next button.</para>
  98. <para>Next you will need to define the IP addresses that your HPCC
  99. system will be using.</para>
  100. </listitem>
  101. <listitem>
  102. <?dbfo keep-together="always"?>
  103. <para>Enter the IP addresses or hostname(s).</para>
  104. <para>IP Addresses can be specified individually using semi-colon
  105. delimiters. You can also specify a range of IPs using a hyphen
  106. (for example, nnn.nnn.nnn.x-y). In the image below, we specified
  107. the IP addresses 10.239.219.1 through 10.239.219.100 using the
  108. range syntax, and also a single IP 10.239.219.111.</para>
  109. <para><graphic fileref="../../images/GS_ConfigMgrWiz002.jpg"
  110. vendor="configmgrSS" /></para>
  111. </listitem>
  112. <listitem>
  113. <para>Press the Next button.</para>
  114. <para>Now you will define how many nodes to use for the Roxie and
  115. Thor clusters.</para>
  116. </listitem>
  117. <listitem>
  118. <?dbfo keep-together="always"?>
  119. <para>Enter the appropriate values as indicated.</para>
  120. <para><graphic fileref="../../images/GS_CMWiz003.jpg"
  121. vendor="configmgrSS" /></para>
  122. <variablelist>
  123. <varlistentry>
  124. <term>Number of support nodes:</term>
  125. <listitem>
  126. <para>Specify the number of nodes to use for support
  127. components. The default is 1.</para>
  128. </listitem>
  129. </varlistentry>
  130. <varlistentry>
  131. <term>Number of nodes for Roxie cluster:</term>
  132. <listitem>
  133. <para>Specify the number of nodes to use for your Roxie
  134. cluster. Enter zero (0) if you do not want a Roxie
  135. cluster.</para>
  136. </listitem>
  137. </varlistentry>
  138. <varlistentry>
  139. <term>Number of slave nodes for Thor cluster</term>
  140. <listitem>
  141. <para>Specify the number of slave nodes to use in your Thor
  142. cluster. A Thor master node will be added automatically.
  143. Enter zero (0) if you do not want any Thor slaves.</para>
  144. </listitem>
  145. </varlistentry>
  146. <varlistentry>
  147. <term>Number of Thor slaves per node (default 1)</term>
  148. <listitem>
  149. <para>Specify the number of Thor slave processes to
  150. instantiate on each slave node. Enter zero (0) if you do not
  151. want a Thor cluster.</para>
  152. </listitem>
  153. </varlistentry>
  154. <varlistentry>
  155. <term>Enable Roxie on demand</term>
  156. <listitem>
  157. <para>Specify whether or not to allow queries to be run
  158. immediately on Roxie. This must be enabled to run the
  159. debugger. (Default is true)</para>
  160. </listitem>
  161. </varlistentry>
  162. </variablelist>
  163. </listitem>
  164. <listitem>
  165. <para>Press the <emphasis role="bold">Next</emphasis>
  166. button</para>
  167. <para>The wizard displays the configuration parameters.</para>
  168. </listitem>
  169. <listitem>
  170. <?dbfo keep-together="always"?>
  171. <para>Press the <emphasis role="bold">Finish</emphasis> button to
  172. accept these values or press the <emphasis role="bold">Advanced
  173. View</emphasis> button to edit in advanced mode.</para>
  174. <graphic fileref="../../images/GS_ConfigMgrWiz004.jpg" />
  175. </listitem>
  176. </orderedlist>
  177. <para>You will now be notified that you have completed the
  178. wizard.</para>
  179. <para><graphic fileref="../../images/GS_ConfigMgrWiz005.jpg" /></para>
  180. <para>At this point, you have created a file named NewEnvironment.xml
  181. in the <emphasis role="bold">/etc/HPCCSystems/source</emphasis>
  182. directory</para>
  183. <informaltable colsep="1" frame="all" rowsep="1">
  184. <?dbfo keep-together="always"?>
  185. <tgroup cols="2">
  186. <colspec colwidth="49.50pt" />
  187. <colspec />
  188. <tbody>
  189. <row>
  190. <entry><inlinegraphic
  191. fileref="../../images/OSSgr3.png" /></entry>
  192. <entry>Keep in mind, that your HPCC configuration may be
  193. different depending on your needs. For example, you may not
  194. need a Roxie or you may need several smaller Roxie clusters.
  195. In addition, in a production [Thor] system, you would ensure
  196. that Thor and Roxie nodes are dedicated and have no other
  197. processes running on them. This document is intended to show
  198. you how to use the configuration tools. Capacity planning and
  199. system design is covered in a training module.</entry>
  200. </row>
  201. </tbody>
  202. </tgroup>
  203. </informaltable>
  204. <sect3 id="Distrib_Conf" role="brk">
  205. <title>Distribute the Configuration</title>
  206. <orderedlist>
  207. <listitem>
  208. <?dbfo keep-together="always"?>
  209. <para>Stop the HPCC system.</para>
  210. <para>If it is running stop the HPCC system (on every node),
  211. using a command such as this:</para>
  212. <para><programlisting>sudo /sbin/service hpcc-init stop</programlisting></para>
  213. <variablelist>
  214. <varlistentry>
  215. <term>Note:</term>
  216. <listitem>
  217. <para>You may have a multi-node system and a custom script
  218. such as the one illustrated in Appendix of the <emphasis
  219. role="bluebold">Installing and Running the HPCC
  220. Platform</emphasis> document to start and stop your
  221. system. If that is the case please use the appropriate
  222. command for stopping your system on every node.</para>
  223. </listitem>
  224. </varlistentry>
  225. </variablelist>
  226. <para><informaltable colsep="1" frame="all" rowsep="1">
  227. <tgroup cols="2">
  228. <colspec colwidth="49.50pt" />
  229. <colspec />
  230. <tbody>
  231. <row>
  232. <entry><inlinegraphic
  233. fileref="../../images/caution.png" /></entry>
  234. <entry>Be sure HPCC is stopped before attempting to
  235. copy the environment.xml file.</entry>
  236. </row>
  237. </tbody>
  238. </tgroup>
  239. </informaltable></para>
  240. </listitem>
  241. <listitem>
  242. <para>Back up the original environment.xml file.</para>
  243. <programlisting># For example
  244. sudo -u hpcc cp /etc/HPCCSystems/environment.xml /etc/HPCCSystems/source/environment-date.xml </programlisting>
  245. <variablelist>
  246. <varlistentry>
  247. <term>Note:</term>
  248. <listitem>
  249. <para>The live environment.xml file is located in your
  250. <emphasis role="bold">/etc/HPCCSystems/</emphasis>
  251. directory. Configuration Manager works on files in
  252. <emphasis role="bold">/etc/HPCCSystems/source</emphasis>
  253. directory. You must copy from this location to make an
  254. environment.xml file active.</para>
  255. </listitem>
  256. </varlistentry>
  257. </variablelist>
  258. <para>You can also choose to give the environment file a more
  259. descriptive name, to help differentiate any differences.</para>
  260. <para>Having environment files under source control is a good
  261. way to archive your environment settings.</para>
  262. </listitem>
  263. <listitem>
  264. <para>Copy the new .xml file from the source directory to the
  265. /etc/HPCCSystems and rename the file to
  266. <emphasis>environment.xml</emphasis></para>
  267. <programlisting># for example
  268. sudo -u hpcc cp /etc/HPCCSystems/source/NewEnvironment.xml /etc/HPCCSystems/environment.xml</programlisting>
  269. <para></para>
  270. </listitem>
  271. <listitem>
  272. <para>Copy the <emphasis
  273. role="bold">/etc/HPCCSystems/environment.xml</emphasis> to the
  274. <emphasis role="bold">/etc/HPCCSystems/</emphasis> on to
  275. <emphasis>every</emphasis> node.</para>
  276. <para>You may want to use a script to push out the XML file to
  277. all nodes. See the <emphasis>Example Scripts</emphasis> section
  278. in the Appendix of the <emphasis role="bluebold">Installing and
  279. Running the HPCC Platform</emphasis> document. You can use the
  280. scripts as a model to create your own script to copy the
  281. environment.xml file out to all your nodes.</para>
  282. </listitem>
  283. <listitem>
  284. <para>Restart the HPCC platform on all nodes.</para>
  285. </listitem>
  286. </orderedlist>
  287. </sect3>
  288. </sect2>
  289. </sect1>
  290. </chapter>
  291. <chapter id="Advance-SysAdmin-Topic-Chapter">
  292. <title>Advanced Systems Administrator Topics</title>
  293. <para>This chapter contains information about certain advanced HPCC
  294. Systems<superscript>®</superscript> Administrators topics.</para>
  295. <sect1 id="Admin-System-Topic">
  296. <title>Admin System Topics</title>
  297. <para>This is a System Administrative topic designed to provide some
  298. insight as to an aspect of System Administration for you HPCC
  299. System.</para>
  300. </sect1>
  301. <sect1 id="System_sizings">
  302. <title>System Sizings</title>
  303. <para>This section provides some guidance in determining the sizing
  304. requirements for an initial installation of HPCC. The following are some
  305. suggested configuration guides that can be helpful when planning your
  306. system.</para>
  307. <sect2 id="SysAdmin_MinSuggestedHW" role="nobrk">
  308. <title>Minimum Suggested Hardware</title>
  309. <para>HPCC was designed to run on common commodity hardware, and could
  310. function on even lesser hardware. The following list is the suggested
  311. minimum hardware specifications. At the very minimum you should
  312. consider the following hardware components for your HPCC system. These
  313. guidelines were put together based on real world usage of mission
  314. critical (uptime) with high volume data. <informaltable border="all"
  315. colsep="1" rowsep="1">
  316. <tgroup cols="3">
  317. <colspec colwidth="94.50pt" />
  318. <colspec colwidth="84.50pt" />
  319. <tbody>
  320. <row>
  321. <entry><emphasis role="bold">Thor slave</emphasis></entry>
  322. <entry>Processor</entry>
  323. <entry>4 x 64-bit Intel Processor per</entry>
  324. </row>
  325. <row>
  326. <entry></entry>
  327. <entry>RAM</entry>
  328. <entry>8GB per daemon</entry>
  329. </row>
  330. <row>
  331. <entry></entry>
  332. <entry>Storage</entry>
  333. <entry>RAID - 200MB/sec Sequential Read/Write per
  334. node</entry>
  335. </row>
  336. <row>
  337. <entry></entry>
  338. <entry>Network</entry>
  339. <entry>1 Gb/sec bandwidth</entry>
  340. </row>
  341. <row>
  342. <entry><emphasis role="bold">Roxie</emphasis></entry>
  343. <entry>Processor</entry>
  344. <entry>4 x 64-bit Intel Processor</entry>
  345. </row>
  346. <row>
  347. <entry></entry>
  348. <entry>RAM</entry>
  349. <entry>12GB per Roxie</entry>
  350. </row>
  351. <row>
  352. <entry></entry>
  353. <entry>Storage</entry>
  354. <entry>400 IOPS &amp; 2 Volumes per (RAID optional)</entry>
  355. </row>
  356. <row>
  357. <entry></entry>
  358. <entry>Network</entry>
  359. <entry>1 Gb/sec bandwidth</entry>
  360. </row>
  361. <row>
  362. <entry><emphasis role="bold">Dali</emphasis></entry>
  363. <entry>Processor</entry>
  364. <entry>4 x 64-bit Intel Processor each</entry>
  365. </row>
  366. <row>
  367. <entry></entry>
  368. <entry>RAM</entry>
  369. <entry>24GB per Dali</entry>
  370. </row>
  371. <row>
  372. <entry></entry>
  373. <entry>Storage</entry>
  374. <entry>RAID 1, 5, 6, 10 Volume 200GB</entry>
  375. </row>
  376. <row>
  377. <entry><emphasis role="bold">Other</emphasis></entry>
  378. <entry>Processor</entry>
  379. <entry>4 x 64-bit Intel Processor</entry>
  380. </row>
  381. <row>
  382. <entry></entry>
  383. <entry>RAM</entry>
  384. <entry>12GB</entry>
  385. </row>
  386. <row>
  387. <entry></entry>
  388. <entry>Storage</entry>
  389. <entry>RAID 1, 5, 6, 10 Volume 200GB</entry>
  390. </row>
  391. <row>
  392. <entry></entry>
  393. <entry>Network</entry>
  394. <entry>1 Gb/sec bandwidth</entry>
  395. </row>
  396. </tbody>
  397. </tgroup>
  398. </informaltable></para>
  399. </sect2>
  400. </sect1>
  401. <sect1 id="Sample_Sizings">
  402. <title>Sample Sizings</title>
  403. <para>This section illustrates sample system sizings for various work
  404. environments. Unlike system requirements, the following samples are
  405. suggestions for setting up you system for various operating
  406. conditions.</para>
  407. <sect2 id="Sample-Size-HighDataVolume">
  408. <title>Sample Sizing for High Data volume (Typical)</title>
  409. <para>The most typical scenario for HPCC is utilizing it with a high
  410. volume of data. This suggested sample sizing would be appropriate for
  411. a site with large volumes of data. A good policy is to set the Thor
  412. size to 4 times the source data on your HPCC. Typically, Roxie would
  413. be about ¼ the size of Thor. This is because the data is compressed
  414. and the system does not hold any transient data in Roxie. Remember
  415. that you do not want the number of Roxie nodes to exceed the number of
  416. Thor nodes.</para>
  417. <sect3 id="SysAdm_SmplSiz_HiDataThor">
  418. <title>High Data Thor sizing considerations</title>
  419. <para>Each Thor node can hold about 2.5 TB of data (MAX), so plan
  420. for the number of Thor nodes accordingly for your data.</para>
  421. <para>If possible, SAS drives for both Thor and Roxie as they almost
  422. equal to SATA drives now. If not for both, get SAS drives at least
  423. for your Roxie cluster.</para>
  424. <para>Thor replicates data and is typically configured for two
  425. copies.</para>
  426. </sect3>
  427. <sect3 id="SysAdm_BestPrac_HiDataRoxie">
  428. <title>High Data Roxie sizing considerations</title>
  429. <para>Roxie keeps most of its data in memory, so you should allocate
  430. plenty of memory for Roxie. Calculate the approximate size of your
  431. data, and allocate appropriately. You should either increase the
  432. number of nodes, or increase the amount of memory.</para>
  433. <para>A good practice is to allocate a Dali for every Roxie
  434. cluster.</para>
  435. <para>Roxie should have a mirror. This is useful, when you need to
  436. update data. You update the mirror then make that primary and bring
  437. the other one down. This is a good practice but not really a
  438. necessity except in the case of high availability.</para>
  439. </sect3>
  440. </sect2>
  441. <sect2 id="SysAdm_BestPrac_HevyProc_LowData">
  442. <title>Sample Sizing for Heavy Processing on Low Data Volume</title>
  443. <para>The following section provides some sample sizing for heavy
  444. processing with approximately the amount of data indicated.</para>
  445. <sect3 id="SysAdm_BestPrac_750GB">
  446. <title>750 GB of Raw Data</title>
  447. <para>Thor = 3 (slaves) + 2 (management) = 5 Nodes</para>
  448. <para>Roxie = 3 (agents) + 1 (Dali) = 4 Nodes (This will mean that
  449. the environment will be down during query deployment)</para>
  450. <para>Spares = 2</para>
  451. <para>Total = 13 nodes</para>
  452. </sect3>
  453. <sect3 id="SysAdm_BestPrac_1250GB">
  454. <title>1250 GB of Raw Data</title>
  455. <para>Thor = 6 (slaves) + 2 (management) = 8 Nodes</para>
  456. <para>Roxie = 4 (agents) + 1 (Dali) = 5 Nodes (This will mean that
  457. the environment will be down during query deployment)</para>
  458. <para>Spares = 2</para>
  459. <para>Total = 17 nodes</para>
  460. </sect3>
  461. <sect3 id="SysAdm_BestPrac_2000GB">
  462. <title>2000 GB of Raw Data</title>
  463. <para>Thor = 8 (slaves) + 3 (management) = 11 Nodes</para>
  464. <para>Roxie = 4 (agents) + 1 (Dali) = 5 Nodes (This will mean that
  465. the environment will be down during query deployment)</para>
  466. <para>Spares = 2</para>
  467. <para>Total = 20 nodes</para>
  468. </sect3>
  469. <sect3 id="SysAdm_BestPrac_3500GB">
  470. <title>3500 GB of Raw Data</title>
  471. <para>Thor = 12 (slaves) + 5 (management) = 17 Nodes</para>
  472. <para>Roxie = 6 (agents) + 1 (Dali) = 7 Nodes (This will mean that
  473. the environment will be down during query deployment)</para>
  474. <para>Spares = 2</para>
  475. <para>Total = 28 nodes</para>
  476. </sect3>
  477. </sect2>
  478. </sect1>
  479. </chapter>
  480. </book>