README.md

Description / Rationale

HPCC Systems offers an enterprise-ready, open source supercomputing platform for solving big data problems. Compared to Hadoop, the platform analyzes big data with less code and fewer nodes, and it provides a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces that provide both end-user services and system management tools, and auxiliary components that support monitoring and facilitate loading and storing filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data and for transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes
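
The "distributed parallel processing" idea above can be pictured with a toy sketch. This is not HPCC Systems code and says nothing about how Thor is implemented: the function names `partition` and `distributed_sort` are hypothetical, the "nodes" are plain Python lists, and the per-node work runs sequentially here, whereas a real cluster would run it in parallel.

```python
import heapq
from typing import Iterable, List


def partition(records: Iterable[int], node_count: int) -> List[List[int]]:
    """Spread records across node_count hypothetical nodes by hashing."""
    parts: List[List[int]] = [[] for _ in range(node_count)]
    for record in records:
        parts[hash(record) % node_count].append(record)
    return parts


def distributed_sort(records: Iterable[int], node_count: int) -> List[int]:
    """Sort each partition independently, then merge the sorted runs."""
    parts = partition(records, node_count)
    # Each partition is sorted on its own; on a cluster this step is the
    # part that every node could perform at the same time.
    sorted_parts = [sorted(part) for part in parts]
    # A k-way merge combines the per-node results into one ordered stream.
    return list(heapq.merge(*sorted_parts))
```

Adding "nodes" in this sketch only changes how the records are split; the merged result stays the same, which is the property that lets such a job scale out.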

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes
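
As a rough illustration of "optimized for concurrent query processing", the sketch below builds an index once and then answers many lookups from a pool of threads. It is an assumption-laden toy, not Roxie's design: `build_index` and `serve_queries` are hypothetical names, and the index is an in-memory dictionary rather than a distributed one.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Tuple

Record = Tuple[str, int]


def build_index(records: List[Record]) -> Dict[str, List[int]]:
    """Group values by key once, ahead of query time."""
    index: Dict[str, List[int]] = {}
    for key, value in records:
        index.setdefault(key, []).append(value)
    return index


def serve_queries(index: Dict[str, List[int]], keys: List[str],
                  workers: int = 4) -> List[List[int]]:
    """Answer read-only lookups concurrently; results keep the order of keys."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda key: index.get(key, []), keys))
```

The design point being illustrated is the split between an expensive, one-time preparation step and cheap read-only queries that can safely run side by side.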

ECL

ECL (Enterprise Control Language) is a powerful programming language ideally suited to the manipulation of big data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extended using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Built-in access to the ECLWatch tool, allowing developers to watch job graphs as they execute
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy-to-use interface for accessing ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: