No Description

Richard Chapman 1c717508b2 Merge pull request #15946 from kenrowland/HPCC-27425 3 years ago
.devcontainer d3e0f52bf3 Merge branch 'candidate-8.4.x' 3 years ago
.github 2c54c85b39 HPCC-27428 Switch to use changed_modules action 3 years ago
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 years ago
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 years ago
clienttools 250097ffbc HPCC-20550 remove target ESDL service name in ESDL. 6 years ago
cmake_modules 82bf1f3915 HPCC-27209 VCPKG Client Tools Support 3 years ago
common 8353f5746a HPCC-27472 All ECL test code fails in Debug build. 3 years ago
configuration 72fab49096 HPCC-25212 Add support for unity builds in CMake 3 years ago
dali d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
deploy 1b5f9692cb HPCC-19467 Fix problems with windows stand alone compiling 7 years ago
deployment 66a454af72 Updates based on changes from Shamser 3 years ago
devdoc 721bf4ff16 HPCC-26479 Improve the git authentication for remote eclccserver 3 years ago
devel cdbf9bb329 HPCC-23036 DistributePKI and safe_pki 5 years ago
dockerfiles dffc139b5b Merge pull request #15944 from ghalliday/issue27421 3 years ago
docs d55b53656b Merge pull request #15899 from g-pan/H26614-hMem 3 years ago
ecl 8353f5746a HPCC-27472 All ECL test code fails in Debug build. 3 years ago
ecllibrary e121008804 HPCC-26382 Std.File.Copy does NOT allow copying to foreign file 3 years ago
esp d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
fs 4fa2ff0947 HPCC-27047 K8s Foreign file access 3 years ago
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 years ago
helm d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
initfiles 01ef96be26 HPCC-27425 Remove references to thing finder ESP service 3 years ago
lib2 2ec6e85ec6 HPCC-27211 add processing dynamic shared lib path for files under lib2 3 years ago
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 years ago
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue 8 years ago
plugins d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
roxie d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
rtl 72fab49096 HPCC-25212 Add support for unity builds in CMake 3 years ago
services 9ff9bae797 HPCC-21758 Make StringBuffer constructors explicit 6 years ago
system 1c717508b2 Merge pull request #15946 from kenrowland/HPCC-27425 3 years ago
testing d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
thorlcr ac017b0c45 Merge pull request #15952 from jakesmith/HPCC-27437-index-blob-leaks 3 years ago
tools d6d874c880 Merge branch 'candidate-8.6.x' 3 years ago
vcpkg @ af3a974a09 d450fd553d HPCC-27325 Bump vcpkg to latest 3 years ago
.gitattributes 2241da2000 HPCC-17425 Various fixes for running HPCC in windows linux subsystem 8 years ago
.gitignore 747492367d HPCC-27145 Untrack dockerfiles/platform-build-incremental/hpcc.gitpatch 3 years ago
.gitmodules 02b9941895 HPCC-27104 Add vcpkg submodule 3 years ago
BUILD_ME.md 910c676afe HPCC-19056 Update build instructions 5 years ago
CMakeLists.txt 82bf1f3915 HPCC-27209 VCPKG Client Tools Support 3 years ago
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 13 years ago
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing 8 years ago
FUTURE b39eb133f9 Initial version of FUTURE document 13 years ago
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 years ago
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 years ago
README.md 1ec7f8ed88 Fix memory manager link 6 years ago
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 9 years ago
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 years ago
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 11 years ago
cmake-vs2015.bat 5db236fc6e HPCC-20752 Build and run HPCC in VS 2015 6 years ago
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR 8 years ago
hpcc.png a59ea3ce9c HPCC-23354 Supply an example helm chart and other docs 5 years ago
vcpkg.json 76125c8a8b HPCC-27164 Add support for USE_BOOST_REGEX to vcpkg 3 years ago
vcpkg.md 82bf1f3915 HPCC-27209 VCPKG Client Tools Support 3 years ago
version.cmake 188408eaca Split off 8.6.0 3 years ago

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: