README.md

Description / Rationale

HPCC Systems offers an enterprise-ready, open source supercomputing platform to solve big data problems. Compared to Hadoop, the platform analyzes big data using less code and fewer nodes, and it offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces that provide both end-user services and system management tools, and auxiliary components that support monitoring and facilitate loading and storing filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data and for transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1 to 1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1 to 1000s of nodes

ECL

ECL (Enterprise Control Language) is a powerful programming language that is ideally suited to the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extended using C++ libraries
  • ECL is compiled into optimized C++
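
As a rough illustration of the non-procedural, dataflow-oriented style listed above, the following is a minimal ECL sketch. The record layout, field names and inline data are hypothetical and exist only for this example; nothing here is taken from the HPCC Systems source tree.

```ecl
// Hypothetical record layout, used only for this illustration
PersonRec := RECORD
  STRING20  firstName;
  UNSIGNED1 age;
END;

// Small inline dataset; a real job would normally read a logical file instead
people := DATASET([{'Alice', 34}, {'Bob', 27}, {'Carol', 45}], PersonRec);

// Each definition states what a result is, not how or where to compute it
adults := people(age >= 30);       // record filter
byAge  := SORT(adults, -age);      // oldest first

// The OUTPUT action is what causes the definitions above to be evaluated
OUTPUT(byAge);
```

Each line other than `OUTPUT` is a definition rather than an executable statement, which is what leaves the compiler free to decide how the work is ordered and spread across nodes. With the sample rows above, the result should list Carol followed by Alice.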

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy-to-use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: