No description

Gavin Halliday cdb0a035d9 Merge pull request #13550 from richardkchapman/mastertags 5 years ago
.github 598defe8a0 HPCC-23358 Publish each gold/rc release images to Docker Hub 5 years ago
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 years ago
clienttools 250097ffbc HPCC-20550 remove target ESDL service name in ESDL. 6 years ago
cmake_modules 2405e70b4d Merge branch 'candidate-7.8.x' 5 years ago
common d29a3b4994 HPCC-23677 Improve soapcall handling of user specified GLobalID headers 5 years ago
configuration 28b0f1504c centos 6 workaround for 7.6.0 5 years ago
dali 5f1bc5ea9b Merge branch 'candidate-7.6.x' into candidate-7.8.x 5 years ago
deploy 1b5f9692cb HPCC-19467 Fix problems with windows stand alone compiling 7 years ago
deployment 2195b16127 HPCC-23582 fix envgen2 usage typo 5 years ago
devdoc 88e1bd2280 HPCC-20089 Rationalize and consolidate the developer documentation 7 years ago
devel cdbf9bb329 HPCC-23036 DistributePKI and safe_pki 5 years ago
dockerfiles 95480d5f20 HPCC-23746 Ensure type: roxie for thor agent works 5 years ago
docs af039418ad Merge pull request #13527 from g-pan/H23681-UbuDeps 5 years ago
ecl b42c3aa37e Merge pull request #13535 from GordonSmith/HPCC-23750-eclcc_keywords 5 years ago
ecllibrary 293b6e8c1b HPCC-23730 Bump std.DataPatterns to v1.6.3 5 years ago
esp 5ba916515a Merge pull request #13541 from kenrowland/HPCC-23757 5 years ago
fs 70640c3a3a HPCC-23255 Move RFCremove to 'slow' throttle category. 5 years ago
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 years ago
initfiles 43d0a0581d HPCC-23733 Should still generate public.key.pem file 5 years ago
lib2 53e0ecb20f Various changes to help make install work on OSX catalina 5 years ago
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 years ago
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue 8 years ago
plugins 2814e70f50 HPCC-23743 Fix release build errors for cassandraloggingagent 5 years ago
roxie eedeab2b1e HPCC-23723 Reduce over-use of templates in helm charts 5 years ago
rtl ceffd36b33 HPCC-23286 Stop differences in payload dataset from being treated as keyed 5 years ago
services 9ff9bae797 HPCC-21758 Make StringBuffer constructors explicit 6 years ago
system 1c0fc0a004 HPCC-23751 Fix additional problem with windows builds 5 years ago
testing 9b1eb08768 Merge pull request #13488 from ghalliday/issue23695 5 years ago
thorlcr 8f46c21be4 HPCC-23712 Thor-on-demand with queueing 5 years ago
tools 48b005c2e7 HPCC-23549 Remove legacy code for obtaining passwords from environment 5 years ago
.gitattributes 2241da2000 HPCC-17425 Various fixes for running HPCC in windows linux subsystem 8 years ago
.gitignore e93161d906 HPCC-23289 EclWatch option to override debug server 5 years ago
.gitmodules 53b580989a HPCC-23632 Move the AWS support libraries 5 years ago
BUILD_ME.md 910c676afe HPCC-19056 Update build instructions 5 years ago
CMakeLists.txt 7c12a59c5a HPCC-23693 Suppress "Up-to-date:" make install messages 5 years ago
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 14 years ago
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing 8 years ago
FUTURE b39eb133f9 Initial version of FUTURE document 14 years ago
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 years ago
README.md 1ec7f8ed88 Fix memory manager link 6 years ago
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 9 years ago
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 12 years ago
cmake-vs2015.bat 5db236fc6e HPCC-20752 Build and run HPCC in VS 2015 6 years ago
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR 9 years ago
hpcc.png a59ea3ce9c HPCC-23354 Supply an example helm chart and other docs 5 years ago
package-lock.json 4a7ea76b58 HPCC-19093 Update to latest hpcc-js 7 years ago
version.cmake a073dae0a1 Split off candidate-7.8.x 5 years ago

README.md

Description / Rationale

HPCC Systems offers an enterprise-ready, open source supercomputing platform to solve big data problems. Compared to Hadoop, the platform offers analysis of big data using less code and fewer nodes for greater efficiency, and it offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the sections that follow the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data and for transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1 to 1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1 to 1000s of nodes

ECL

ECL (Enterprise Control Language) is a powerful programming language ideally suited to the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extended using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy-to-use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions
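
As a rough illustration of the HTTP access path, the Python sketch below builds (but does not send) a JSON POST request for a published ECL query. Nothing in it comes from this README: the host name, the `roxie` target name, the query name `person_search`, its `lastname` parameter, port 8002 and the `/WsEcl/json/query/<target>/<query>` URL layout are all assumptions made for illustration, so check the ESP configuration of your own environment for the actual values.

```python
import json
from urllib.request import Request


def build_query_request(host, target, query, params, port=8002):
    """Build (without sending) a JSON POST request for a published ECL query.

    The default port and the URL layout are assumptions for illustration,
    not values documented in this README.
    """
    url = f"http://{host}:{port}/WsEcl/json/query/{target}/{query}"
    # Assumed request shape: the query name wraps its input parameters.
    body = json.dumps({query: params}).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical host, target, query and parameter names.
req = build_query_request(
    "esp.example.com", "roxie", "person_search", {"lastname": "SMITH"}
)
print(req.full_url)
print(req.data.decode("utf-8"))
```

Passing the resulting object to `urllib.request.urlopen` would send the request. The sketch stops short of that so it can be run without a live ESP server.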

Developer documentation

The following links describe the structure of the system and detail some of the key components: