Nenhuma descrição

Richard Chapman c288863fc0 Merge pull request #14085 from afishbeck/wsecl-list 4 anos atrás
.github 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 anos atrás
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 anos atrás
clienttools 250097ffbc HPCC-20550 remove target ESDL service name in ESDL. 6 anos atrás
cmake_modules 3a203e3a2a Merge branch 'candidate-7.10.x' 4 anos atrás
common 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
configuration 28b0f1504c centos 6 workaround for 7.6.0 5 anos atrás
dali 0369ad4657 Merge pull request #14092 from richardkchapman/readiness 4 anos atrás
deploy 1b5f9692cb HPCC-19467 Fix problems with windows stand alone compiling 7 anos atrás
deployment d9b73fb2f3 HPCC-24240 Potential confusion between NotFound vs notFound symbols 5 anos atrás
devdoc 0c971b503b HPCC-24513 Improve workunit documentation about specialised workflow items 4 anos atrás
devel cdbf9bb329 HPCC-23036 DistributePKI and safe_pki 5 anos atrás
dockerfiles e7813e5a2b HPCC-24667 Increase platform docker image nodejs version 4 anos atrás
docs baff37444a HPCC-24686 Update terminology in SysAdmin Guide 4 anos atrás
ecl 3a203e3a2a Merge branch 'candidate-7.10.x' 4 anos atrás
ecllibrary 7504819618 HPCC-22927 RemotePull that needs external Esp should optionally take credentials 5 anos atrás
esp f314b3731e Merge pull request #14085 from afishbeck/wsecl-list 4 anos atrás
fs 1030239734 Merge branch 'candidate-7.8.x' into candidate-7.10.x 4 anos atrás
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 anos atrás
helm f314b3731e Merge pull request #14085 from afishbeck/wsecl-list 4 anos atrás
initfiles c245416c37 Merge pull request #13533 from gfortil/HPCC-21404 4 anos atrás
lib2 53e0ecb20f Various changes to help make install work on OSX catalina 5 anos atrás
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 anos atrás
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue 8 anos atrás
plugins 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
roxie 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
rtl 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
services 9ff9bae797 HPCC-21758 Make StringBuffer constructors explicit 6 anos atrás
system f314b3731e Merge pull request #14085 from afishbeck/wsecl-list 4 anos atrás
testing 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
thorlcr 18a0d978fb Merge branch 'candidate-7.10.x' 4 anos atrás
tools 90dba936dc Merge pull request #14082 from ghalliday/winfixes 4 anos atrás
.gitattributes 2241da2000 HPCC-17425 Various fixes for running HPCC in windows linux subsystem 8 anos atrás
.gitignore 7784686fd2 HPCC-24205 Add pyc files to git ignore 4 anos atrás
.gitmodules f490e45e1d HPCC-23209 Add azure blob support library 5 anos atrás
BUILD_ME.md 910c676afe HPCC-19056 Update build instructions 5 anos atrás
CMakeLists.txt b8664970ee HPCC-23313 Disable shebang process in rpm generation. 5 anos atrás
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 13 anos atrás
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing 8 anos atrás
FUTURE b39eb133f9 Initial version of FUTURE document 13 anos atrás
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 anos atrás
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 anos atrás
README.md 1ec7f8ed88 Fix memory manager link 6 anos atrás
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 9 anos atrás
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 anos atrás
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 11 anos atrás
cmake-vs2015.bat 5db236fc6e HPCC-20752 Build and run HPCC in VS 2015 6 anos atrás
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR 8 anos atrás
hpcc.png a59ea3ce9c HPCC-23354 Supply an example helm chart and other docs 5 anos atrás
package-lock.json 4a7ea76b58 HPCC-19093 Update to latest hpcc-js 7 anos atrás
version.cmake 7133b4eb8a Split off 7.10.x 5 anos atrás

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: