Gavin Halliday 9e3774eb99 Merge pull request #8869 from richardkchapman/valgrind-errors 9 years ago
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 years ago
clienttools c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
cmake_modules 58b7311f18 Merge branch 'candidate-6.0.4' 9 years ago
common 94658780c3 HPCC-12396 Add ability to time specific functions 9 years ago
dali dc409bea5a HPCC-15797 Restrict spray source to designated dropzones 9 years ago
deploy c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
deployment 903ed561ed HPCC-15357 ConfigMgr - Topology Cluster should include alias attribute 9 years ago
docs 208fc8ecb2 HPCC-13583 DOCS:systemd init usage 9 years ago
ecl 94658780c3 HPCC-12396 Add ability to time specific functions 9 years ago
ecllibrary 906aa58b3f HPCC-15850 Remove duplicate format tests for Std.Date module 9 years ago
esp 6d7416430b HPCC-15810 Build errors with USE_CASSANDRA selected 9 years ago
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 years ago
initfiles 7843840cb3 Merge branch 'candidate-6.0.4' 9 years ago
lib2 357d817591 Merge branch 'candidate-5.6.4' into candidate-6.0.0 9 years ago
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 years ago
plugins 321dd0f768 HPCC-15885 Cassandra cpp driver 2.4.x list type issue 9 years ago
roxie 9e3774eb99 Merge pull request #8869 from richardkchapman/valgrind-errors 9 years ago
rtl 94658780c3 HPCC-12396 Add ability to time specific functions 9 years ago
services 80f36a02f3 Merge branch 'candidate-5.4.0' 10 years ago
system 94658780c3 HPCC-12396 Add ability to time specific functions 9 years ago
testing 57bcee8408 Merge pull request #8873 from AttilaVamos/HPCC-15881-fix-6.2.0 9 years ago
thorlcr 94658780c3 HPCC-12396 Add ability to time specific functions 9 years ago
tools 94658780c3 HPCC-12396 Add ability to time specific functions 9 years ago
.gitattributes 7f4953af04 Issue #254 Switches template reading to use jlib 14 years ago
.gitignore a8ff38a9ca Minor code cleaup to avoid false positives from Eclipse 13 years ago
.gitmodules 821c455a75 HPCC-9920 Add and use new LZ4 compression algos for spill 9 years ago
.travis.yml 50beb15156 HPCC-13601 Travis-CI 10 years ago
BUILD_ME.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md 10 years ago
CMakeLists.txt 611082fe9e Merge branch 'candidate-6.0.2' 9 years ago
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 14 years ago
CONTRIBUTORS c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
FUTURE b39eb133f9 Initial version of FUTURE document 13 years ago
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 years ago
README.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md 10 years ago
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 9 years ago
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 11 years ago
sourcedoc.xml c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 years ago
version.cmake 335f6424b7 Merge branch 'candidate-6.0.2' 9 years ago

README.md

Description / Rationale

HPCC Systems offers an enterprise-ready, open source supercomputing platform for solving big data problems. Compared to Hadoop, the platform analyzes big data using less code and fewer nodes, and it offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces that provide both end-user services and system management tools, and auxiliary components that support monitoring and facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the sections that follow the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data and for transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes
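
To make the idea of distributed parallel processing concrete, here is a minimal Python sketch of a partitioned sort. It is only an illustration of the general pattern (split the data across nodes, let each node work on its own partition, then merge the results) and is not how Thor is implemented. The function names and the round-robin partitioning are assumptions made for this example, and the "nodes" are simulated in a single process.

```python
import heapq


def partition(records, node_count):
    """Deal records round-robin into one partition per simulated node."""
    parts = [[] for _ in range(node_count)]
    for i, rec in enumerate(records):
        parts[i % node_count].append(rec)
    return parts


def local_sort(part):
    """Work one node does independently: sort only its own partition."""
    return sorted(part)


def distributed_sort(records, node_count=3):
    """Partition, sort each partition locally, then merge the sorted runs."""
    parts = partition(records, node_count)
    # Run sequentially here; on a real cluster each call would run on its own node.
    sorted_parts = [local_sort(p) for p in parts]
    return list(heapq.merge(*sorted_parts))
```

Because each partition is sorted without reference to the others, adding nodes reduces the amount of data each one handles, which is the property the list above refers to as scaling across nodes.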

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes
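
The contrast with Thor's batch-style processing can be sketched as a pool of worker threads answering many small lookups against a prebuilt index. The Python below is a hedged illustration of concurrent query serving in general, not Roxie's actual design. The `INDEX` contents, `run_query` and `serve` are hypothetical names invented for this example.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical prebuilt index: lookup key -> record.
INDEX = {
    "alice": {"name": "alice", "city": "Atlanta"},
    "bob": {"name": "bob", "city": "Boston"},
}


def run_query(key):
    """Answer one query with a read-only index lookup (None if not found)."""
    return INDEX.get(key)


def serve(keys, workers=4):
    """Answer a batch of queries concurrently; results keep the request order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_query, keys))
```

Each query only reads the index, so the workers need no locking, which is one reason a read-mostly query engine can handle many requests at once.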

ECL

ECL (Enterprise Control Language) is a powerful programming language that is ideally suited to the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extended using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy-to-use interface for accessing ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions