Ei kuvausta

Michael Gardner 2779f77e3d HPCC-18380 WsSQL migration into platform 7 vuotta sitten
.github 90b33faa94 HPCC-18020 Add new option to pull request template 8 vuotta sitten
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 vuotta sitten
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 vuotta sitten
clienttools 42178124e1 HPCC-18501 Fix ESDL batch file path 7 vuotta sitten
cmake_modules 2779f77e3d HPCC-18380 WsSQL migration into platform 7 vuotta sitten
common 6a80d4a418 Merge pull request #10594 from ghalliday/issue18652 7 vuotta sitten
configuration 6221b46d45 HPCC-18652 Fix various windows compile warnings 7 vuotta sitten
dali 6a80d4a418 Merge pull request #10594 from ghalliday/issue18652 7 vuotta sitten
deploy c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 vuotta sitten
deployment 6221b46d45 HPCC-18652 Fix various windows compile warnings 7 vuotta sitten
docs c0de40410b Merge pull request #10578 from JamesDeFabia/HPCC-18628 7 vuotta sitten
ecl 4c23d7a2a3 Merge pull request #10575 from richardkchapman/payload5 7 vuotta sitten
ecllibrary 8a1a2a7e41 HPCC-18329 Only test chinese breakiterator semantics if ICU version >= 50 7 vuotta sitten
esp 2779f77e3d HPCC-18380 WsSQL migration into platform 7 vuotta sitten
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 vuotta sitten
initfiles 2779f77e3d HPCC-18380 WsSQL migration into platform 7 vuotta sitten
lib2 4d49720da6 HPCC-15414 clean lib2 for lib name changes on Mac OS 8 vuotta sitten
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 vuotta sitten
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue 8 vuotta sitten
plugins 1632bccefa HPCC-18536 Provide default value for the playProperties() property filter 7 vuotta sitten
roxie 6221b46d45 HPCC-18652 Fix various windows compile warnings 7 vuotta sitten
rtl 6696ea2e3e Merge pull request #10607 from richardkchapman/coverity-rtlrecord 7 vuotta sitten
services f9e38f2f46 HPCC-18585 Replace uses of rand() with fastRand() 7 vuotta sitten
system 4602753bd9 Merge pull request #10581 from ghalliday/issue18636 7 vuotta sitten
testing d1a514f81c HPCC-18617 Refactor Dali unit tests 7 vuotta sitten
thorlcr 73832d4080 Merge pull request #10291 from shamser/issue17699 7 vuotta sitten
tools 4ff9c50f88 Merge pull request #10615 from richardkchapman/coverity-null 7 vuotta sitten
.gitattributes 2241da2000 HPCC-17425 Various fixes for running HPCC in windows linux subsystem 8 vuotta sitten
.gitignore bcc5a9e382 HPCC-15799 Add minimal linting 9 vuotta sitten
.gitmodules 80bf0e00e9 Merge branch 'candidate-6.4.0' 8 vuotta sitten
.travis.yml e7f2dd8ae4 HPCC-17169 Silence/fix lint warnings 8 vuotta sitten
BUILD_ME.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md 10 vuotta sitten
CMakeLists.txt 2779f77e3d HPCC-18380 WsSQL migration into platform 7 vuotta sitten
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 14 vuotta sitten
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing 9 vuotta sitten
FUTURE b39eb133f9 Initial version of FUTURE document 14 vuotta sitten
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 vuotta sitten
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 vuotta sitten
README.md c0253694b9 Merge branch 'candidate-6.4.0' 8 vuotta sitten
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 10 vuotta sitten
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 vuotta sitten
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 12 vuotta sitten
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR 9 vuotta sitten
sourcedoc.xml c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 10 vuotta sitten
version.cmake 947fe84327 Split off candidate-6.4.0 8 vuotta sitten

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: