説明なし

Gavin Halliday c86254dcd1 Merge pull request #11621 from jakesmith/hpcc-20426 6 年 前
.github 90b33faa94 HPCC-18020 Add new option to pull request template 7 年 前
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 年 前
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 年 前
clienttools a571cf51db HPCC-17408 Add Clienttools bin to Windows PATH variable 6 年 前
cmake_modules 9d4a7f3939 Merge pull request #11569 from Michael-Gardner/HPCC-18913 6 年 前
common 0be6faf8bd HPCC-20369 Fix Roxie SSL connection logs assertion after timeout 6 年 前
configuration 143998aa51 HPCC-20114 Don't allow service binding to https with no certificate 6 年 前
dali 895f504ccb HPCC-20426 Refactor crypto helper and compile into jlib. 6 年 前
deploy 1b5f9692cb HPCC-19467 Fix problems with windows stand alone compiling 7 年 前
deployment 43b3bb34f1 HPCC-16183 Output slave port and slaves per node in configgen 6 年 前
devdoc 88e1bd2280 HPCC-20089 Rationalize and consolidate the developer documentation 6 年 前
docs 9a158c620b HPCC-20171 Update Landing Zones image 6 年 前
ecl b48dba12f8 Merge pull request #11636 from ghalliday/issue20402 6 年 前
ecllibrary 7c1df8268b Merge pull request #11607 from AttilaVamos/HPCC-19489-improvement-7.2.0 6 年 前
esp 6fe6f10e78 Merge pull request #11664 from kunalaswani/HPCC-19866new 6 年 前
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 年 前
initfiles 83586bf146 Merge pull request #11667 from kenrowland/HPCC-20424 6 年 前
lib2 4d49720da6 HPCC-15414 clean lib2 for lib name changes on Mac OS 8 年 前
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 年 前
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue 8 年 前
plugins 7c1df8268b Merge pull request #11607 from AttilaVamos/HPCC-19489-improvement-7.2.0 6 年 前
roxie ae2f9a6a5c HPCC-20282 Std.System.Job.User() fails on Roxie 6 年 前
rtl beabcf74c9 HPCC-20477 Add placeholders for 6.x functions so old dlls can be loaded 6 年 前
services f9e38f2f46 HPCC-18585 Replace uses of rand() with fastRand() 7 年 前
spark ffcc59865a HPCC-20168 Include Spark conditionally into platform package 6 年 前
system c86254dcd1 Merge pull request #11621 from jakesmith/hpcc-20426 6 年 前
testing c86254dcd1 Merge pull request #11621 from jakesmith/hpcc-20426 6 年 前
thorlcr 4e73f9cdbc Merge branch 'candidate-6.4.26' 6 年 前
tools ac6fef5196 HPCC-18183 Direct ESDL.exe errors to stderr 6 年 前
.gitattributes 2241da2000 HPCC-17425 Various fixes for running HPCC in windows linux subsystem 8 年 前
.gitignore 0550ad53ae HPCC-17851 New config manager core library 7 年 前
.gitmodules 800b472e3d HPCC-18512 Update ECL Watch stats to use WebPack 7 年 前
.travis.yml 9aba0126bc HPCC-18512 Switch to WebPack for ECL Watch build 7 年 前
BUILD_ME.md 3bc3658b09 HPCC-19312 Update to latest cassandra driver 7 年 前
CMakeLists.txt 9d4a7f3939 Merge pull request #11569 from Michael-Gardner/HPCC-18913 6 年 前
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 13 年 前
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing 8 年 前
FUTURE b39eb133f9 Initial version of FUTURE document 13 年 前
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 年 前
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 年 前
README.md c0253694b9 Merge branch 'candidate-6.4.0' 8 年 前
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 9 年 前
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 年 前
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 11 年 前
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR 8 年 前
package-lock.json 4a7ea76b58 HPCC-19093 Update to latest hpcc-js 7 年 前
version.cmake 88eda098e4 Community Edition 7.0.0-rc1 Release Candidate 1 6 年 前

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: